v.2.1 Missing @cite values in urn:cts:greekLit:tlg0012.tlg001.perseus-grc1.tb #27

Open
Eumaeus opened this issue Jun 26, 2020 · 4 comments

Eumaeus commented Jun 26, 2020

The @cite attribute is empty whenever the token is a punctuation mark. If punctuation is part of the Edition, it belongs to the citable passages just as much as any word token.
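
To make the report concrete, here is a minimal sketch of how the affected tokens could be listed. It assumes an AGDT-style layout with `<sentence>` and `<word>` elements carrying `id`, `form` and `cite` attributes; the sample XML, its URNs and the function name `tokens_missing_cite` are invented for illustration and are not taken from the repository.

```python
import xml.etree.ElementTree as ET

# Invented sample; element names, attribute names and URNs are assumptions.
SAMPLE = """<treebank>
  <sentence id="1">
    <word id="1" form="μῆνιν" cite="urn:cts:greekLit:tlg0012.tlg001:1.1"/>
    <word id="2" form="ἄειδε" cite="urn:cts:greekLit:tlg0012.tlg001:1.1"/>
    <word id="3" form="," cite=""/>
  </sentence>
</treebank>"""


def tokens_missing_cite(xml_text):
    """Return (sentence id, word id, form) for every token whose cite is empty or absent."""
    root = ET.fromstring(xml_text)
    missing = []
    for sentence in root.iter("sentence"):
        for word in sentence.iter("word"):
            # Treat a missing attribute and a whitespace-only value the same way.
            if not (word.get("cite") or "").strip():
                missing.append((sentence.get("id"), word.get("id"), word.get("form")))
    return missing


missing = tokens_missing_cite(SAMPLE)
print(missing)
```

Run against a real treebank file, a check like this would show whether the empty values are limited to punctuation tokens or affect other tokens as well.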

@francescomambrini

Assigning a @cite attribute to punctuation would also help in reconstructing the text from the treebank. So definitely +1 to that!
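
A rough sketch of why this matters for reconstruction: if tokens are grouped by their `cite` value, any token with an empty `cite` cannot be placed in a passage and drops out of the rebuilt text. The `<word>` layout, the sample data and the helper `text_by_passage` are assumptions made for illustration, and the naive space-joining ignores proper spacing around punctuation.

```python
import xml.etree.ElementTree as ET

# Invented URN and tokens, for illustration only.
URN = "urn:cts:greekLit:tlg0012.tlg001:1.1"
TEMPLATE = """<treebank><sentence id="1">
  <word id="1" form="μῆνιν" cite="{urn}"/>
  <word id="2" form="ἄειδε" cite="{urn}"/>
  <word id="3" form="," cite="{punct}"/>
</sentence></treebank>"""


def text_by_passage(xml_text):
    """Group token forms by cite value, keeping document order within each passage."""
    passages = {}
    for word in ET.fromstring(xml_text).iter("word"):
        cite = (word.get("cite") or "").strip()
        if not cite:
            continue  # an uncited token cannot be assigned to any passage
        passages.setdefault(cite, []).append(word.get("form"))
    return {cite: " ".join(forms) for cite, forms in passages.items()}


with_punct = text_by_passage(TEMPLATE.format(urn=URN, punct=URN))
without_punct = text_by_passage(TEMPLATE.format(urn=URN, punct=""))
print(with_punct[URN])     # the comma is kept
print(without_punct[URN])  # the comma is lost
```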

Contributor

gcelano commented Jun 26, 2020

I do not remember introducing @cite myself; rather, I kept the values wherever they were already present. @balmas, was @cite assigned in Arethusa?

Contributor

balmas commented Jun 26, 2020

I don't believe this was done by Perseids or Arethusa. At least I can't find any reference to it in the code.

Contributor

gcelano commented Jun 30, 2020

In any case, yes, CTS references should also be applied to punctuation marks. This is what I am doing:

https://git.informatik.uni-leipzig.de/celano/latinnlp/-/tree/master/temporary

Is there any new reference style for @cite? The one I am using can be inferred from, for example:

https://git.informatik.uni-leipzig.de/celano/latinnlp/-/blob/master/temporary/phi0588.abo005.perseus-lat2/phi0588.abo005.perseus-lat2.tok01.xml

but this can be transformed into something more easily readable.


4 participants