You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Changelog
Added drag & drop for uploading files
Made docker compose buildable
Fixed null appearing in lemma and pos when exporting. Instead, we export empty strings. E.g. <w lemma="" pos="">. (This occurs when a tagger does not produce an annotation.)
Build CMDI-files using xpath instead of naive string substitution. (It would have broken if you called your corpus e.g. "YEAR".)
Add spaces to the plaintext of parsed tei documents where <w> follow each other without spaces. Old: <w>a</w><w>b</w> -> "ab"; New: -> "a b".
Fixed OTHER_POS only matching POS of length 1. E.g. matching 6 but not 630. (Relevant for the old Gysseling/CRM tagset).
Fix truncated punctuation matching throwing on empty strings.