-
-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make Annotation.write_rttm follow most of RTTM specs #75
base: develop
Are you sure you want to change the base?
Conversation
Thanks. Would you mind sharing a link to the RTTM file format specification? |
Thank you for sharing and build this project. |
(sorry for the delay in getting back to you) It looks like RTTM files may contain much more than just Therefore, unless you convince me otherwise and we find a way to really map |
Hi, @hbredin. Don't worry about the delay, and thanks for taking the time to answer back. I'm with you. Maybe this request is too poor. I think The thing for me is that if the method |
I'd definitely consider a PR that covers all the specs (or at least RTTM specs vs.
|
RTTM | Annotation |
---|---|
type |
see below |
file |
annotation.uri |
chnl |
see below |
tbeg |
segment.start |
tdur |
segment.duration |
ortho |
N/A |
stype |
see below |
name |
label when type is SPEAKER |
conf |
N/A |
N/A = information is not provided by Annotation
About type
While track
is used to differentiate two identical segments (think: perfect overlap between two speakers), we could try to divert its use to provide a cue about what type
it is (while still allowing to differentiate two identical segments). Note, however, thattrack
is expected to be either a string or an int.
For instance, we could use track
with the following convention {type}_{original_track}
where type
can be any type between LEXEME
and SPEAKER
(see column Type of Table A.2) and original_track
allows to keep the original role of differentiating identical segments.
About subtype
Once we infer type
from track
,
- if
type
isA/P
orSPEAKER
,subtype
should be"<NA>"
- otherwise,
subtype
should belabel
.
About chnl
We could trick annotation.uri
into containing channel information (e.g. using {file}:{chnl}
convention)
What do you think?
The method
write_rttm()
only allow the type SPEAKER in the first field of the RTTM File.This pull request is for adding the type NON-SPEECH in field 1, if the label of the segment it's one of the 3 subtypes allowed in the RTTM File Format Specification (noise, music or other).