Make Annotation.write_rttm follow most of RTTM specs #75

JMasr · 2022-01-05T13:22:28Z

The method write_rttm() only allow the type SPEAKER in the first field of the RTTM File.

This pull request is for adding the type NON-SPEECH in field 1, if the label of the segment it's one of the 3 subtypes allowed in the RTTM File Format Specification (noise, music or other).

hbredin · 2022-01-05T13:56:38Z

Thanks. Would you mind sharing a link to the RTTM file format specification?

JMasr · 2022-01-05T14:03:33Z

Thank you for sharing and build this project.
Of course, in this NIST's paper in the Appendix A you can find the RTTM File Format Specification.

hbredin · 2022-01-18T13:15:16Z

(sorry for the delay in getting back to you)

It looks like RTTM files may contain much more than just SPEAKER and NON-SPEECH (column Type of Table A.2).
Also, there is no clear correspondance between pyannote.core.Annotation labels and RTTM type, subtype, and name fields.

Therefore, unless you convince me otherwise and we find a way to really map Annotation to the RTTM specs, I probably won't merge this PR.

JMasr · 2022-01-18T23:29:05Z

Hi, @hbredin. Don't worry about the delay, and thanks for taking the time to answer back.

I'm with you. Maybe this request is too poor. I think pyannote.core.Annotation is very useful for VAD, SAD, and SPK-Diarization. If we figure out a way to map better with the RTTM specs, it could be equally useful for Acoustic Events Detection or Rich Transcription.

The thing for me is that if the method pyannote.core.Annotation.write_rttm only prints with the subtype SPEAKER I can't include acoustics events such as music in the annotation. Maybe a refactoring that covers all the specs will be better. What do you think?

hbredin · 2022-01-19T08:23:30Z

I'd definitely consider a PR that covers all the specs (or at least STT and MDE categories).

RTTM specs vs. `Annotation`

There is not a 100% correspondance between RTTM specs and what Annotation can handle.

for segment, track, label in annotation.itertracks(yield_label=True):
    pass

RTTM	`Annotation`
`type`	see below
`file`	`annotation.uri`
`chnl`	see below
`tbeg`	`segment.start`
`tdur`	`segment.duration`
`ortho`	N/A
`stype`	see below
`name`	`label` when `type` is `SPEAKER`
`conf`	N/A

N/A = information is not provided by Annotation

About `type`

While track is used to differentiate two identical segments (think: perfect overlap between two speakers), we could try to divert its use to provide a cue about what type it is (while still allowing to differentiate two identical segments). Note, however, thattrack is expected to be either a string or an int.

For instance, we could use track with the following convention {type}_{original_track} where type can be any type between LEXEME and SPEAKER (see column Type of Table A.2) and original_track allows to keep the original role of differentiating identical segments.

About `subtype`

Once we infer type from track,

if type is A/P or SPEAKER, subtype should be "<NA>"
otherwise, subtype should be label.

About `chnl`

We could trick annotation.uri into containing channel information (e.g. using {file}:{chnl} convention)

What do you think?

Adding the type **NON-SPEECH** to the RTTM file writer method

09b694f

hbredin changed the title ~~Adding the type NON-SPEECH to the RTTM file writer method~~ Make Annotation.write_rttm follows most of RTTM specs Jan 19, 2022

hbredin changed the title ~~Make Annotation.write_rttm follows most of RTTM specs~~ Make Annotation.write_rttm follow most of RTTM specs Jan 19, 2022

hbredin mentioned this pull request Aug 29, 2022

Add support for SPKR-INFO lines in load_rttm pyannote/pyannote-database#86

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Annotation.write_rttm follow most of RTTM specs #75

Make Annotation.write_rttm follow most of RTTM specs #75

JMasr commented Jan 5, 2022

hbredin commented Jan 5, 2022

JMasr commented Jan 5, 2022

hbredin commented Jan 18, 2022

JMasr commented Jan 18, 2022

hbredin commented Jan 19, 2022 •

edited

Loading

Make Annotation.write_rttm follow most of RTTM specs #75

Are you sure you want to change the base?

Make Annotation.write_rttm follow most of RTTM specs #75

Conversation

JMasr commented Jan 5, 2022

hbredin commented Jan 5, 2022

JMasr commented Jan 5, 2022

hbredin commented Jan 18, 2022

JMasr commented Jan 18, 2022

hbredin commented Jan 19, 2022 • edited Loading

RTTM specs vs. Annotation

About type

About subtype

About chnl

hbredin commented Jan 19, 2022 •

edited

Loading

RTTM specs vs. `Annotation`

About `type`

About `subtype`

About `chnl`