nAMI

Scripts for mixing acoustic events with AMI corpus MDM recordings.

The repository presents scripts for spatial reverberation of the noise database (simulation.py) and for mixing reverberated noise to the MDM8 data of the AMI corpus (add_noise_ami.py). The algorithms are discussed in paper [1]. For the noise database the standard Freesound database is utilized.

Corpus mixing has several stochastic components, thus one-to-one reproduction of the corpus, the evaluation results of which were presented in paper [1], is impossible. For one-to-one reproduction it is advised to use the logs of mixing, and apply the same augmentation and mixing parameters.

[1] S. Astapov, G. Svirskiy, A. Lavrentyev, T. Prisyach, D. Popov, D. Ubskiy, and V. Kabarov, "Acoustic Event Mixing to Multichannel AMI Data for Distant Speech Recognition and Acoustic Event Classification Benchmarking," In: Proc. Int. Conf. on Speech and Computer (SPECOM 2019), pp. 31-42, Sept. 2019. https://doi.org/10.1007/978-3-030-26061-3_4

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
noised_corpus_info		noised_corpus_info
speech_regions		speech_regions
LICENSE		LICENSE
README.md		README.md
add_noise_ami.py		add_noise_ami.py
average_lengths_of_wavs.py		average_lengths_of_wavs.py
config		config
noises_config		noises_config
noises_not_used_in_ami.txt		noises_not_used_in_ami.txt
silence_postions.py		silence_postions.py
simulated_files_info.txt		simulated_files_info.txt
simulation.py		simulation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nAMI

About

Releases

Packages

Languages

License

glebss/nAMI

Folders and files

Latest commit

History

Repository files navigation

nAMI

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages