Requirements

by Marc C. Green

A clustering approach to sound source tracking in Ambisonic audio. This modules contains code for:

Spherical harmonic eamforming using Plane Wave Decomposition and Cross-pattern coherence beams.
Rotation of non axis-symmetric spherical functions using Wigner-D matrices.
Fibonacci, regular and geodesic spherical sampling schemes.
Clustering (DBSCAN) and regression (SVR) for estimating coherent sound sources from power maps.
find_sources wrapper function for automating estimation of source trajectories from an Ambisonic audio file.
Functions to plot outputs.
Implementations of Frame Recall and DOA Error performance metrics from DCASE 2019.

find_sources(input, *args, **kwargs)

input should be a path to an Ambisonic audio file.

*kwargs passed to sph_peaks_t:

max_n_peaks=20 - the maximum number of peaks that will be saved per frame.
audio_length_seconds=None - optional variable replacing output frame numbers with time in seconds.

*args passed to obj_trajectories:

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
README.md		README.md
area_plots.py		area_plots.py
batch_srp.py		batch_srp.py
metrics.py		metrics.py
postsrp.py		postsrp.py
shbeamforming.py		shbeamforming.py
spherical_sampling.py		spherical_sampling.py
text_formatting.py		text_formatting.py
utilities.py		utilities.py