Support for AST Model #726

TheRealSal · 2024-07-25T20:10:59Z

🌟 New adapter setup

Model description

The Audio Spectrogram Transformer model was proposed in AST: Audio Spectrogram Transformer by Yuan Gong, Yu-An Chung, James Glass. The Audio Spectrogram Transformer applies a Vision Transformer to audio, by turning audio into an image (spectrogram).

Open source status

the model implementation is available: available in the HF Transformers library. Original Implementation: https://github.com/YuanGongND/ast
the model weights are available: "MIT/ast-finetuned-audioset-10-10-0.4593"
who are the authors: Yuan Gong @YuanGongND , Yu-An Chung and James Glass

TheRealSal added the enhancement New feature or request label Jul 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for AST Model #726

Support for AST Model #726

TheRealSal commented Jul 25, 2024

Support for AST Model #726

Support for AST Model #726

Comments

TheRealSal commented Jul 25, 2024

🌟 New adapter setup

Model description

Open source status