Wav2CLIP: Learning Robust Audio Representations From CLIP.
Project Links
Meta
Author: Ho-Hsiang Wu
Classifiers
Intended Audience
- Developers
- Education
- Science/Research
Natural Language
- English
Programming Language
- Python :: 3.7
Topic
- Artistic Software
- Multimedia
- Multimedia :: Sound/Audio
- Multimedia :: Sound/Audio :: Editors
- Software Development :: Libraries
Wav2CLIP
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
Installation
pip install wav2clip
Usage
Clip-Level Embeddings
import wav2clip
model = wav2clip.get_model()
embeddings = wav2clip.embed_audio(audio, model)
Frame-Level Embeddings
import wav2clip
model = wav2clip.get_model(frame_length=16000, hop_length=16000)
embeddings = wav2clip.embed_audio(audio, model)