Two Ways to Learn Audio Embeddings
Author(s): Edward Ma Originally published on Towards AI. Speech2Vec with Skip-gram and CBOW Photo by Álvaro Bernal on Unsplash Mel-frequency cepstral coefficients (MFCC), zero-crossing rate are some of classical feature for audio. It can be extracted via the library easily. However, it …