Whisper Variants Comparison: What Are Their Features And How To Implement Them?
Author(s): Yuki Shizuya Originally published on Towards AI. Photo by Pawel Czerwinski on Unsplash Recently, I research automatic speech recognition (ASR) to make transcription from speech data. When it comes to an open-source ASR model, Whisper [1], which is developed by OpenAI, …
Vision Embedding Comparison for Image Similarity Search: EfficientNet vs. ViT vs. VINO vs. CLIP vs. BLIP2
Author(s): Yuki Shizuya Originally published on Towards AI. Photo by gilber franco on Unsplash Recently, I needed to research image similarity search. I wonder if there are any differences among embeddings based on the architecture training methods. However, few blogs compare embeddings …