🎙�� Speech Recognition

voice-activity-detection

pyannote/voice-activity-detection

Get AI Model →
1.7M
Downloads
�����
235
Likes
�����
14
Tags
📦
pyannote-audio
Library
Model Details
Full Model IDpyannote/voice-activity-detection
Pipeline / Taskautomatic-speech-recognition
Librarypyannote-audio
Downloads (all-time)1.7M
Likes235
Last Modified5/10/2024
Author / Orgpyannote
PrivateNo � public
⚡ Quick Usage (Python)

Using the 🤗 Transformers library. Install with pip install transformers

from transformers import pipeline

# Load the model
pipe = pipeline("automatic-speech-recognition", model="pyannote/voice-activity-detection")

# Run inference
result = pipe("Your input here")
print(result)
����� Tags
pyannote-audiopyannotepyannote-audio-pipelineaudiovoicespeechspeakervoice-activity-detectionautomatic-speech-recognitiondataset:amidataset:diharddataset:voxconverselicense:mitregion:us
More Speech Recognition Models
See all →
whisperkit-coreml

argmaxinc/whisperkit-coreml

9.9M����� 184
Get AI Model →
speaker-diarization-3.1

pyannote/speaker-diarization-3.1

9.8M����� 2.1K
Get AI Model →
whisper-large-v3-turbo

openai/whisper-large-v3-turbo

8.1M����� 3.0K
Get AI Model →
🚀 Use This Model

Access model files, inference API, and full documentation on Hugging Face.

Open on Hugging Face →Browse Model Files ↗�� Browse All Models
🎙�� Task: Speech Recognition

This model is designed for the Speech Recognition task. Explore more models for this use case.

All Speech Recognition Models →
📊 Popularity
Downloads1.7M
����� Community Likes235
🛠�� Requirements
  • Install: pip install pyannote-audio
  • Python 3.8+ recommended for Transformers.
  • GPU (CUDA) speeds up inference significantly.
  • Use model.half() for fp16 on limited VRAM.
👋 Need help with code?