?? AI Models
Trending AI models from Hugging Face � sorted by most downloads.
All Categories?? Text Generation?? Text to Image??? Image Classification??? Text Classification?? Translation?? Summarization??? Speech Recognition?? Object Detection
??? Speech Recognition
speaker-diarization-3.1
pyannote/speaker-diarization-3.1
? 9.8M?? 2.1K
pyannote-audiopyannotepyannote-audio-pipelineaudio
whisper-large-v3-turbo
openai/whisper-large-v3-turbo
? 8.1M?? 3.0K
transformerssafetensorswhisperautomatic-speech-recognition
wav2vec2-large-xlsr-53-russian
jonatasgrosman/wav2vec2-large-xlsr-53-russian
? 3.5M?? 75
transformerspytorchjaxwav2vec2
wav2vec2-large-xlsr-53-portuguese
jonatasgrosman/wav2vec2-large-xlsr-53-portuguese
? 3.1M?? 54
transformerspytorchjaxwav2vec2
speaker-diarization-community-1
pyannote/speaker-diarization-community-1
? 2.8M?? 440
pyannote-audiopyannotepyannote-audio-pipelineaudio
mms-300m-1130-forced-aligner
MahmoudAshraf/mms-300m-1130-forced-aligner
? 2.8M?? 90
transformerspytorchsafetensorswav2vec2
Qwen3-ASR-1.7B
Qwen/Qwen3-ASR-1.7B
? 2.0M?? 836
safetensorsqwen3_asrautomatic-speech-recognitionarxiv:2601.21337
voice-activity-detection
pyannote/voice-activity-detection
? 1.7M?? 235
pyannote-audiopyannotepyannote-audio-pipelineaudio
wav2vec2-large-xlsr-53-polish
jonatasgrosman/wav2vec2-large-xlsr-53-polish
? 1.6M?? 12
transformerspytorchjaxwav2vec2
wav2vec2-large-xlsr-53-japanese
jonatasgrosman/wav2vec2-large-xlsr-53-japanese
? 1.6M?? 57
transformerspytorchjaxwav2vec2
faster-whisper-base
Systran/faster-whisper-base
? 1.3M?? 27
ctranslate2audioautomatic-speech-recognitionen
Voxtral-Mini-4B-Realtime-2602
mistralai/Voxtral-Mini-4B-Realtime-2602
? 1.3M?? 861
vllmsafetensorsvoxtral_realtimemistral-common
wav2vec2-large-xlsr-53-chinese-zh-cn
jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn
? 1.3M?? 133
transformerspytorchjaxwav2vec2
wav2vec2-base-960h
facebook/wav2vec2-base-960h
? 1.2M?? 398
transformerspytorchtfsafetensors
faster-whisper-tiny.en
Systran/faster-whisper-tiny.en
? 1.2M?? 10
ctranslate2audioautomatic-speech-recognitionen
parakeet-tdt-0.6b-v3
mlx-community/parakeet-tdt-0.6b-v3
? 1.1M?? 42
mlxsafetensorsautomatic-speech-recognitionspeech
distil-large-v3
distil-whisper/distil-large-v3
? 929.8K?? 376
transformersjaxtensorboardonnx
faster-whisper-large-v3
Systran/faster-whisper-large-v3
? 927.2K?? 585
ctranslate2audioautomatic-speech-recognitionen
faster-whisper-small
Systran/faster-whisper-small
? 895.4K?? 33
ctranslate2audioautomatic-speech-recognitionen