?? AI Models

Trending AI models from Hugging Face � sorted by most downloads.

??? Speech Recognition
whisperkit-coreml

argmaxinc/whisperkit-coreml

? 9.9M?? 184
whisperkitcoremlwhisperasr
speaker-diarization-3.1

pyannote/speaker-diarization-3.1

? 9.8M?? 2.1K
pyannote-audiopyannotepyannote-audio-pipelineaudio
whisper-large-v3-turbo

openai/whisper-large-v3-turbo

? 8.1M?? 3.0K
transformerssafetensorswhisperautomatic-speech-recognition
whisper-large-v3

openai/whisper-large-v3

? 5.3M?? 5.8K
transformerspytorchjaxsafetensors
wav2vec2-large-xlsr-53-russian

jonatasgrosman/wav2vec2-large-xlsr-53-russian

? 3.5M?? 75
transformerspytorchjaxwav2vec2
wav2vec2-large-xlsr-53-portuguese

jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

? 3.1M?? 54
transformerspytorchjaxwav2vec2
speaker-diarization-community-1

pyannote/speaker-diarization-community-1

? 2.8M?? 440
pyannote-audiopyannotepyannote-audio-pipelineaudio
mms-300m-1130-forced-aligner

MahmoudAshraf/mms-300m-1130-forced-aligner

? 2.8M?? 90
transformerspytorchsafetensorswav2vec2
whisper-base

openai/whisper-base

? 2.6M?? 270
transformerspytorchtfjax
whisper-small

openai/whisper-small

? 2.5M?? 563
transformerspytorchtfjax
Qwen3-ASR-1.7B

Qwen/Qwen3-ASR-1.7B

? 2.0M?? 836
safetensorsqwen3_asrautomatic-speech-recognitionarxiv:2601.21337
voice-activity-detection

pyannote/voice-activity-detection

? 1.7M?? 235
pyannote-audiopyannotepyannote-audio-pipelineaudio
wav2vec2-large-xlsr-53-polish

jonatasgrosman/wav2vec2-large-xlsr-53-polish

? 1.6M?? 12
transformerspytorchjaxwav2vec2
wav2vec2-large-xlsr-53-japanese

jonatasgrosman/wav2vec2-large-xlsr-53-japanese

? 1.6M?? 57
transformerspytorchjaxwav2vec2
faster-whisper-base

Systran/faster-whisper-base

? 1.3M?? 27
ctranslate2audioautomatic-speech-recognitionen
Voxtral-Mini-4B-Realtime-2602

mistralai/Voxtral-Mini-4B-Realtime-2602

? 1.3M?? 861
vllmsafetensorsvoxtral_realtimemistral-common
wav2vec2-large-xlsr-53-chinese-zh-cn

jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn

? 1.3M?? 133
transformerspytorchjaxwav2vec2
wav2vec2-base-960h

facebook/wav2vec2-base-960h

? 1.2M?? 398
transformerspytorchtfsafetensors
faster-whisper-tiny.en

Systran/faster-whisper-tiny.en

? 1.2M?? 10
ctranslate2audioautomatic-speech-recognitionen
parakeet-tdt-0.6b-v3

mlx-community/parakeet-tdt-0.6b-v3

? 1.1M?? 42
mlxsafetensorsautomatic-speech-recognitionspeech
distil-large-v3

distil-whisper/distil-large-v3

? 929.8K?? 376
transformersjaxtensorboardonnx
faster-whisper-large-v3

Systran/faster-whisper-large-v3

? 927.2K?? 585
ctranslate2audioautomatic-speech-recognitionen
whisper-tiny

openai/whisper-tiny

? 917.3K?? 430
transformerspytorchtfjax
faster-whisper-small

Systran/faster-whisper-small

? 895.4K?? 33
ctranslate2audioautomatic-speech-recognitionen
👋 Need help with code?