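Whisper is an open-source automatic speech recognition (ASR) system from OpenAI, trained on 680,000 hours of multilingual audio collected from the web. OpenAI reports that it approaches human-level robustness and accuracy on English speech recognition; it also transcribes speech in close to 100 languages in total, with accuracy that varies by language. The model is available on GitHub for local use and as a hosted service through the OpenAI API.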
Category
Audio & Music
Subcategory
Speech-to-Text
// ACCESS METHODS
API · CLI
// COMPLIANCE
○ SOC2 · ○ ISO27001 · ○ GDPR · ○ HIPAA
● certified · ○ not verified
// DATA STORAGE
Region
—
Trains on Data
—
Self-hostable
Yes
// PRICING DETAIL
Free Tier
Free open-source model
Paid Plans
Not available yet
API Cost
$0.006/min via OpenAI API
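As a rough illustration of what the per-minute rate above means in practice, the following sketch estimates the cost of transcribing a file of known length. It assumes billing is strictly proportional to audio duration; the actual rounding and billing granularity are not stated in this listing, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Rate quoted in this listing: $0.006 per minute of audio.
RATE_PER_MINUTE = Decimal("0.006")


def estimate_cost(duration_seconds, rate_per_minute=RATE_PER_MINUTE):
    """Estimate the API cost in USD for one audio file.

    Assumption: cost scales linearly with duration (no minimum charge,
    no rounding up to whole seconds or minutes).
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    # Keep four decimal places so short clips do not round to zero.
    return (minutes * rate_per_minute).quantize(Decimal("0.0001"))


if __name__ == "__main__":
    for label, seconds in [("5-minute clip", 300), ("1-hour recording", 3600)]:
        print(f"{label}: ${estimate_cost(seconds)}")
```

`Decimal` is used instead of floats so that small per-minute amounts add up without binary rounding noise. For self-hosted use of the open-source model there is no per-minute fee, only the cost of the hardware it runs on.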
// MORE IN SPEECH-TO-TEXT
AssemblyAI · Audio & Music · Speech-to-Text
#speech-to-text api · #transcription api · #audio intelligence
Deepgram · Audio & Music · Speech-to-Text
#speech recognition api · #real-time transcription · #voice ai
Rev AI · Audio & Music · Speech-to-Text
#transcription api · #speech-to-text · #captions
// USE CASES
Transcribing audio files offline and locally
Building speech-to-text applications with the API
Converting multilingual audio to text
Generating closed captions for video content
Integrating speech recognition into custom workflows
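The local transcription and captioning use cases can be combined in a short sketch. It assumes the open-source `openai-whisper` Python package and `ffmpeg` are installed, and relies on the `segments` list (entries with `start`, `end`, and `text`) returned by the model's `transcribe` method. The helper names `format_timestamp`, `segments_to_srt`, and `transcribe_to_srt`, and the choice of the `base` model, are illustrative rather than part of Whisper.

```python
def format_timestamp(seconds):
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render segments (dicts with 'start', 'end', 'text') as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Each cue already ends in a newline; joining with "\n" leaves the
    # blank line that separates cues in an SRT file.
    return "\n".join(blocks)


def transcribe_to_srt(audio_path, model_name="base"):
    """Transcribe a local file with Whisper and return SRT captions."""
    # Imported lazily so the formatting helpers work without the package.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_srt(result["segments"])


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello world."},
        {"start": 2.5, "end": 5.0, "text": " This is a caption."},
    ]
    print(segments_to_srt(demo))
```

Keeping the SRT formatting separate from the transcription call means the same helpers could be fed segments from another source, such as a hosted API response, without changes.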
