Whisper is an open-source automatic speech recognition (ASR) system from OpenAI, trained on 680,000 hours of multilingual audio collected from the web. It approaches human-level robustness and accuracy on English speech and also supports 99 other languages. The model and code are available on GitHub, and a hosted version is offered through the OpenAI API.

    Category

    Audio & Music

    Subcategory

    Speech-to-Text

    // ACCESS METHODS

    API · CLI

    // COMPLIANCE

    SOC2 · ISO27001 · GDPR · HIPAA

    ● certified · ○ not verified

    // DATA STORAGE

    Region

    Trains on Data

    Self-hostable

    Yes

    // PRICING DETAIL

    Free Tier

    Free open-source model

    Paid Plans

    Not available yet

    API Cost

    $0.006/min via OpenAI API
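The per-minute rate listed above makes API cost easy to estimate from audio duration. The following is a minimal sketch, assuming cost scales linearly with duration at $0.006 per minute; the function name `estimate_api_cost` and the four-decimal rounding are illustrative choices, not part of any OpenAI library.

```python
from decimal import Decimal

# Rate taken from the listing above (USD per minute of audio).
RATE_PER_MINUTE = Decimal("0.006")


def estimate_api_cost(duration_seconds: float) -> Decimal:
    """Estimate the transcription cost in USD for audio of the given length.

    Assumes cost is strictly proportional to duration (an assumption of this
    sketch); Decimal is used to avoid binary floating-point rounding surprises.
    """
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return (minutes * RATE_PER_MINUTE).quantize(Decimal("0.0001"))


if __name__ == "__main__":
    # One hour of audio at the listed rate.
    print(estimate_api_cost(3600))
```

At this rate, an hour of audio comes to roughly 36 cents, which is a useful baseline when weighing the hosted API against self-hosting the open-source model.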

    // MORE IN SPEECH-TO-TEXT

    AssemblyAI
    Freemium
    Audio & Music · Speech-to-Text
    #speech-to-text api · #transcription api · #audio intelligence

    Deepgram
    Freemium
    Audio & Music · Speech-to-Text
    #speech recognition api · #real-time transcription · #voice ai

    Rev AI
    Freemium
    Audio & Music · Speech-to-Text
    #transcription api · #speech-to-text · #captions

    // USE CASES

    Transcribing audio files offline and locally
    Building speech-to-text applications with API
    Converting multilingual audio to text
    Generating closed captions for video content
    Integrating speech recognition into custom workflows
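The local transcription and closed-caption use cases can be combined in a short script. The sketch below assumes the open-source `openai-whisper` Python package is installed and that its `transcribe` result includes a `segments` list whose items carry `start`, `end`, and `text` keys. The helper names (`format_timestamp`, `segments_to_srt`, `transcribe_to_srt`), the `"base"` model choice, and the file name in the usage note are illustrative assumptions.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Render timed segments (dicts with start, end, text) as SRT captions."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # A blank line separates consecutive caption blocks.
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "base") -> str:
    """Transcribe a local file with Whisper and return SRT text.

    The import is deferred so the formatting helpers above can be used
    without the (large) whisper dependency installed.
    """
    import whisper  # provided by the openai-whisper package

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_srt(result["segments"])
```

A call such as `transcribe_to_srt("talk.mp3")` (a hypothetical file name) runs entirely on the local machine, so no audio leaves it. Keeping the SRT formatting separate from the model call also means the same helpers could format segments obtained from another source.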