Pricing
- Input tokens0.083000000000 tokens
- Output tokens0.089000000000 tokens
- Context windowPENDING INFORMATION
- ThroughputPENDING INFORMATION
Whisper-1 is OpenAI’s premier speech-to-text model, accessible via their API and optimized for high-accuracy, multilingual audio processing. Built on an encoder-decoder Transformer architecture, it was trained on 680,000 hours of diverse, weakly supervised web data, making it exceptionally robust against background noise, various accents, and technical jargon. Whisper-1 functions as a multitask system capable of automatic speech recognition (ASR), language identification, and seamless speech translation from dozens of languages into English. It is widely recognized for delivering near-human-level transcription and is the industry standard for creating accessible, searchable, and translated audio content.