Speech to text documentation
Speech to text from the Speech service, also known as speech recognition, enables real-time and batch transcription of audio streams into text. With additional reference text input, it also enables real-time pronunciation assessment and gives speakers feedback on the accuracy and fluency of spoken audio.
About speech to text
- What is real-time speech to text?
- What is batch speech to text?
- What is custom speech?
- Use the Speech CLI for speech to text with no code
- Get started with speech to text
- Try real-time diarization
Develop with speech to text
How-to guides
- Use the fast transcription API
- Create a custom speech project
- Train a model for custom speech
- Use compressed audio input formats
- Whisper model from OpenAI
- Improve accuracy with custom speech
- Display text formatting
- Language support
- Speech to text FAQ
- Speech to text pricing
Help and feedback
- Support and help options