Speechmatics

Low-latency speech-to-text APIs powering multilingual, multi-speaker Voice AI

Free tier available · All audiences · API available

Key strengths

  • Sub-second real-time speech-to-text with high accuracy
  • 55+ language support covering over half the world's population
  • Flexible deployment: cloud, on-premises, and on-device
  • Enterprise-grade security: ISO 27001, GDPR, HIPAA, SOC 2 Type II certified
  • Specialized models for verticals like medical, legal, and contact centers

Free tier + paid plans
Cambridge, United Kingdom
Founded 2006
Self-hostable
  • Voice agent backends — Integrate sub-second STT and TTS into agent frameworks like LiveKit to build responsive, speaker-aware conversational AI pipelines across 55+ languages.
  • On-device transcription — Deploy quantized Speechmatics models locally within desktop applications (e.g., Adobe Premiere) for cloud-grade accuracy without network dependency.
  • Micro-batching workflows — Use the Speechmatics API to chunk and submit audio in micro-batches, stitch JSON results, and achieve near-real-time performance with batch-level control.
  • Contact center analytics — Stream call audio through the API with diarization enabled to extract per-speaker transcripts, sentiment signals, and interaction metadata at scale.
  • Medical transcription pipelines — Apply the specialized Medical Model to ambient scribe and dictation workflows, reducing errors on clinical terminology by up to 50%.
  • Live captioning infrastructure — Build real-time captioning systems for broadcast, sports, and news events using the low-latency WebSocket API with high-accuracy multilingual output.
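
The micro-batching workflow listed above (chunk the audio, submit each chunk, stitch the results) can be sketched roughly as follows. This is a minimal illustration, not the Speechmatics client: the `Word` record, the `Transcriber` callable, and both helper functions are hypothetical names introduced here, and the sketch assumes each chunk's result carries word timings relative to the start of that chunk. A real integration would replace the callable with an actual API request and parse the JSON it returns.

```python
from dataclasses import dataclass
from typing import Callable, Iterator, List, Tuple


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


# Hypothetical transcriber signature (an assumption, not a vendor API):
# given a (start, end) span of the source audio in seconds, return words
# timed relative to the start of that span.
Transcriber = Callable[[float, float], List[Word]]


def chunk_spans(total_seconds: float, chunk_seconds: float) -> Iterator[Tuple[float, float]]:
    """Yield consecutive, non-overlapping (start, end) spans covering the recording."""
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")
    start = 0.0
    while start < total_seconds:
        end = min(start + chunk_seconds, total_seconds)
        yield start, end
        start = end


def micro_batch_transcribe(
    total_seconds: float, chunk_seconds: float, transcribe: Transcriber
) -> List[Word]:
    """Transcribe chunk by chunk and stitch results onto one absolute timeline."""
    stitched: List[Word] = []
    for start, end in chunk_spans(total_seconds, chunk_seconds):
        for word in transcribe(start, end):
            # Shift chunk-relative timings by the chunk's offset in the recording.
            stitched.append(Word(word.text, word.start + start, word.end + start))
    return stitched
```

Passing the transcriber in as a callable keeps the chunking and stitching logic testable without network access. Fixed, non-overlapping cuts can split a word at a chunk boundary, so a production pipeline would likely cut on silence or overlap adjacent chunks and de-duplicate the seam.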