Speechmatics

Low-latency speech-to-text APIs powering multilingual, multi-speaker Voice AI

Free tier available · All audiences · API available

Key strengths

Sub-second real-time speech-to-text with high accuracy
55+ language support covering over half the world's population
Flexible deployment: cloud, on-premises, and on-device
Enterprise-grade security: ISO 27001, GDPR, HIPAA, SOC 2 Type II certified
Specialized models for verticals like medical, legal, and contact centers

Free tier + paid plans
Cambridge, United Kingdom
Founded 2006
Self-hostable

Speechmatics exposes REST and WebSocket APIs that support both real-time (streaming) and batch transcription workflows, including a micro-batching pattern that bridges the two. The platform can be deployed in the cloud, on-premises, or on-device, which allows edge inference on hardware as constrained as a laptop, as demonstrated in the Adobe Premiere integration. It offers native integrations with popular agent frameworks (e.g., LiveKit), a Medical Model that cuts errors on clinical terminology by up to 50%, and speaker diarization for multi-speaker conversations. Responses are returned as structured JSON, and the API accepts configuration options including language selection, speaker labels, and custom vocabulary.
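
As a rough illustration of the configuration options mentioned above (language selection, speaker labels, custom vocabulary), the following Python sketch assembles a JSON request body. The helper name `build_transcription_config` is hypothetical, and the field names (`type`, `transcription_config`, `language`, `diarization`, `additional_vocab`) are assumptions not confirmed by this listing. Check them against the official API reference before use. The sketch makes no network call.

```python
import json


def build_transcription_config(language, speaker_labels=False, custom_vocabulary=None):
    """Assemble a JSON-serializable job configuration.

    All field names here are assumptions made for illustration only.
    """
    transcription = {"language": language}
    if speaker_labels:
        # Assumed switch for per-speaker labels (diarization).
        transcription["diarization"] = "speaker"
    if custom_vocabulary:
        # Assumed shape: one object per custom vocabulary term.
        transcription["additional_vocab"] = [
            {"content": term} for term in custom_vocabulary
        ]
    return {"type": "transcription", "transcription_config": transcription}


config = build_transcription_config(
    "en",
    speaker_labels=True,
    custom_vocabulary=["Speechmatics", "LiveKit"],
)

# Serialize to the JSON text that would be sent as the request body.
payload = json.dumps(config, indent=2)
print(payload)
```

Keeping the configuration as a plain dictionary makes it easy to reuse the same settings for a batch request body or for the first message of a streaming session, whichever transport the integration uses.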