New · Speaker diarization v2

Transcribe audio & video
in minutes, not hours.

Accurate AI transcription for podcasts, interviews, and meetings. 100+ languages, speaker labels, and exports to SRT, DOCX, and JSON.

Drop a file or click to upload

Up to 5 GB · MP3, MP4, WAV, MOV, M4A, WEBM

Upload

Free tier · 30 min/month No credit card

interview-03.mp3

SPEAKER 1 · 00:14

So the thing about good transcription is it shouldn't feel like work — you upload, you wait a minute, and it's just... done.

SPEAKER 2 · 00:22

Right. And the export to SRT is what saved us last week▍

EN · auto2 speakers● transcribing 48%

boro

on it.

Features

Everything you need to turn talk into text.

Built for podcasters, journalists, and product teams who need transcripts they can actually trust.

Fast

A 60-minute file in under 3 minutes. Streaming output for long files.

Accurate

Word error rate under 6% on clean English audio, with custom vocabulary.

Secure

Files encrypted at rest and in transit. Auto-delete after 24 hours.

100+ languages

Auto-detection or pin a language. Mixed-language audio supported.

How it works

Three steps. Then a transcript.

Try it now

STEP 01
Upload
Drop in a file, paste a URL, or send via API. Up to 5 GB per file.
interview-03.mp3 · 84 MB
STEP 02
Transcribe
Our model processes the audio with speaker diarization and timestamps.
STEP 03
Export
Download as TXT, DOCX, SRT, VTT, or JSON. Or pipe into your tools.
.txt.docx.srt.json

Pricing

Cheap. Accurate. No subscription traps.

High-quality audio and video transcription with honest per-minute billing.

Pay as you go: 4 ₽/min. 700-minute pack: 1.5 ₽/min.

Free

$0/mo

Try AI speech-to-text with no card required.

· 30 transcription minutes / month
· Files up to 1 GB
· Export to TXT, SRT, VTT, DOCX

Start free

Pay as you go

4 ₽/min

For one-off files and irregular transcription work.

· Pay only for processed minutes
· Files up to 1 GB
· Speaker diarization and timecodes
· All export formats

Start per minute

BEST VALUE

700-minute pack

1 050 ₽/pack

For regular interviews, meetings and podcasts.

· 700 transcription minutes
· 1.5 ₽ per minute inside the pack
· Files up to 5 GB
· AI cleanup, speakers and timecodes

Buy pack

FAQ

Common questions.

What languages do you support?+

100+ languages including English, Spanish, French, German, Russian, Mandarin, Japanese, Arabic, and Hindi. Auto-detection is on by default.

How accurate is the transcription?+

For clean studio English audio our word error rate stays under 6%. Noisy or heavily-accented audio lands between 8-15% — comparable to top-tier models on the market.

Is my data private?+

Files are encrypted at rest and in transit, and auto-deleted after 24 hours unless you opt to keep them. We never use your audio to train our models.

Do you have an API?+

Yes. REST and webhook-based access is available for paid usage. SDKs for JavaScript, Python, and Go.

Ready to ship cleaner transcripts?

Start free with 30 minutes. Upgrade only when you need more.

Start transcribing See pricing

Transcribe audio & videoin minutes, not hours.