AI voice transcription for sales teams.
Prospects record a 60-second voice note. HeySpeak transcribes it in seconds and writes a context-aware summary. EU-hosted by default, no call needed.
The short answer
Why sales teams stopped trusting generic transcription
Most transcription tools were built for meetings. You record a 45-minute call, the tool spits out a wall of text, and someone has to read it later to find the one thing that mattered. That works for retros. It does not work for early-funnel prospect feedback, where the question is small and the time pressure is real.
The other problem is jurisdiction. Sales teams selling into the EU have to answer the GDPR question every time they pick a vendor. Most call-recording stacks store and process audio in the US, and the legal review takes longer than the integration. HeySpeak transcribes with Mistral Voxtral on EU infrastructure, which takes that conversation off the table by default for the primary path.
To be honest about the limits: transcription quality depends on audio clarity. A prospect recording from a quiet office gets a near-perfect transcript. A prospect on a windy street with traffic noise gets a transcript with gaps. The model is good. It is not magic.
The workflow
What happens between the prospect tapping send and you reading the summary.
1. Prospect records a 60-second voice note
You send a Magic Link with one question. The prospect opens it on any phone or laptop, taps record, and speaks for up to sixty seconds. No app, no login, no account. The audio uploads to a private Cloudflare R2 bucket the moment they hit send.
2. Voxtral transcribes within seconds
Mistral Voxtral runs the transcription on EU-hosted servers. Most 60-second notes finish transcribing in under ten seconds. If a request fails, the system retries automatically up to three times before marking the response for manual review. All transcription stays on Mistral AI infrastructure.
3. Mistral Small writes the summary
Once the transcript exists, Mistral Small reads it against the question you asked and produces a one-line summary plus the intent signals worth flagging: pricing concern, competitor mention, decision timeline, blocker. The summary is context-aware, not a generic abstract.
4. Sales rep reads the result in the dashboard
You open the dashboard. Each response shows the summary, the full transcript, and the original audio behind a 1-hour signed URL. Scan ten responses in a few minutes. Listen to the two that surprised you. Reply or queue the next move.
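For the technically curious, here is a minimal sketch of what a 1-hour signed URL means in practice. It is an illustration only, not HeySpeak's code: `sign_url`, `verify_url`, the secret, and the example URL are all hypothetical, and the sketch uses a plain HMAC rather than the S3-compatible presigning that Cloudflare R2 itself offers. The idea is the same either way: the link carries an expiry time and a signature, and stops working once the hour is up.

```python
import hashlib
import hmac
import time
from typing import Optional
from urllib.parse import parse_qs, urlencode, urlparse

URL_TTL_SECONDS = 3600  # the 1-hour lifetime described in the workflow


def sign_url(base_url: str, secret: bytes, now: Optional[float] = None,
             ttl: int = URL_TTL_SECONDS) -> str:
    """Return base_url with an expiry timestamp and an HMAC signature appended."""
    issued = time.time() if now is None else now
    expires = int(issued + ttl)
    payload = f"{base_url}?expires={expires}".encode()
    sig = hmac.new(secret, payload, hashlib.sha256).hexdigest()
    return f"{base_url}?{urlencode({'expires': expires, 'sig': sig})}"


def verify_url(url: str, secret: bytes, now: Optional[float] = None) -> bool:
    """Accept the URL only if the signature matches and it has not expired."""
    parsed = urlparse(url)
    params = parse_qs(parsed.query)
    try:
        expires = int(params["expires"][0])
        sig = params["sig"][0]
    except (KeyError, IndexError, ValueError):
        return False  # missing or malformed parameters
    base_url = f"{parsed.scheme}://{parsed.netloc}{parsed.path}"
    payload = f"{base_url}?expires={expires}".encode()
    expected = hmac.new(secret, payload, hashlib.sha256).hexdigest()
    current = time.time() if now is None else now
    # Constant-time comparison avoids leaking signature bytes through timing.
    return hmac.compare_digest(sig, expected) and current < expires
```

A link signed this way cannot be extended by editing the `expires` value, because changing it invalidates the signature.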
The stack, in plain terms
No black box. Two named models and a retry layer, each doing one job.
- Mistral Voxtral. Primary transcription engine. EU-hosted. Handles audio in seven or more European languages with automatic detection. All recordings go through Voxtral.
- Retry logic. If Voxtral fails or times out, the system retries automatically up to three times before marking the response for review. All retries stay on Mistral AI infrastructure. No audio leaves the EU.
- Mistral Small. Context-aware summaries. Reads the transcript next to the question you asked, then writes a one-line summary that picks up pricing concerns, competitor mentions, decision timelines, and blockers. Not a generic abstract.
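The retry behaviour above can be sketched in a few lines. This is a rough illustration under stated assumptions, not the production code: `transcribe_with_retry`, `TranscriptionResult`, and the `transcribe` callable are hypothetical names, and "retries up to three times" is read here as one initial attempt plus at most three retries. Backoff delays between attempts are left out to keep the sketch short.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_RETRIES = 3  # "retries automatically up to three times"


@dataclass
class TranscriptionResult:
    transcript: Optional[str]   # None when every attempt failed
    attempts: int               # total calls made, including the first
    needs_manual_review: bool   # True when the response is flagged for review


def transcribe_with_retry(
    audio: bytes,
    transcribe: Callable[[bytes], str],  # hypothetical stand-in for the Voxtral call
    max_retries: int = MAX_RETRIES,
) -> TranscriptionResult:
    """Try once, retry on failure, then flag the response for manual review."""
    attempts = 0
    for _ in range(1 + max_retries):
        attempts += 1
        try:
            return TranscriptionResult(transcribe(audio), attempts, False)
        except Exception:
            # Assumption: any failure or timeout is treated as retryable.
            continue
    return TranscriptionResult(None, attempts, True)
```

Every attempt calls the same `transcribe` function, which mirrors the point that retries stay on the same infrastructure rather than falling back to a different provider.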
Related pages
Where AI transcription fits in a broader async voice sales flow.
Use case
What to send after a deal goes cold
Use the same Magic Link to ask lost prospects one question. Read the transcript pattern across five deals.
Playbook
Customer discovery, twenty interviews, no calls
A six-step playbook for running real interviews async. Transcripts and summaries do the synthesis work.
Use case
Async user interviews
When voice notes replace the live call, and when they do not. Worked examples and limits.
Guide
Voice to text feedback, end to end
Longer pillar guide: the full async voice feedback workflow, from question design to acting on the transcript.
Common questions
How accurate is the transcription?
Is the transcription EU-hosted?
What languages does it support?
How is this different from Gong, Fathom, or Otter?
Can I export transcripts to my CRM?
How long are recordings stored?
Stop transcribing calls. Skip the call.
Five free responses to start. Setup takes under a minute.
Create your first Magic Link