Voice notes at work: do US professionals actually send them?
The evidence-first answer to the most common objection about asynchronous voice feedback. What changed between 2022 and 2024, the ten-second hesitation curve, and the cases where the objection is still right.
The short answer
For senders
Will my recipient actually record something?
This is the question most people ask before they send their first HeySpeak link. The honest answer is that the worry was true in 2018. It is no longer true now. Voice messaging crossed from a teenage WhatsApp habit into mainstream US professional communication somewhere between 2022 and 2024, and the cultural softening keeps moving in one direction.
Slack added native voice clips in 2022. Microsoft Teams supports voice messages. WhatsApp Business is a default support and sales channel for thousands of US companies. Loom built a 25 million user product on a closely related instinct. LinkedIn shipped voice messages on mobile years ago. Every one of those products had to defend the same objection, and every one of them won the argument by shipping anyway.
Your recipient does not need to be a daily voice-note user on day one. They need to be willing to try it once. The link is designed to make that one try almost free: one URL, one button, no signup, no app install. The friction is so low that the choice is no longer between recording and not recording. It is between recording and ignoring the message entirely, and a surprising share of people pick the recording.
For receivers
Is it weird if I send a voice note back?
No. Nobody on the other end of a HeySpeak link is grading your audio. The person who sent you the link is asking for the fastest, most honest version of your answer, not a polished radio clip. They will probably hear a pause, an “um,” a reset halfway through a sentence, the background noise of wherever you happen to be. Every other recording they receive has the same texture. That is the point.
The recording is private. It goes to one person, not to a public page. There is no comment thread. There is no social feed. The audio sits in an encrypted bucket and only the sender can play it back through a link that expires after an hour. If you have ever sent a voice note inside a WhatsApp thread you trusted, this is the same level of exposure, just outside a chat app.
And if voice still does not feel right for the question you are answering, the same page gives you a calendar. You book a slot, you talk on a real call, you are done. The option is right there. The link is not asking you to commit to a new mode of communication. It is asking you to pick the one that fits the next ten minutes of your life.
The ten-second hesitation
Every voice-feedback objection lives in the first ten seconds after someone opens the link. The receiver looks at the page, registers that they are being asked to talk into their phone, hesitates, and then decides. Most of the design work on HeySpeak goes into shortening that hesitation.
One button, labelled clearly. A short prompt above it, written by the sender, that tells the recipient exactly what to talk about. A second option on the same page, in plain view, that books a call instead. No login wall. No app store detour. No privacy theatre. The receiver does not have to commit to anything beyond the next 60 seconds.
Once the recording starts, the hesitation is gone. People talk. They tend to talk for longer than they would have typed. The answer is more honest, because speaking out loud is harder to edit into a polished non-answer than a paragraph in a text field. The sender reads a one-paragraph AI summary the same evening. The whole loop is closed in less time than it would have taken to schedule the call that almost happened instead.
Voice is already the default outside the US
The US is one of the few large markets where voice messaging is still treated as a question rather than a habit. In most of the rest of the world, the argument was settled years ago. WhatsApp voice notes are a daily mode of communication in Brazil, Spain, Italy, and across Latin America. WeChat hold-to-record voice messages have been a default form of chat in mainland China for over a decade. KakaoTalk in Korea and LINE in Japan show the same pattern.
For Mandarin, Japanese, Korean, Arabic, and Hindi speakers there is also a typing tax that does not exist in English. Mobile input methods like Pinyin require a phonetic-to-character conversion on every word. Japanese mixes three scripts in one sentence. Speech bypasses all of it, which is why voice is not just preferred in those markets, it is faster than the alternative by a large factor. If your audience is in any of these regions, the burden of proof flips: text is the unusual ask, not voice.
None of this is reason to dismiss the US hesitation. It is reason to treat it as a lagging cultural pattern rather than a permanent one. The product is already familiar to most of the world. The US is the market where the habit is still settling in, and the curve is steep.
More detail on the typing tax and the markets where this matters most: voice feedback for non-Latin scripts.
Where the objection is still partially right
There are real cases where a voice note is not the right ask. Highly regulated industries, like legal discovery or some clinical contexts, sometimes require written records as the primary artefact. A senior executive who has a strong personal preference for written communication will probably keep that preference. A recipient who is in a quiet office where they cannot speak out loud will not record on the spot.
None of these break HeySpeak. The calendar option catches all of them. The recipient who cannot record right now books a slot for later. The executive who would rather not talk into a phone takes the meeting. The legal contact gets a follow-up email with the same prompt in writing. The link is not betting everything on voice. It is offering voice as the lower-friction option for the majority who will use it, and a clean fallback for the minority who will not.
Read next
Two pages that extend the same idea, with more numbers and more examples.
Audience
Voice feedback for Chinese, Japanese, Korean and other non-Latin audiences
Why Pinyin and IME input make voice the obvious format for half the world, and what that means for cross-border feedback.
Comparison
The discovery call alternative that actually gets answered
When a 60-second voice note gives you more signal than a 30-minute calendar invite, and when it doesn't.
Common questions
Will US professionals really record a voice note for a vendor or a partner?
Isn't it unprofessional to ask for a voice note instead of an email?
What if the recipient refuses to record audio?
Does the recipient need an account, an app, or a login?
Where does the recording go? Is it public anywhere?
Is voice feedback already standard outside the US?
Send one link. See what happens.
Five free responses, no credit card. The fastest way to find out whether your audience hesitates or just talks.
Create your first link