Guide

Voice notes at work: do US professionals actually send them?

The evidence-first answer to the most common objection about asynchronous voice feedback. What changed between 2022 and 2024, the ten-second hesitation curve, and the cases where the objection is still right.

Try it on one prompt

5 free responses, no credit card

The short answer

Yes. Voice notes crossed into US business chat between 2022 and 2024. Slack shipped native voice clips, WhatsApp Business became a standard sales channel, and Meta reported 7 billion voice messages a day on WhatsApp alone. On HeySpeak, every link offers both options on one page: record 60 seconds, or book a call. The hesitation lasts about ten seconds. Then people just talk.

voice messages sent every day on WhatsApp, per Meta

2022

Slack shipped native voice clips into the default workspace product

Voice or call

every HeySpeak link offers both, so no recipient is ever forced into audio

For senders

Will my recipient actually record something?

This is the question most people ask before they send their first HeySpeak link. The honest answer is that the worry was true in 2018. It is no longer true now. Voice messaging crossed from a teenage WhatsApp habit into mainstream US professional communication somewhere between 2022 and 2024, and the cultural softening keeps moving in one direction.

Slack added native voice clips in 2022. Microsoft Teams supports voice messages. WhatsApp Business is a default support and sales channel for thousands of US companies. Loom built a 25 million user product on a closely related instinct. LinkedIn shipped voice messages on mobile years ago. Every one of those products had to defend the same objection, and every one of them won the argument by shipping anyway.

Your recipient does not need to be a daily voice-note user on day one. They need to be willing to try it once. The link is designed to make that one try almost free: one URL, one button, no signup, no app install. The friction is so low that the choice is no longer between recording and not recording. It is between recording and ignoring the message entirely, and a surprising share of people pick the recording.

For receivers

Is it weird if I send a voice note back?

No. Nobody on the other end of a HeySpeak link is grading your audio. The person who sent you the link is asking for the fastest, most honest version of your answer, not a polished radio clip. They will probably hear a pause, an “um,” a reset halfway through a sentence, the background noise of wherever you happen to be. Every other recording they receive has the same texture. That is the point.

The recording is private. It goes to one person, not to a public page. There is no comment thread. There is no social feed. The audio sits in an encrypted bucket and only the sender can play it back through a link that expires after an hour. If you have ever sent a voice note inside a WhatsApp thread you trusted, this is the same level of exposure, just outside a chat app.

And if voice still does not feel right for the question you are answering, the same page gives you a calendar. You book a slot, you talk on a real call, you are done. The option is right there. The link is not asking you to commit to a new mode of communication. It is asking you to pick the one that fits the next ten minutes of your life.

The ten-second hesitation

Every voice-feedback objection lives in the first ten seconds after someone opens the link. The receiver looks at the page, registers that they are being asked to talk into their phone, hesitates, and then decides. Most of the design work on HeySpeak goes into shortening that hesitation.

One button, labelled clearly. A short prompt above it, written by the sender, that tells the recipient exactly what to talk about. A second option on the same page, in plain view, that books a call instead. No login wall. No app store detour. No privacy theatre. The receiver does not have to commit to anything beyond the next 60 seconds.

Once the recording starts, the hesitation is gone. People talk. They tend to talk for longer than they would have typed. The answer is more honest, because speaking out loud is harder to edit into a polished non-answer than a paragraph in a text field. The sender reads a one-paragraph AI summary the same evening. The whole loop is closed in less time than it would have taken to schedule the call that almost happened instead.

Voice is already the default outside the US

The US is one of the few large markets where voice messaging is still treated as a question rather than a habit. In most of the rest of the world, the argument was settled years ago. WhatsApp voice notes are a daily mode of communication in Brazil, Spain, Italy, and across Latin America. WeChat hold-to-record voice messages have been a default form of chat in mainland China for over a decade. KakaoTalk in Korea and LINE in Japan show the same pattern.

For Mandarin, Japanese, Korean, Arabic, and Hindi speakers there is also a typing tax that does not exist in English. Mobile input methods like Pinyin require a phonetic-to-character conversion on every word. Japanese mixes three scripts in one sentence. Speech bypasses all of it, which is why voice is not just preferred in those markets, it is faster than the alternative by a large factor. If your audience is in any of these regions, the burden of proof flips: text is the unusual ask, not voice.

None of this is reason to dismiss the US hesitation. It is reason to treat it as a lagging cultural pattern rather than a permanent one. The product is already familiar to most of the world. The US is the market where the habit is still settling in, and the curve is steep.

More detail on the typing tax and the markets where this matters most: voice feedback for non-Latin scripts.

Where the objection is still partially right

There are real cases where a voice note is not the right ask. Highly regulated industries, like legal discovery or some clinical contexts, sometimes require written records as the primary artefact. A senior executive who has a strong personal preference for written communication will probably keep that preference. A recipient who is in a quiet office where they cannot speak out loud will not record on the spot.

None of these break HeySpeak. The calendar option catches all of them. The recipient who cannot record right now books a slot for later. The executive who would rather not talk into a phone takes the meeting. The legal contact gets a follow-up email with the same prompt in writing. The link is not betting everything on voice. It is offering voice as the lower-friction option for the majority who will use it, and a clean fallback for the minority who will not.

Common questions

Will US professionals really record a voice note for a vendor or a partner?

Yes, and far more often than people assume. Slack added native voice clips in 2022. Microsoft Teams supports voice messages. WhatsApp Business is a default channel for sales and customer support in the US. Loom built a 25 million user product on the same instinct. The receiver does not need to be a voice-note person on day one. They need to be willing to try it once, and the HeySpeak link makes that try cost about ten seconds.

Isn't it unprofessional to ask for a voice note instead of an email?

The frame is backwards. Asking for a voice note respects the recipient's time more than a long email thread or a 30-minute call. You are saying: speak for 60 seconds, in your own words, on your own schedule, with no calendar dance. Most recipients read that as considerate, not strange. The few who do not still see the calendar option and book a slot instead.

What if the recipient refuses to record audio?

Every HeySpeak link shows two options on the same page: record a voice note, or book a meeting on your calendar. A recipient who would never record audio still has a frictionless path forward. You do not lose the conversation. You just route it the way they prefer. That is the entire reason both options sit on one page.

Does the recipient need an account, an app, or a login?

No. The link opens a single mobile-friendly web page with a record button. No signup, no app install, no email confirmation. The recipient taps record, talks, and submits. The whole flow runs in the browser. This is the part of the experience that most often turns a skeptic into a user inside one session.

Where does the recording go? Is it public anywhere?

The audio file is stored privately in encrypted Cloudflare R2 storage. Only the sender who created the link can play it back, through a signed URL that expires in one hour. There is no public page, no shareable clip, no social embed. The recipient is talking to one person, not posting to the internet.

Is voice feedback already standard outside the US?

Yes, and the US is catching up to a habit that is already mainstream in most of the world. WhatsApp voice notes are a daily mode of communication in Brazil, Spain, Italy, and across Latin America. WeChat voice messages are a default in mainland China. KakaoTalk in Korea and LINE in Japan show the same pattern. The cultural argument that voice is weird is mostly a US story, and a fading one even there.

Send one link. See what happens.

Five free responses, no credit card. The fastest way to find out whether your audience hesitates or just talks.

Create your first link