Product
Product overview Pricing Integrations
Solutions
E-commerce Restaurants Healthcare
Resources
Blog About API
ESENFR
Contact us →
Product Mar 1, 2026 · 5 min · Equipo VENDAQ

Your customer sent a voice note. Now what?

It's 9:47 PM. A customer opens WhatsApp and sends you a 40-second voice note. She speaks fast. Something about a dress she saw on Instagram, asking if it's available in size M, wants to know the price, and whether it can arrive by Friday.

40 seconds. Four questions. One clear intent: she wants to buy.

What does your current chatbot do? Nothing. Literally nothing. Most chatbots don't even register that a voice note arrived. Some respond with a generic message: "Sorry, I can't process voice messages." Others simply ignore it and wait for text.

That customer isn't going to type everything again. She's going to leave.

What happens when VENDAQ receives that voice note

Let's break down exactly what happens in the next 8 seconds:

Second 0-2: Transcription

The voice note arrives via the WhatsApp Business API. VENDAQ downloads it and sends it to our transcription engine. We don't use a generic service — we use models optimized for conversational Latin American Spanish, with all its variations, idioms, and speed.

95%+
transcription accuracy for LATAM Spanish
2 sec
average transcription time
70%
of WhatsApp messages in LATAM are voice notes

The result: "Hi, I saw a dress on your Instagram, the blue one with flowers, do you have it in M? How much is it? And I need it to arrive by Friday because it's for a birthday."

Second 2-4: Intent understanding

VENDAQ doesn't read text literally. It analyzes intent. From that transcription it extracts:

  • Product: blue dress with flowers (Instagram reference)
  • Size: M
  • Queries: availability + price
  • Shipping constraint: must arrive before Friday
  • Emotional context: it's a gift (birthday) — real urgency

This isn't keyword matching. It's contextual understanding. VENDAQ understands that "I need it to arrive by Friday" isn't a shipping question — it's a purchase condition.

Second 4-6: Catalog lookup

With the intent clear, VENDAQ queries your catalog in real time. Searches for the blue flower dress. Checks stock in size M. Calculates whether shipping can arrive before Friday based on the customer's location.

In this case: yes, it's in stock, the price is $45.99, and with express shipping it arrives Thursday.

Second 6-8: Personalized response

VENDAQ responds via WhatsApp:

Hi! 🌸 The blue flower dress is available in size M. It's $45.99. With express shipping ($3.99), it'll arrive Thursday — perfect for the birthday. Want me to reserve it for you?

8 seconds. From voice note to response with availability, price, and shipping options. At 9:47 PM. No human intervention.

Now let's compare with the alternative

Without VENDAQ, here's what happens with that same voice note:

Scenario A: Traditional chatbot. Ignores the voice note. Responds "Please type your question" or just doesn't answer. The customer waits 5 minutes, gets frustrated, and looks at another store.

Scenario B: No chatbot. The voice note arrives on the business WhatsApp. Nobody sees it until 9 AM the next morning. Someone listens to it, looks up the dress, responds 12 hours later. The customer already bought elsewhere.

Scenario C: 24/7 human team. A night-shift agent listens to the voice note, searches the catalog, responds in 3-5 minutes. Works, but costs $2,000-3,000/month in night shifts. For a store that gets 15 messages after 8 PM, it's not viable.

68% of purchase inquiries on WhatsApp arrive outside business hours. If you don't respond, you don't sell.

It's not just transcribing. It's understanding.

Anyone can transcribe audio — there are cheap APIs for that. What differentiates VENDAQ is what happens after the transcription.

When a customer says "the blue one with flowers you posted on Insta," VENDAQ understands it refers to a recent Instagram post. It cross-references with your catalog. It doesn't need a SKU code or exact description.

When she says "I need it to arrive by Friday," VENDAQ doesn't just log the date — it calculates logistics. Is it possible? At what cost? Is there an express option?

When she says "it's for a birthday," VENDAQ detects emotional context. It knows this sale has real urgency and that a "let me check and get back to you" could kill it.

Voice notes are the dominant channel

In LATAM, voice messages aren't the exception. They're the norm.

70%
of WhatsApp messages in LATAM are voice notes
3x
more information per voice note vs text
0%
of traditional chatbots that process voice

Think about your own behavior. When was the last time you typed a long paragraph on WhatsApp? You probably sent a voice note. Your customers do the same.

And a 40-second voice note contains more information than 5 text messages. It has tone, urgency, emotion. A customer who sends a voice note about a product wants to buy. It's the strongest intent signal there is.

Ignoring those voice notes is like having a salesperson who covers their ears when a customer talks to them.

What about complicated voice notes?

Not all voice notes are as clean as the example. Sometimes there's background noise. Sometimes the customer is talking to someone else while recording. Sometimes they change topics three times in 60 seconds.

VENDAQ handles these cases:

  • Background noise: filtered before transcription. Tested in real environments (buses, markets, parties).
  • Multiple topics: separated and responded to in priority order.
  • Voice + text: some customers send a voice note then a text clarifying. VENDAQ unifies the context.
  • Long voice notes (2+ minutes): fully transcribed and summarized for the human agent if intervention is needed.

The impact in numbers

Businesses that activated voice processing with VENDAQ report:

  • +35% response rate: voice notes that were previously ignored now get answered.
  • +22% conversion: because customers who send voice notes have high purchase intent.
  • -40% human team workload: 80% of voice notes are resolved without intervention.
  • NPS +18 points: customers feel heard — literally.

Voice notes aren't a problem to solve. They're an opportunity nobody else is capturing.

Start listening

If your current tool doesn't process voice, you're missing 70% of the conversation. Not 70% of messages — 70% of the information, because voice notes contain more context than any text.

VENDAQ was built to listen. Not as an add-on feature, but as a core part of the experience. Because in LATAM, commerce is conversational. And conversations are in audio.

Your customer sent a voice note at 9:47 PM. The question isn't whether you can respond. It's whether you can afford not to.

Ready to change how your customers talk to you?

15 minutes. No strings attached.

Schedule a conversation →