🎧

Platform

Whisper Coaching

Real-time guidance the customer cannot hear

A private audio channel directly to the agent's ear. The customer's WebRTC consumer is never subscribed to the whisper producer; separation is enforced at the SFU, not the UI.

<150ms

Whisper voice latency to agent

0 audio bleed

Into customer downlink

1 click

Monitor → whisper switchover

Multi-coach

Several supervisors can co-coach

01

Selective audio routing, by design

Whisper uses a dedicated mediasoup audio Producer that is consumed only by the agent's Consumer. The customer's downstream RTP packets never include the supervisor's audio frames, so there is no client-side filter to bypass. Whenever the supervisor is unmuted and the agent is in the room, only the agent hears them.
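The routing rule above can be pictured as a server-side check that runs before any Consumer is created. This is a minimal sketch, not the product's implementation: `Peer`, `AudioProducer`, and `maySubscribe` are hypothetical names, and the rule that supervisors may consume regular call audio (for monitoring) is an assumption.

```typescript
// Hypothetical model of the subscription rule; these names do not come
// from the product or from mediasoup.
type Role = "agent" | "customer" | "supervisor";
type ProducerKind = "call" | "whisper";

interface Peer {
  id: string;
  role: Role;
}

interface AudioProducer {
  id: string;
  ownerId: string; // peer that is sending this audio
  kind: ProducerKind;
}

// Decide whether the SFU should create a Consumer for this peer at all.
// In a mediasoup server, a check like this would sit in front of
// transport.consume({ producerId, rtpCapabilities }).
function maySubscribe(peer: Peer, producer: AudioProducer): boolean {
  if (peer.id === producer.ownerId) return false; // never loop audio back to its sender
  if (producer.kind === "whisper") return peer.role === "agent"; // whisper goes to the agent only
  return true; // assumption: regular call audio is available to every other peer
}

const agent: Peer = { id: "agent-1", role: "agent" };
const customer: Peer = { id: "cust-1", role: "customer" };
const whisper: AudioProducer = { id: "p-9", ownerId: "sup-1", kind: "whisper" };

console.log(maySubscribe(agent, whisper));    // true
console.log(maySubscribe(customer, whisper)); // false
```

Because the decision is made where Consumers are created, a customer client that asks for the whisper producer simply never receives one.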

02

Two coaching modes

Switch between push-to-talk (good for high-risk regulated calls) and open-mic (good for new-agent ramp-up). Visual indicators on the agent UI show whisper status without revealing the supervisor's identity to the customer.

  • Push-to-talk: hold the spacebar or the push-to-talk button to speak
  • Open-mic: the supervisor's audio is on by default
  • Per-coach mute, so a supervisor never interrupts the agent by accident
  • Picture-in-picture script panel for sending text cues
  • Optional whisper-session recording for training reviews
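
The two modes and the per-coach mute reduce to one decision: should this coach's audio be flowing right now? A minimal sketch under stated assumptions: `CoachState` and `shouldTransmit` are hypothetical names, and the rule that mute overrides both modes is an assumption rather than documented behaviour.

```typescript
type CoachingMode = "push-to-talk" | "open-mic";

interface CoachState {
  mode: CoachingMode;
  talkKeyHeld: boolean; // spacebar or on-screen button is currently pressed
  muted: boolean;       // per-coach mute
}

// Returns true when the coach's whisper audio should reach the agent.
function shouldTransmit(state: CoachState): boolean {
  if (state.muted) return false;              // assumption: mute wins in either mode
  if (state.mode === "open-mic") return true; // open-mic: on by default
  return state.talkKeyHeld;                   // push-to-talk: only while held
}

console.log(shouldTransmit({ mode: "push-to-talk", talkKeyHeld: false, muted: false })); // false
console.log(shouldTransmit({ mode: "open-mic", talkKeyHeld: false, muted: false }));     // true
```

With mediasoup, the result could drive `producer.pause()` and `producer.resume()` on the supervisor's whisper Producer, so no audio is forwarded while the function returns false.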
03

Use cases we see in production

New agent ramp-up: a senior agent coaches a junior in real time during the junior's first 50 calls. Compliance interventions: a QA lead pushes mandatory disclosure language during a regulated call. Multilingual escalation: a Hindi-speaking supervisor whispers translations to a Tamil-speaking agent handling a Hindi-speaking customer.

04

Multi-coach scenarios

Two supervisors can whisper into the same session at the same time, for example a product expert and a process owner during an escalated call. Each whisper producer is isolated; the agent hears both, the customer hears neither.
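
With several coaches in a session, the same isolation applies per whisper producer. The sketch below lists which whisper streams each participant would receive. `Participant` and `whisperAudience` are hypothetical names. The text does not say whether co-coaches hear each other, so this sketch assumes they do not.

```typescript
interface Participant {
  id: string;
  role: "agent" | "customer" | "supervisor";
}

// For every participant, list the supervisors whose whisper audio they receive.
// One whisper producer per supervisor is assumed.
function whisperAudience(participants: Participant[]): Record<string, string[]> {
  const coachIds = participants
    .filter((p) => p.role === "supervisor")
    .map((p) => p.id);

  const audience: Record<string, string[]> = {};
  for (const p of participants) {
    // Only the agent is subscribed; everyone else gets an empty list.
    audience[p.id] = p.role === "agent" ? coachIds.slice() : [];
  }
  return audience;
}

const session: Participant[] = [
  { id: "agent-1", role: "agent" },
  { id: "cust-1", role: "customer" },
  { id: "sup-product", role: "supervisor" },
  { id: "sup-process", role: "supervisor" },
];

console.log(whisperAudience(session));
```

For the session above, the agent's entry contains both supervisors and the customer's entry is empty.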

SPEC

Technical specifications

Transport
mediasoup SFU · DTLS-SRTP audio-only producer
Latency
<150 ms one-way to the agent
Codec
Opus, 16 kHz mono, 32 kbps
Bandwidth
~40 kbps per whisper channel
Authentication
JWT with `whisper` role claim
Recording
Optional; separate track in the dual-recording MP4
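
The `whisper` role claim listed under Authentication could be enforced with a check like the one below before a whisper producer is opened. A minimal sketch only: the claim name `roles`, the `VerifiedClaims` shape, and the function name are assumptions, since the spec states only that the JWT carries a `whisper` role claim. The token's signature and expiry are assumed to have been verified by a JWT library beforehand; this code does not verify anything itself.

```typescript
// Claims as they would look AFTER signature and expiry verification.
interface VerifiedClaims {
  sub: string;
  roles?: string[]; // hypothetical claim name
}

// Allow a whisper producer only when the verified token carries the role.
function canOpenWhisperProducer(claims: VerifiedClaims): boolean {
  return Array.isArray(claims.roles) && claims.roles.indexOf("whisper") !== -1;
}

console.log(canOpenWhisperProducer({ sub: "sup-1", roles: ["monitor", "whisper"] })); // true
console.log(canOpenWhisperProducer({ sub: "sup-2", roles: ["monitor"] }));            // false
console.log(canOpenWhisperProducer({ sub: "agent-1" }));                              // false
```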
FAQ

Frequently asked questions

No, the customer cannot hear the whisper. The customer's WebRTC consumer is never given a subscription to the whisper producer. Even if the supervisor's microphone is loud, the audio frames are not in the customer's downstream RTP stream.

Yes, the agent can see when a whisper is active: the agent dashboard shows a small badge with the supervisor's name and a 'whisper active' indicator. The customer view is unchanged.

Yes, whisper sessions can be recorded. Whisper audio can be captured as a separate track in the session's dual recording, or excluded entirely. This is configurable per queue.
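
A per-queue recording setting of this kind might be modelled as follows. This is a hedged illustration, not the product's configuration format: the policy names, the track labels `call` and `whisper`, the queue IDs, and the default of excluding whisper audio are all assumptions.

```typescript
type WhisperRecording = "separate-track" | "excluded";

// Assumption: queues without an explicit setting do not record whisper audio.
const DEFAULT_POLICY: WhisperRecording = "excluded";

// Look up the policy for a queue, falling back to the default.
function whisperPolicyFor(
  queueId: string,
  overrides: Record<string, WhisperRecording>
): WhisperRecording {
  return overrides[queueId] ?? DEFAULT_POLICY;
}

// List the tracks a recording would contain under a given policy.
function tracksToRecord(policy: WhisperRecording): string[] {
  const tracks = ["call"]; // hypothetical label for the regular session audio
  if (policy === "separate-track") tracks.push("whisper");
  return tracks;
}

const overrides: Record<string, WhisperRecording> = { "new-hire-training": "separate-track" };

console.log(tracksToRecord(whisperPolicyFor("new-hire-training", overrides))); // [ 'call', 'whisper' ]
console.log(tracksToRecord(whisperPolicyFor("billing", overrides)));           // [ 'call' ]
```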