Skip to main content

Transcription viewer

The transcription viewer is the auditor's workspace. It combines audio playback, speaker-segmented transcription, emotion analysis, in-text search, and a qualitative speaker profile for each interlocutor.

Transcription viewer with speaker segments, emotions, and audio player

Opening the viewer

  1. From the Dashboard → quick-access card View your history.
  2. In the task list, filter by Completed and click a row or ID.
  3. The viewer opens on the Transcription tab.
info

Direct URL

If you know the task_uuid, go directly to /transcription/{task_uuid}.

Anatomy

ZoneContent
HeaderBreadcrumb Dashboard > Transcription, Advanced mode toggle, avatar
Left columnSegments of the operator (agent)
Right columnSegments of the customer
PlayerPlay/pause, speed, scrubber, download
Side panelToggles for Word search and Speaker profile evaluation
tip

Two columns

Diarization splits the conversation into two columns so it reads like a chat. Each message includes timestamp and an emotion icon.

Segments and diarization

Each segment shows:

ElementDescription
SpeakerSpeaker 00 (operator) and Speaker 01 (customer) by convention — renameable
TimestampSegment start and end (e.g. 0m 34.62s - 0m 39.28s)
Transcribed textOutput from Whisper/Deepgram/OpenAI
Emotion indicatorVisual label reflecting the dominant sentiment
CopyCopy control to copy text to clipboard

Rename speakers

Click Speaker 00 or Speaker 01 to assign descriptive names — "Agent Maria", "Customer", "Supervisor". Changes persist and apply to exports.

Clickable timestamps

Any timestamp plays the audio from that moment. The currently playing segment is highlighted with a glowing border so you can follow along.

Audio player

Persistent at the bottom of the viewer:

ControlAction
Play / Pause
-10s / +10sQuick jumps
ScrubberVisual navigation
Volume0–100 slider
Speed0.5× / 0.75× / 1× / 1.25× / 1.5× / 2×
DownloadDownloads the source file (permission required)
info

The source file is served from MinIO via a short-lived signed URL (≤5 min) to prevent leaks.

Emotion analysis per segment

Each segment shows a visual indicator for the dominant emotion:

EmotionOn-screen indicatorTypical reading
HappyPositive stateSatisfaction, enthusiasm
NeutralNeutral stateInformational tone
SurpriseSurprise stateNovelty, unexpectedness
SadSad stateDisappointment, mild frustration
AngryAngry stateStrong frustration
FearConcern stateWorry
DisgustRejection stateRejection, discomfort
tip

Hover any indicator to see the probability breakdown. See Sentiment analysis.

Word search within the audio

The Word search side panel lets you detect specific terms and locate them in the transcription.

Word search panel with editable term list

Flow

  1. Type a term and confirm with Enter or ✓.
  2. Edit any term with the edit icon or remove it with the delete icon.
  3. Click Search.
  4. The system highlights matches directly on the segments.

Word search result with matches and non-matches

Result legend

IconMeaning
YesWord found
Required not foundRequired word not found (e.g. in Compliance)
Optional not mentionedOptional word not mentioned
tip

Compliance use

For regulated audits, load the list of required phrases (greeting, close, legal disclaimers) and the viewer flags any that are missing.

Speaker profile evaluation

Next to word search, the panel includes Evaluate speaker profile. Generate a qualitative summary of each interlocutor from all their segments.

Speaker profile panel with AI-generated summary

How it works

  1. Tabs: Speaker 00 / Speaker 01 (or your renamed labels).
  2. Click the tab for the interlocutor.
  3. The GPT model generates a behavior summary — style, tone, professionalism, empathy, clarity.
  4. Use the copy button to copy the evaluation to the clipboard.

Use cases

  • Coaching: objective feedback for the agent.
  • Compliance: evidence that the script was followed.
  • Hiring: profile candidates in recorded interviews.
  • Disputes: summarize each party's stance in seconds.

Task actions

Depending on your permissions:

ActionDescriptionPermission
ApproveMark the audit as okSupervisor, Quality
RejectMark the audit as failedSupervisor, Quality
Edit transcriptionFix model errorsQuality, Admin
ExportDownload in TXT/JSON/CSVAll
Re-runReprocess with another engineAdmin
ArchiveMove to historical archiveSupervisor
ShareGenerate signed URL (configurable TTL)Supervisor, Admin
warning

Audit trail

Every edit is logged (who, what, when). Editing does not alter the source audio.

Export

FormatContent
TXTPlain transcription only
JSONTranscription + full metadata (timestamps, emotions, tags, score)
CSVOne row per segment — great for Excel
PDFFormatted report with cover, transcription, analysis
SRT/VTTSubtitles with timestamps

Troubleshooting

ProblemDiagnosisSolution
Segments out of sync with audioVBR or corrupted headerRe-encode with ffmpeg -i in.mp3 -b:a 192k out.mp3
Diarization mixes speakersMono with lots of overlapTry stereo transcription
Wrong proper nounsOut-of-vocabulary termsAdd terms to tenant dictionary
Can't see the Approve buttonMissing permissionRequest Quality or Supervisor role

Next steps