Media Library
What the Media Library is
A central place for all the audio your phone system uses:
- Voicemail greetings
- IVR prompts ("Press 1 for sales...")
- On-hold music
- Queue announcements
- Outbound campaign audio
- Whatever else needs to be played to a caller
Three ways to add audio: upload an existing file, generate from text-to-speech, or record from your browser or a phone.
Where it lives
Top nav → Media Library.
The list shows every audio file with columns:
- Display name
- File type (WAV / MP3 / OGG)
- File size
- Duration
- Tags
- Used by (which call flows, voicemail boxes, queues, campaigns reference this file)
- Actions (play, download, edit metadata, delete)
Search by name, filter by tag, sort any column.
Upload an existing file
Click New → Upload tab.
Drag a file onto the drop zone, or click Browse. Supported formats:
- WAV — best quality, no conversion needed
- MP3 — converted to WAV on upload
- OGG — converted to WAV on upload
Form fields:
- Display Name — auto-filled from filename, editable
- Description — optional
- Tags — comma-separated, for organization
Click Upload. The file is processed (transcoding if needed), then appears in the library.
Audio specs
For best quality:
- 16-bit PCM WAV
- 8 kHz mono (telephony standard) or 16 kHz mono (HD voice)
- No DC offset
- Peaks no higher than -3 dBFS (avoids clipping)
Audio pulled from web sources is usually acceptable in quality but often too loud. Normalize it before uploading, or it will sound harsh and distorted to callers.
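To make the specs above concrete, here is a minimal Python sketch that checks a WAV file against them and scales a loud file down to the -3 dBFS peak. It uses only the standard library. The function names (read_samples, check_specs, normalize_peak) are made up for this example and are not part of the platform. It does not check for DC offset, and it assumes the file is already 16-bit PCM.

```python
import array
import math
import sys
import wave


def read_samples(path):
    """Return (params, samples) for a 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wf:
        params = wf.getparams()
        if params.sampwidth != 2:
            raise ValueError(f"expected 16-bit PCM, got {params.sampwidth * 8}-bit")
        samples = array.array("h", wf.readframes(params.nframes))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian
    return params, samples


def peak_dbfs(samples):
    """Peak level in dB relative to full scale (32768 = 0 dBFS)."""
    peak = max((abs(s) for s in samples), default=0)
    return -math.inf if peak == 0 else 20 * math.log10(peak / 32768)


def check_specs(path, max_peak_dbfs=-3.0):
    """Return a list of problems; an empty list means the file fits the specs."""
    params, samples = read_samples(path)
    problems = []
    if params.nchannels != 1:
        problems.append(f"{params.nchannels} channels, expected mono")
    if params.framerate not in (8000, 16000):
        problems.append(f"{params.framerate} Hz, expected 8000 or 16000")
    peak = peak_dbfs(samples)
    if peak > max_peak_dbfs:
        problems.append(f"peak {peak:.1f} dBFS is above {max_peak_dbfs} dBFS")
    return problems


def normalize_peak(src, dst, target_dbfs=-3.0):
    """Write a copy of src whose loudest sample sits at or just under target_dbfs."""
    params, samples = read_samples(src)
    peak = max((abs(s) for s in samples), default=0)
    gain = 1.0 if peak == 0 else (32768 * 10 ** (target_dbfs / 20)) / peak
    # int() truncates toward zero, so the result never lands above the target.
    scaled = array.array(
        "h", (max(-32768, min(32767, int(s * gain))) for s in samples)
    )
    if sys.byteorder == "big":
        scaled.byteswap()
    with wave.open(dst, "wb") as wf:
        wf.setparams(params)
        wf.writeframes(scaled.tobytes())
```

A typical run would call check_specs("greeting.wav") first, then normalize_peak("greeting.wav", "greeting-ok.wav") if a peak problem is reported. Peak normalization only changes the level. Resampling or converting stereo to mono still needs an audio editor.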
Generate from text (TTS)
Click New → Text-to-Speech tab.
Form fields:
- Display Name
- Text — what to say (max ~5000 characters)
- Voice — pick from available voices (typically Polly voices: Joanna, Matthew, Salli, etc.)
- Language (en-US, es-ES, fr-FR, etc.)
- Tags
Click Preview to hear it before saving. Adjust the text or voice as needed.
Click Generate & Save. The file is generated and appears in the library, ready to use.
Writing for TTS
A few tips:
- Use full punctuation — periods, commas, ellipses. They control pacing.
- Spell tricky words phonetically — TTS often mispronounces names and acronyms. "EE-DAH" instead of "Ada"; "S-Q-L" instead of "SQL"; "yor-NIK" instead of "Yarnyk".
- Keep sentences short — long sentences sound robotic
- Test with the actual voice — voices interpret the same text differently. Joanna might pronounce "Yarnyk" wrong while Matthew gets it right. Pick the best voice for your text.
For greetings and IVR, TTS is usually good enough. For polished marketing-style intros, consider human voice talent and upload the resulting WAV.
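If you keep prompt scripts in a file and paste them into the Text field, a small helper can apply the tips above before you generate. The Python sketch below swaps in phonetic respellings, enforces the approximate 5000-character limit, and flags long sentences. All names here (prepare_tts_text, long_sentences, the 25-word threshold) are assumptions for illustration. The platform itself does not expose them.

```python
import re

MAX_TTS_CHARS = 5000  # approximate limit from the form; the exact cutoff may differ


def prepare_tts_text(text, respellings, max_chars=MAX_TTS_CHARS):
    """Replace whole words with phonetic respellings, then check the length."""
    if respellings:
        # One pass with alternation, so a respelling is never re-replaced.
        words = sorted(respellings, key=len, reverse=True)
        pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b")
        text = pattern.sub(lambda m: respellings[m.group(1)], text)
    if len(text) > max_chars:
        raise ValueError(f"text is {len(text)} characters, limit is {max_chars}")
    return text


def long_sentences(text, max_words=25):
    """Return sentences longer than max_words (an arbitrary threshold)."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if len(s.split()) > max_words]


prompt = "Thanks for calling Yarnyk. For SQL support, press 1."
spoken = prepare_tts_text(prompt, {"Yarnyk": "yor-NIK", "SQL": "S-Q-L"})
print(spoken)  # Thanks for calling yor-NIK. For S-Q-L support, press 1.
```

Matching is case-sensitive and whole-word only, so "SQL" is replaced but "MySQL" is left alone. You still need to preview the result with the actual voice.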
Record from the browser
Click New → Record tab.
The browser asks for microphone permission. Click Start Recording, speak, then click Stop.
Preview by playing back. Re-record if needed. Save with a display name and tags.
Works in the latest versions of Chrome, Firefox, and Safari. Mobile browser support varies.
Quality depends entirely on your microphone. A laptop's built-in mic is okay for personal voicemail greetings; a USB headset is better; a dedicated condenser mic is best.
Record from a phone (call-out method)
For people who would rather not record in the browser, or who want a "studio" feel using their desk phone:
Click New → Record tab → Use Phone option.
Form:
- Target — pick an extension
- Caller ID — which DID to use for the outbound call
- Display Name for the resulting file
Click Start Recording. The platform calls the chosen extension. The user answers and follows the voice prompt: "Recording starts after the beep. Press # when done." The recording is saved to the Media Library automatically.
Editing metadata
Click any file. Update the display name, description, or tags, then Save. The audio itself can't be edited here; to swap in new audio, use Replace Audio.
Replacing audio in place
Click a file → Replace Audio. Upload a new file. The old file is overwritten; everything that referenced this file (call flows, voicemail boxes, queues) automatically uses the new audio.
Use this for seasonal updates ("Happy Holidays" greeting that switches back in January) without changing call flow logic.
Deleting files
Click Delete on a file. If the file is in use anywhere, the platform warns you. Either:
- Remove the references first (in call flows, voicemail boxes, queues)
- Delete anyway and accept that those references stay broken until you point them at another file
Organizing with tags
For a small library (< 30 files), no organization needed. For a larger one:
- Tag by use: voicemail-greeting, ivr-prompt, hold-music, campaign-audio
- Tag by team: sales, support, marketing
- Tag by campaign: q1-2026-launch
Filter by tag using the search bar.
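If you plan tags in a spreadsheet or script before uploading, it helps to clean them up first so near-duplicates don't creep in. The Python sketch below parses a comma-separated tag string and filters a list of files by tag. The record layout and the lowercasing rule are assumptions for illustration. The guide does not say whether the platform treats tags as case-sensitive.

```python
def parse_tags(raw):
    """Split a comma-separated string into sorted, unique, lowercase tags."""
    return sorted({t.strip().lower() for t in raw.split(",") if t.strip()})


def filter_by_tag(files, tag):
    """Return the names of files whose tag string contains the given tag."""
    wanted = tag.strip().lower()
    return [f["name"] for f in files if wanted in parse_tags(f["tags"])]


# Hypothetical inventory, e.g. exported by hand from the library list.
library = [
    {"name": "Main menu", "tags": "ivr-prompt, sales"},
    {"name": "Holiday greeting", "tags": "voicemail-greeting, Support"},
    {"name": "Launch promo", "tags": "campaign-audio,q1-2026-launch,sales"},
]

print(parse_tags(" ivr-prompt, Sales ,,sales"))  # ['ivr-prompt', 'sales']
print(filter_by_tag(library, "sales"))           # ['Main menu', 'Launch promo']
```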
Library quota
Your service provider sets a storage quota for media. Most tenants stay well under it (audio files are small — a 30-second WAV is ~500 KB).
If you hit the quota, the upload form warns you. Delete unused files or contact your service provider to raise the limit.
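The size figure above follows from simple arithmetic: uncompressed PCM audio takes sample rate × bytes per sample × channels for every second, plus a small header. The Python sketch below works through it. The 44-byte header is the usual minimum for a plain PCM WAV, and the 100 MB quota is a made-up number for illustration. Your actual quota is set by your service provider.

```python
WAV_HEADER_BYTES = 44  # canonical PCM WAV header; files with extra chunks are larger


def wav_size_bytes(seconds, sample_rate=8000, bits=16, channels=1):
    """Estimate the size of an uncompressed PCM WAV file."""
    return WAV_HEADER_BYTES + int(seconds * sample_rate * (bits // 8) * channels)


narrowband = wav_size_bytes(30)                   # 8 kHz mono, 16-bit
hd_voice = wav_size_bytes(30, sample_rate=16000)  # 16 kHz mono, 16-bit

print(narrowband)  # 480044 bytes, roughly 480 KB
print(hd_voice)    # 960044 bytes, roughly 960 KB

# How many 30-second narrowband prompts fit in a hypothetical 100 MB quota.
hypothetical_quota = 100 * 1000 * 1000
print(hypothetical_quota // narrowband)  # 208
```

Since MP3 and OGG uploads are converted to WAV, it is likely the converted size that counts toward the quota, though that is an inference rather than something the guide states. A long MP3 hold track can take far more space than the original file suggests.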
Common patterns
IVR prompt set
For a multi-prompt IVR, generate all prompts as TTS at once with the same voice, tagged ivr-main. Then your IVR sounds consistent. Don't mix TTS voices across prompts — it sounds disjointed.
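One way to keep a prompt set consistent is to write the prompts down once and derive every Text-to-Speech form entry from the same voice, language, and tag. The Python sketch below builds such a worksheet. PromptSpec and build_prompt_set are hypothetical names, and nothing here talks to the platform. You would still enter each row in the Text-to-Speech tab by hand.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptSpec:
    """One row to enter in the Text-to-Speech form."""
    display_name: str
    text: str
    voice: str
    language: str
    tags: tuple


def build_prompt_set(prompts, voice="Joanna", language="en-US", tag="ivr-main"):
    """Give every prompt the same voice, language, and tag."""
    return [
        PromptSpec(name, text, voice, language, (tag,))
        for name, text in prompts.items()
    ]


ivr_main = build_prompt_set({
    "Main menu": "Press 1 for sales. Press 2 for support.",
    "Invalid entry": "Sorry, that is not a valid option.",
    "Goodbye": "Thank you for calling. Goodbye.",
})

for spec in ivr_main:
    print(spec.display_name, "|", spec.voice, "|", ",".join(spec.tags))
```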
Hold music
For one polished hold music track, source from a royalty-free music library (Bensound, Free Music Archive, Epidemic Sound). Upload as MP3. Tag hold-music.
Don't use music you haven't licensed. Performing rights organizations (ASCAP, BMI) can pursue you for licensing fees or damages if you play unlicensed music on hold.
Voicemail greetings
Personal: each user records their own via *97. No Media Library entry needed.
Shared boxes: TTS-generate the greeting and assign in the Voicemail box settings.