Script intelligence that reads your format
Paste your script. The parser detects speakers, SFX cues, music tags, and scene markers — and confidence-scores every line so you only review the ones it was unsure about.
flexVox turns a written script into a fully produced, multi-voice podcast. Paste dialogue, assign an AI voice to every character, and generate speech, sound effects, and music. Then mix and export, all on your iPhone or iPad.
ALEX: You hear that? The whole city is on a delay.
JORDAN: [whispers] Three seconds behind the lightning.
[SFX: distant thunder (3s) @underlay volume=0.4]
ALEX: [laughs] Promise me we'll never count down together.
[Music: slow tape-piano hush (12s) @underlay loop]
A multi-speaker script lands. The parser detects ALEX, JORDAN, SFX, and music cues automatically.
Auto-Cast picks a voice for every character. Fine-tune stability, style, and speed.
Dialogue, SFX, and music are mixed with auto-ducking. Export to your podcast host.
Most text-to-speech apps ask you to paste one paragraph for one voice. That's fine for a notification. It's not fine for a podcast, a table read, or an audio drama. flexVox preserves speaker, SFX, and music information through the entire pipeline, so what you get back actually sounds like a conversation.
No web views. No Electron. No accounts to start. Bring your own ElevenLabs key when you're ready for real audio — or stay in demo mode and explore every screen for free.
HOST: Welcome back to the show.
HOST: [warmly] Today, we're talking about timing.
[SFX: rain on windows (5s) @underlay volume=0.3 loop]
GUEST: [laughs] You picked the right week.
[Music: warm rhodes intro (10s)]
Three tabs — Script, Production, Export — and one continuous workflow. Jump back and forth freely. flexVox remembers where you were.
Drop in a multi-speaker script. The parser detects HOST:, [Host], standalone names, and your SFX and music tags across common formats.
Script tab · auto-parse + confidence scoring
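To make the script format concrete, here is a minimal sketch in Python of how lines like the samples above could be classified. It is an illustration only, not flexVox's parser: the `ParsedLine` type, the regular expressions, and the confidence values are all assumptions, and it handles only the `NAME:` dialogue form and bracketed `[SFX: ...]` / `[Music: ...]` cues, not `[Host]` or standalone names.

```python
import re
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical patterns, inferred from the sample scripts on this page.
CUE_RE = re.compile(r"^\[(SFX|Music):\s*(.+)\]$", re.IGNORECASE)
DIALOGUE_RE = re.compile(r"^([A-Z][A-Z0-9 _'-]*):\s*(.+)$")
DURATION_RE = re.compile(r"\((\d+(?:\.\d+)?)s\)")
INLINE_TAG_RE = re.compile(r"\[([^\]]+)\]\s*")


@dataclass
class ParsedLine:
    kind: str                       # "dialogue", "sfx", "music", or "unknown"
    confidence: float               # arbitrary illustrative score in [0, 1]
    speaker: Optional[str] = None
    text: str = ""
    tags: list = field(default_factory=list)
    duration: Optional[float] = None
    options: dict = field(default_factory=dict)


def parse_cue_body(body: str):
    """Split 'distant thunder (3s) @underlay volume=0.4' into its parts."""
    match = DURATION_RE.search(body)
    if match:
        description = body[:match.start()].strip()
        tail = body[match.end():]
        duration = float(match.group(1))
    else:
        # No duration: treat everything before the first "@" as the description.
        description, _, rest = body.partition("@")
        description = description.strip()
        tail = "@" + rest if rest else ""
        duration = None
    options = {}
    for token in tail.split():
        if token.startswith("@"):
            options["placement"] = token[1:]
        elif "=" in token:
            key, value = token.split("=", 1)
            options[key] = value    # kept as a string, e.g. "0.4"
        else:
            options[token] = True   # bare flags such as "loop"
    return description, duration, options


def parse_line(raw: str) -> ParsedLine:
    line = raw.strip()
    cue = CUE_RE.match(line)
    if cue:
        description, duration, options = parse_cue_body(cue.group(2))
        # Assumption: a cue without a duration is flagged for review.
        confidence = 1.0 if duration is not None else 0.7
        return ParsedLine(kind=cue.group(1).lower(), confidence=confidence,
                          text=description, duration=duration, options=options)
    dialogue = DIALOGUE_RE.match(line)
    if dialogue:
        speaker, spoken = dialogue.groups()
        return ParsedLine(kind="dialogue", confidence=1.0, speaker=speaker,
                          text=INLINE_TAG_RE.sub("", spoken).strip(),
                          tags=INLINE_TAG_RE.findall(spoken))
    # Anything unrecognized gets a low score so a person can review it.
    return ParsedLine(kind="unknown", confidence=0.2, text=line)
```

Under these assumptions, a reviewer would only need to look at lines whose `confidence` falls below a chosen threshold.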
Auto-Cast picks a voice for every speaker. Or open the library and audition voices yourself — the session auto-advances to the next speaker.
Voice Mapping · Auto-Cast All + AI archetype banner
flexVox produces dialogue, sound effects, and music in one pass. Failed turns are retried automatically. Cancel anytime — already-generated audio is preserved.
Generation Status · resumable + resilient
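One way to picture "retried automatically" and "already-generated audio is preserved" is a loop that keeps a map of finished turns and skips them on the next run. The sketch below is an assumption-laden illustration in Python: `generate_all`, its parameters, and the backoff timing are hypothetical names and values, not flexVox's implementation.

```python
import time


def generate_all(turns, generate, done=None, max_attempts=3,
                 should_cancel=lambda: False, sleep=time.sleep):
    """Generate audio for each turn, retrying failures and keeping finished work.

    `generate` is any callable that returns audio for a turn or raises.
    `done` maps turn index -> audio from an earlier (possibly cancelled) run.
    """
    done = {} if done is None else done
    failed = []
    for index, turn in enumerate(turns):
        if should_cancel():
            break                       # everything in `done` is preserved
        if index in done:
            continue                    # resumable: skip finished turns
        for attempt in range(1, max_attempts + 1):
            try:
                done[index] = generate(turn)
                break
            except Exception:
                if attempt == max_attempts:
                    failed.append(index)
                else:
                    # Hypothetical exponential backoff: 0.5s, 1s, 2s, ...
                    sleep(0.5 * 2 ** (attempt - 1))
    return done, failed
```

Calling it again with the returned `done` map picks up where the previous run stopped.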
Regenerate any line as a new variant. Adjust pauses. Mark music as underlay. LUFS-aware auto-ducking sits the bed at the right level on its own.
Post-Production · variants, ducking, flags
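As a rough illustration of what loudness-aware ducking involves, the Python sketch below computes how far to turn a music bed down so it sits a fixed distance under the dialogue. The 18 LU gap, the function names, and the hard on/off envelope are assumptions for the example; flexVox's actual levels and ramping are not documented here.

```python
def ducking_gain_db(dialogue_lufs, bed_lufs, target_gap_lu=18.0):
    """Gain (in dB) that places the bed `target_gap_lu` below the dialogue.

    Only ever attenuates: a bed that is already quiet enough gets 0 dB.
    """
    desired_bed_lufs = dialogue_lufs - target_gap_lu
    return min(0.0, desired_bed_lufs - bed_lufs)


def db_to_linear(db):
    """Convert a gain in decibels to a linear amplitude multiplier."""
    return 10 ** (db / 20)


def bed_gain_at(t, dialogue_spans, duck_db):
    """Duck the bed only while someone is speaking.

    `dialogue_spans` is a list of (start, end) times in seconds. A real
    mixer would ramp the gain in and out instead of switching abruptly.
    """
    speaking = any(start <= t < end for start, end in dialogue_spans)
    return duck_db if speaking else 0.0
```

For example, with dialogue measured at -16 LUFS and a bed at -14 LUFS, the sketch asks for a 20 dB cut while a character is speaking, which is a linear gain of 0.1.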
Pick an export preset: Apple Podcasts, Spotify, YouTube, Broadcast, or Custom LUFS. Get an M4A, plus an SRT, VTT, JSON, or plain-text transcript.
Export sheet · Cmd+E
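For readers unfamiliar with the SRT transcript format, the following Python sketch shows how timed lines map to SRT blocks. The input tuple layout (start, end, speaker, text) and the choice to prefix each caption with the speaker name are assumptions made for the example, not a description of flexVox's export.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as SRT's HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues):
    """Render (start, end, speaker, text) tuples as an SRT document.

    Each block is a 1-based index, a time range, and the caption text,
    with a blank line between blocks.
    """
    blocks = []
    for number, (start, end, speaker, text) in enumerate(cues, start=1):
        time_range = f"{srt_timestamp(start)} --> {srt_timestamp(end)}"
        blocks.append(f"{number}\n{time_range}\n{speaker}: {text}\n")
    return "\n".join(blocks)
```

WebVTT uses a very similar layout, with a `WEBVTT` header line and a period instead of a comma before the milliseconds.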
The whole production line. The free tier covers the full script-to-audio path; Studio adds the professional edge.
Browse the ElevenLabs catalog inline, audition voices instantly, and tune stability, style, and speed per speaker. Auto-Cast picks distinct voices for your whole cast in one tap.
Type [SFX: thunder (3s) @underlay] and hear it happen. flexVox generates, ducks, and places ambient beds and underlay music without leaving the editor.
Each take is saved as a variant. Compare side-by-side, pick the best, and discard the rest. Your other audio never gets touched.
Describe an episode and get a production-ready draft. The engine knows 62 ElevenLabs v3 tags — [whispers], [laughs], [trying to sound brave] — and places them where they fit.
The Export view is a live teleprompter. Words highlight in time with the audio. Tap any line to jump there. Spot a mispronunciation without scrubbing.
flexVox is the tool we wished existed when we wanted to hear a script before we recorded it.
Run a narrative show but can't afford a cast? Write the parts. flexVox performs them — and lets you iterate without rebooking talent.
Generate a table read of your script with distinct character voices. Swap a voice, re-read a scene, share a rough audio draft in a single afternoon.
Build dialogue-based scenarios and role-plays. When your script updates, regenerate the lines that changed — keep everything else.
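Regenerating only the lines that changed can be pictured as a content-hash lookup: hash each line's speaker and text, and regenerate only lines whose hash has no cached audio. This Python sketch shows the idea under that assumption; `line_key`, `lines_to_regenerate`, and the cache layout are hypothetical, and a real cache key would likely also include the voice and its settings.

```python
import hashlib


def line_key(speaker, text):
    """Stable key for a line; any edit to speaker or text changes it."""
    payload = f"{speaker}\x00{text}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def lines_to_regenerate(script, cache):
    """Return indices of script lines that have no cached audio.

    `script` is a list of (speaker, text) pairs; `cache` maps
    line_key(...) -> previously generated audio.
    """
    return [
        index
        for index, (speaker, text) in enumerate(script)
        if line_key(speaker, text) not in cache
    ]
```

After a script edit, only the returned indices would need new audio; every unchanged line keeps its existing take.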
flexVox does one thing well: it turns a multi-speaker script into produced audio. Here's where it fits.
flexVox depends on ElevenLabs for voice generation. Audio quality and available voices come from that service. We add the script intelligence, the production workflow, and the native mobile experience.
Free to download. Free demo mode. Bring your own ElevenLabs key — or just explore the workflow first.