One-liner

The shortest version

flexVox is an iOS app that turns multi-speaker scripts into fully produced podcast audio with AI-generated voices, sound effects, and music.

100 words

A paragraph

flexVox is a native iOS app for producing multi-voice podcasts from written scripts. Users paste a dialogue script, and the parser automatically identifies speakers, sound effects, and music cues. Each character is assigned a distinct AI voice from the ElevenLabs catalog, with per-speaker tuning for stability, style, and speed. The app generates speech for each line plus any cued sound effects and music, then provides post-production tools to regenerate individual takes, compare variants, adjust pauses, and exclude segments. The final output is a mixed M4A file ready for distribution. A demo mode lets users explore the full workflow without an API key.

300 words

The full angle

flexVox is a podcast production app for iPhone and iPad that transforms written scripts into polished, multi-voice audio. It is designed for solo podcasters, audio drama writers, and educators who need professional-sounding dialogue without hiring voice actors or using desktop editing software.

The workflow starts with a script. Users paste their dialogue into the app, and a built-in parser detects speaker names, sound effect prompts, and music cues across several common script formats. A confidence-scored review screen lets users verify attributions, correct mistakes, and merge duplicate speakers.
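
For readers who want a feel for what this parsing step involves, here is a minimal sketch. It assumes a hypothetical script format in which dialogue is written as `NAME: text` and cues as `[SFX: ...]` or `[MUSIC: ...]`. The formats flexVox actually accepts and the way it scores confidence are not documented here, so the patterns, the `parse_script` name, and the confidence values are illustrative assumptions only.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical patterns; the app's real script formats are not specified.
CUE_RE = re.compile(r"^\[(SFX|MUSIC):\s*(.+?)\]$", re.IGNORECASE)
SPEAKER_RE = re.compile(r"^([A-Za-z][\w .'-]{0,30}):\s*(.+)$")

@dataclass
class Turn:
    kind: str                 # "speech", "sfx", or "music"
    speaker: Optional[str]    # None for cues
    text: str
    confidence: float         # illustrative score for the review step

def parse_script(script: str) -> List[Turn]:
    turns: List[Turn] = []
    last_speaker: Optional[str] = None
    for raw in script.splitlines():
        line = raw.strip()
        if not line:
            continue
        cue = CUE_RE.match(line)
        if cue:
            turns.append(Turn(cue.group(1).lower(), None, cue.group(2).strip(), 1.0))
            continue
        labelled = SPEAKER_RE.match(line)
        if labelled:
            last_speaker = labelled.group(1).strip()
            turns.append(Turn("speech", last_speaker, labelled.group(2).strip(), 1.0))
        else:
            # Unlabelled line: guess the previous speaker and flag it for review.
            turns.append(Turn("speech", last_speaker, line, 0.4))
    return turns
```

In this sketch, a line with no speaker label is attributed to the previous speaker with a low score, which is the kind of guess a review screen would surface for correction.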

Next, users assign AI voices. The app connects to the ElevenLabs voice library, where users can search, filter, and preview voices before mapping them to characters. Each voice profile includes independent controls for stability, similarity boost, style exaggeration, and playback speed. A quick preview streams a sample of the dialogue in the assigned voices, so users can check their choices before committing to a full generation run.

During generation, the app produces speech audio for every dialogue line, plus sound effects and music from text prompts embedded in the script. A real-time progress screen shows the current turn, completion percentage, and estimated time remaining. If generation is interrupted, users can resume from where it stopped.

Post-production tools let users play each segment individually, regenerate lines that need improvement, compare multiple takes side by side, adjust per-turn pauses, and exclude segments from the final output. When editing is complete, flexVox mixes all active turns into a single M4A file for export via the iOS share sheet.
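
To make the effect of per-turn pauses and exclusions concrete, the sketch below lays included segments end to end and computes where each one starts in the final mix. The `Segment` fields, the default pause, and the rule that no pause follows the last segment are assumptions for illustration, not a description of flexVox's internal mixer.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Segment:
    name: str
    duration: float            # seconds of the selected take
    pause_after: float = 0.5   # hypothetical default pause, in seconds
    excluded: bool = False

def build_timeline(segments: List[Segment]) -> Tuple[List[Tuple[str, float]], float]:
    """Return (name, start time) for each included segment and the total length."""
    active = [s for s in segments if not s.excluded]
    timeline: List[Tuple[str, float]] = []
    cursor = 0.0
    for i, seg in enumerate(active):
        timeline.append((seg.name, cursor))
        cursor += seg.duration
        if i < len(active) - 1:
            # Pauses sit between segments; none is added after the last one.
            cursor += seg.pause_after
    return timeline, cursor
```

Excluding a segment removes both its audio and its pause, so later segments move earlier in the timeline.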

A built-in demo mode produces silent placeholder audio, allowing users to explore every screen and feature before connecting an ElevenLabs account.

Quick facts

Spec sheet

Tagline candidates

A few lines

Contact

Talk to us

Press inquiries: press@pixelantern.com
General contact: hello@pixelantern.com
Studio: pixelantern.com