How many voices can I use in one project? +
No hard limit on the number of speakers. Each gets their own voice profile. Dialogue batch mode batches up to 10 unique voices per API call; larger casts auto-split across batches.
Can I fine-tune how a voice sounds? +
Yes. Each voice profile has independent sliders for stability, similarity boost, style exaggeration, and speed, plus a speaker boost toggle. v3 models use a simplified control set.
What are expression tags? +
ElevenLabs v3 audio directives that control how a voice performs a line — like [happy], [whispers], [laughs], [sighs]. Place them inline in dialogue text. Stack tags for complex emotions ([angry][laughing]). Write custom descriptive tags like [trying to sound brave] — v3 interprets natural language in brackets.
Can I add expression tags to an existing script? +
Yes. Use Enhance Expression Tags from the toolbar (or long-press a single turn). The app uses Apple Intelligence on-device when available, or falls back to your configured AI provider (OpenAI or Claude). Suggestions are context-aware — surrounding dialogue informs each tag. Review each before applying.
What modifiers can I add to SFX and music tags? +
@underlay plays the audio under dialogue instead of sequentially. volume=0.3 sets underlay volume (0–1). loop generates seamless looping audio. influence=0.8 controls how literally the AI interprets the prompt (0–1). Example: [SFX: rain on windows (5s) @underlay volume=0.3 loop].
What if generation fails or I cancel it? +
Audio that was already generated is preserved. Resume picks up where it stopped, skipping completed turns. Network errors and rate limits are retried automatically with exponential backoff.