Audio Tags

Add natural reactions, emotions, and delivery cues to your AI hosts with inline audio tags.

Audio tags let you insert expressive cues — like laughs, pauses, whispers, and more — directly into your episode script. When audio is generated, these tags are converted into natural-sounding vocal effects, making your AI hosts sound more human and engaging.

Audio tags appear as inline chips within speech blocks. To insert one, type / to open the tag picker.

Inserting an Audio Tag

  1. Click into any speech block in the script editor.
  2. Type / to open the audio tag picker.
  3. Browse the categorized list or start typing to filter tags.
  4. Select a tag by pressing Enter or clicking it.

The tag is inserted as a styled chip inline with your text. For example:

"I just found out we hit a million downloads [laughs] — I honestly can't believe it."

Available Preset Tags

Tags are organized into three categories:

Reactions

  • laughs — A light chuckle or laugh
  • sighs — An exhale expressing emotion
  • gasps — A sharp intake of breath
  • clears throat — A brief throat clear

Emotions

  • excited — Energetic, upbeat tone
  • nervous — Hesitant, uneasy delivery
  • calm — Relaxed, steady voice
  • frustrated — Tense, irritated tone
  • sarcastic — Dry, ironic delivery

Delivery

  • pauses — A brief silence
  • hesitates — Stumbling, uncertain speech
  • dramatic — Intense, theatrical delivery
  • whispers — Soft, hushed voice
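
For readers who think in data structures, the three preset categories and the type-to-filter behavior of the picker can be sketched as follows. This is only an illustration: `PRESET_TAGS` and `filter_tags` are hypothetical names, and the substring matching is an assumption, since the picker's actual matching rules are not documented here.

```python
# Hypothetical model of the preset list; names and layout are assumptions,
# not the product's internal representation.
PRESET_TAGS = {
    "Reactions": ["laughs", "sighs", "gasps", "clears throat"],
    "Emotions": ["excited", "nervous", "calm", "frustrated", "sarcastic"],
    "Delivery": ["pauses", "hesitates", "dramatic", "whispers"],
}


def filter_tags(query: str) -> dict[str, list[str]]:
    """Return presets whose name contains the query, grouped by category.

    Assumption: case-insensitive substring matching.
    """
    needle = query.strip().lower()
    matches: dict[str, list[str]] = {}
    for category, tags in PRESET_TAGS.items():
        hits = [tag for tag in tags if needle in tag]
        if hits:  # skip categories with no matching tag
            matches[category] = hits
    return matches


print(filter_tags("wh"))  # {'Delivery': ['whispers']}
```

An empty query returns every preset, which mirrors browsing the full categorized list before typing anything.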

Custom Tags

Not seeing the right tag? You can create your own:

  1. Type / to open the picker.
  2. Scroll to the bottom of the list or type a tag name that doesn't match any preset.
  3. Enter your custom tag text in the Custom tag input field.
  4. Press Enter to insert it.

Custom tags follow the same rules as presets — they're converted to vocal effects during audio generation. Keep custom tags short and descriptive for best results (e.g., chuckles nervously, takes a deep breath).

Editing and Removing Tags

  • Click a tag chip in the editor to re-open the picker and replace it with a different tag.
  • Place the cursor next to a tag chip and press Backspace or Delete to remove it, just like deleting any other character.

How Tags Work During Audio Generation

When you generate episode audio:

  • Each [tag] in the script is converted to a sound effect directive.
  • The AI voice renders the tag as a natural vocal expression blended into the surrounding speech.
  • Tags do not appear in captions, transcripts, or video subtitles — they're stripped from all user-facing text outputs.
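
If you post-process script text yourself, the stripping step described above can be approximated with a short sketch. It assumes tags are written as text in square brackets, as in the [laughs] example; the function names and the whitespace cleanup are illustrative choices, not the product's actual implementation.

```python
import re

# Assumption: a tag is any run of text enclosed in square brackets, e.g. [laughs].
TAG_PATTERN = re.compile(r"\[[^\[\]]+\]")


def strip_audio_tags(script_text: str) -> str:
    """Remove bracketed tags and tidy the whitespace they leave behind."""
    without_tags = TAG_PATTERN.sub("", script_text)
    collapsed = re.sub(r"\s{2,}", " ", without_tags)
    # Drop a stray space left in front of punctuation, e.g. "Wait , really?"
    return re.sub(r"\s+([,.!?;:])", r"\1", collapsed).strip()


def extract_audio_tags(script_text: str) -> list[str]:
    """List the tag names in order of appearance, without the brackets."""
    return [match[1:-1] for match in TAG_PATTERN.findall(script_text)]


line = "We hit a million downloads [laughs] and I can't believe it."
print(strip_audio_tags(line))    # We hit a million downloads and I can't believe it.
print(extract_audio_tags(line))  # ['laughs']
```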

Frequently Asked Questions

Do audio tags use extra credits?

No. Audio tags are part of the speech generation and don't consume additional credits beyond the normal audio generation cost.

Can I use audio tags with any voice?

Audio tags are designed for Horizon voices. If your host uses a Classic voice, audio tags are silently stripped during generation: your episode will still generate without errors, but the tags won't produce vocal effects.
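
Because the stripping is silent, it can help to check a script for tags that a Classic voice would drop before you generate audio. The sketch below is a hypothetical pre-flight check: the `SpeechBlock` structure, the "horizon" and "classic" labels, and the bracketed tag format are all assumptions made for illustration, not part of any documented API.

```python
import re
from dataclasses import dataclass

# Assumption: tags are serialized as bracketed text, e.g. [sighs].
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


@dataclass
class SpeechBlock:
    host: str
    voice_type: str  # assumed labels: "horizon" or "classic"
    text: str


def find_ignored_tags(blocks: list[SpeechBlock]) -> list[tuple[str, str]]:
    """Return (host, tag) pairs for tags that a Classic voice would drop."""
    ignored = []
    for block in blocks:
        if block.voice_type == "classic":
            for tag in TAG_PATTERN.findall(block.text):
                ignored.append((block.host, tag))
    return ignored


script = [
    SpeechBlock("Ava", "horizon", "Welcome back [laughs] to the show."),
    SpeechBlock("Ben", "classic", "Well [sighs] okay [pauses] let's begin."),
]
print(find_ignored_tags(script))  # [('Ben', 'sighs'), ('Ben', 'pauses')]
```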

Will audio tags show up in my captions or transcript?

No. Audio tags are automatically stripped from captions, SRT files, clip transcripts, and all other text outputs. They only affect the audio.

Can I add audio tags to music blocks?

No. Audio tags can only be inserted inside speech blocks. The / slash command does not open the tag picker anywhere else.

What happens if I type a custom tag the AI doesn't understand?

The AI voice will do its best to interpret the tag. For best results, use clear, short descriptions of sounds or delivery styles. If a custom tag doesn't produce the effect you want, try rephrasing it or use one of the presets.