1. Upload a song — Whisper transcribes the vocals and times each word to drive the lyric video below.

Try a sample song

Runs the full pipeline: transcribe lyrics → generate AI storyboard backgrounds → render kinetic typography → assemble the video (~30–90s depending on song length).

2. Choose how the lyrics look

Visual Theme
Sets the on-screen lyric text color: Dark = white, Light = warm gold, Neon = cyan glow. AI backgrounds are always slightly darkened, so pick whichever color reads best against your Visual Prompt.
Lyric Font
Typeface used for the on-screen lyrics. Bold sans-serif suits most songs; try Serif or Monospace for a different look.

Tips:

  • Best with clear vocals (ballads, pop, spoken word)
  • Describe the visuals you want in the Visual Prompt — it shapes both the AI backgrounds and the on-screen mood
  • Try different Visual Themes and Fonts to match your song's vibe
  • Processing takes ~30–90s depending on song length