The video prompt stack I use to generate scroll-stopping content.
— BILLY CODER
// the rule: every video has 3 beats — HOOK (0-2s) → BODY (2-6s) → CLOSER (6-10s). If your prompt doesn't separate them, the AI blurs them and the video flops. Every template below uses this structure.
// hook-first formats
// 01
The 3-Beat Reel
The workhorse. 8-10 sec. Use Kling, Veo, or Runway.
scene: [frame 1 composition — the hook moment, described as a still]
hook (0-2s): [the scroll-stopper — what happens in first 2 seconds]
body (2-6s): [the payoff — reveal, explanation, escalation]
closer (6-10s): [the final beat — loop back, CTA, or visual punch]
camera: [ONE move that supports all 3 beats — usually static OR slow push-in, never stacked]
lighting: [one setup, high-contrast]
color: [punchy, saturated, short-form-optimized]
audio: [hook audio 0-2s, body audio 2-6s, closer audio 6-10s — spec each]
mood: [attention-grabbing / direct / surprising]
duration: 10
Use for: 90% of your reels. Master this first.
// 02
Pattern Interrupt Reveal
Black frame / wrong-way / disorientation → resolves into the brand.
scene: [pattern-breaking visual — black frame / upside-down / extreme close-up on unexpected detail]
hook (0-2s): the disorientation moment — bass tone, no visual content, single sound cue
body (2-6s): the resolution — slow rotation, pull-back, or reveal of [subject]
closer (6-10s): [product / brand / CTA] lands clean
camera: [the move that resolves the disorientation]
lighting: pure dark transitioning to [reveal lighting]
color: [matches reveal — high contrast, neon noir, etc]
audio: silence or low rumble during hook, full audio on reveal
mood: disorienting → satisfying
duration: 8
Use for: mystery / brand reveals / dramatic launches.
// 03
Direct-Address Hook (Veo only)
Spoken line straight to camera. Veo handles dialogue native.
[Shot type] of [character] in [setting], looking directly into camera, [light source].
[Character] says, "[HOOK LINE — under 80 chars, under 2 seconds to say]."
[Continuation beat or pause].
[Character] says, "[payoff line]."
Fixed camera, [50mm f/2.0 / cinematic style].
[Color grade], subtle film grain.
[Mood — direct, grounded, slightly confrontational].
Natural room ambient, no music.
Use for: talking-head hooks, lead gen, call-outs to specific audience.
// commerce + product
// 04
Product Reveal (E-commerce Ad)
Hook stops scroll, body shows product, closer lands the offer.
scene: [product on clean surface, top-down or 3/4 angle, branded environment]
hook (0-2s): [pattern interrupt OR contradiction — text flashes "wait for it" / object covers product / unexpected motion]
body (2-6s): clean reveal of [product] — single motion (lift / pour / unwrap / open)
closer (6-10s): macro push-in on key feature, on-screen text "[hook claim]" + "[CTA]"
camera: static through hook, push-in for closer, 50mm feel, f/2.8
lighting: warm overhead practical, soft fill
color: [brand palette], cinematic, slight grain
audio: silence 0-1s, [product sound] at 1s, [whoosh] at 2.5s, [ambient] through body
mood: appetite-triggering / desire-creating
duration: 10
hard rules: only [product + minimal props] in frame. Do not invent features not in the input image.
Use for: any product launch / TikTok Shop / DTC ad.
// 05
Unboxing / 360 Spin
Tactile build-up + reveal. Works for premium products.
scene: [closed box / wrapped product] center frame, premium surface, branded environment
hook (0-2s): hands enter frame, single dramatic sound cue (knife slice / tape rip / lid lift)
body (2-6s): slow methodical reveal — wrapping pulls back, product emerges, single 360 rotation
closer (6-10s): final hero shot of product at rest, on-screen text "[product name]" + "[where to buy]"
camera: locked tight on box, follows the reveal motion, ends on macro detail
lighting: dramatic single-source key, deep shadows, premium product feel
color: dark moody with one accent color from the product
audio: silence then sharp open sound, paper rustle, soft ambient through reveal, no music
mood: anticipation → satisfaction
duration: 10
Use for: premium drops, limited releases, gift-able products.
// transitions + craft
// 06
Whip-Pan Transition
Stitch two scenes / two products / two angles into one continuous flow.
scene A: [first composition — the setup]
scene B: [second composition — the payoff]
transition: at 2-second mark, fast horizontal whip-pan from scene A to scene B, motion blur during pan, continuous audio, single take feel
camera A: [static or subtle move]
camera B: [matches A's energy on resolve]
lighting: continuous between scenes, one direction
color: matched grade across both scenes
audio: continuous through whip, slight whoosh on pan, no cut audible
mood: kinetic, premium production value
duration: 6
Use for: before/after, two-product comparisons, location jumps.
// 07
Character Consistency Lock
Stop AI from morphing your character between shots.
Using input image 1 for [character name]'s face/body identity.
[scene + action prompt as normal]
Hard rules:
- Character must match input image 1 exactly: [hair color], [skin tone], [eye color], [build], [defining feature]
- No [list features that AI commonly drifts toward — e.g. "no blonde hair, no pale skin, no blue eyes, no professional makeup"]
- Outfit must match: [exact wardrobe — colors, type, no logos unless specified]
- Same lighting direction as input image
- Same age range as input image
Use for: recurring characters, talking-head series, branded avatars.
// THE 0.5-SECOND RULE
The first 0.5 seconds of audio matter as much as the first frame. Never start with silence unless silence IS the hook. Start with: a sharp sound, a voice mid-sentence, a music cue on a beat, or a bass hit.
Get this right, your watch time doubles. That's the whole game.
More cheat codes drop weekly in the newsletter. Inside the Skool — full system, examples, and live builds.