Treatment Generation

Treatment generation is the creative planning phase. FilmGen turns your project inputs into a detailed scene-by-scene plan before any visual generation begins.

Creative Planning

Planning

Treatment Planning

Uses the audio and project setup to generate a complete video treatment: scene-by-scene descriptions, timing, camera angles, transitions, and lyric mapping. The treatment is shaped by your chosen generation mode (performance, story, advertising, etc.).

Visual Design

Prompt Preparation

Translates the creative treatment into concrete image prompts and video prompts for each scene. Handles style consistency, reference image integration, and lip-sync instructions for performance mode.

What the Treatment Contains

The video treatment is a structured JSON document stored on the project. It includes:

  • Scene breakdown — Number of scenes, each with a section label (intro, verse, chorus, etc.)
  • Timing— Start time, end time, and planned duration for each scene, aligned to the provider's duration buckets
  • Visual description — Detailed scene description with subject, setting, lighting, and mood
  • Camera direction — Camera angle, movement, and framing for each scene
  • Transitions — How scenes connect (cut, dissolve, match cut, etc.)
  • Lyric mapping — Which lyrics appear in each scene (when lyrics/script are provided)

Mode-Aware Planning

The generation mode you selected at setup significantly shapes the treatment:

Performance

Plans around a performer/vocalist. Includes lip-sync cues, delivery style, and stage presence direction.

Story

Plans a narrative arc with characters, settings, and dramatic progression across scenes.

Advertising

Plans product-focused scenes with brand consistency, call-to-action framing, and commercial pacing.

Anime

Plans with anime visual conventions: dynamic poses, expressive reactions, stylized environments.

Short Film

Plans cinematic compositions with atmospheric lighting, minimal dialogue, and visual storytelling.

Documentary

Plans observational scenes with natural lighting, location diversity, and informational composition.

Planner Model Selection

The planner model is the AI backbone used for treatment planning and prompt preparation. You can choose between:

  • Gemini Pro — Strong at structured creative output
  • GPT-5.4 — Excellent at nuanced creative direction
  • Claude Sonnet 4.6 — Fast and cost-effective planning
  • Claude Opus 4.6 — Highest quality reasoning for complex treatments

See Planner Models for detailed comparison and selection guidance.

Art Direction Output

FilmGen produces two prompts per scene:

Image Prompt

Optimized for the image generation model. Describes the scene composition, subjects, lighting, colors, and style in detail.

Video Prompt

Optimized for the video generation model. Describes the motion, camera movement, transitions, and temporal progression. Includes lip-sync cues when in performance mode.

Credits

Treatment generation consumes credits from your subscription. A single treatment generation for one project typically uses 1 credit. The credit check happens before generation begins.