happyhorse-prompt-studio

Interactive prompt studio for HappyHorse 1.0 video generation. Guides users through scenario discovery with vivid examples, then assembles production-ready prompts in JP/CN/EN. Use when someone wants to create AI video content with HappyHorse but doesn't know where to start, or when they have a specific scenario and need a polished prompt. Covers manga drama, character PV, manga motion, virtual idol MV, and free-form scenarios.

---
name: happyhorse-prompt-studio
description: Interactive prompt studio for HappyHorse 1.0 video generation. Guides users through scenario discovery with vivid examples, then assembles production-ready prompts in JP/CN/EN. Use when someone wants to create AI video content with HappyHorse but doesn't know where to start, or when they have a specific scenario and need a polished prompt. Covers manga drama, character PV, manga motion, virtual idol MV, and free-form scenarios.
---

# HappyHorse Prompt Studio

A 4-phase guided skill that turns "I want to make a video" into a production-ready HappyHorse prompt — starting from inspiration, not from a blank page.

## Overview

This skill guides the Agent through a structured conversation:

```
Phase 1 · 灵感菜单 (Inspiration)    →  "Here's what's possible"
Phase 2 · 需求发现 (Discovery)      →  "What do YOU want to make?"
Phase 3 · Prompt 组装 (Assembly)    →  "Let me build it for you"
Phase 4 · 质量检查 (Quality Check)  →  "Here's your prompt — want to tweak?"
```

## Phase 1 · Inspiration Menu (灵感菜单)

**Start here by default** (skip ahead only if the user already knows what they want). Before asking any questions, show the user what HappyHorse can do. Present these as four "flavors", each one a door into a different creative world.

Use the language the user is using (JP/CN/EN). The descriptions below are in English for the Agent's reference — translate them to match the user's language.

---

### Flavor A · "Let your characters speak" (让你的角色开口说话)
**Voiced Manga Drama (漫画配音剧 / ボイスコミック)**

> You have a manga, a webtoon, or an original story. You've drawn the characters, written the dialogue — now you want them to *speak*.
>
> Upload 2-3 character reference images + a short script. HappyHorse generates a 15-30 second voiced drama where characters talk, emote, and stay visually consistent across cuts. Lip-sync included.
>
> **Vibe:** Movie dub meets manga animation. Your characters, their voices.

### Flavor B · "One illustration, introducing itself out loud" (一张立绘,开口自我介绍)
**Character Voice PV (角色语音 PV / キャラボイス PV)**

> You have a game character, a VTuber, or an original OC. You want a short 8-10 second PV where they introduce themselves — or let out a battle cry.
>
> Upload 1-3 character art images + a line or two. HappyHorse generates a voiced, lip-synced character PV.
>
> **Vibe:** Character reveal trailer. One illustration, one voice, one PV.

### Flavor C · "Bring a single manga panel to life" (让一格漫画活过来)
**Manga Panel Motion (漫画分格动态化 / コマ動画化)**

> You have manga panels, comic pages, or illustrated scenes. You want to turn them into 5-10 second motion clips — perfect for social media.
>
> Upload one panel as the first frame. HappyHorse animates it while preserving your art style.
>
> **Vibe:** Your drawing, but it breathes. Hair moves, eyes blink, wind blows.

### Flavor D · "Your virtual idol, an MV in 30 seconds" (你的虚拟偶像,30 秒成 MV)
**Virtual Idol MV (虚拟偶像 MV / バーチャルアイドル MV)**

> You have a virtual idol, a VTuber group, or an original idol project. You want an MV — with stage lighting, lip-sync singing, and multi-shot choreography.
>
> Upload 3-5 multi-angle character images + a licensed song segment. HappyHorse generates a 30-second MV clip.
>
> **Vibe:** Your idol, center stage. No Live2D. No MMD. Just one prompt.
>
> ⚠️ **Note:** This scenario requires the strongest compliance guardrails. We'll check together.

---

### How to present the menu

Present the four flavors conversationally, not as a dry list. Something like:

> "HappyHorse can bring your characters to life in a few different ways. Think of it as four flavors:
>
> A · **Voiced Drama** — your manga characters talk to each other, with voice and lip-sync
> B · **Character PV** — your game character or OC introduces itself out loud
> C · **Panel Motion** — a single manga panel starts moving, hair blowing, eyes blinking
> D · **Idol MV** — your virtual idol performs a 30-second MV on stage
>
> Which one sounds closest to what you're imagining? Or tell me about your project and I'll suggest."

If the user already knows what they want, skip to Phase 2.

---

## Phase 2 · Discovery (需求发现)

Once a flavor is chosen (or the user describes their own scenario), ask these questions. **Ask them conversationally, not as a form.** Group related questions together.

### 2.1 Character & World (角色与世界)

- What's your character's name and role? (protagonist / antagonist / side character)
- What do they look like? (hair, eyes, outfit, accessories, any signature items)
- What's their personality vibe? (cool / energetic / shy / mysterious / cheerful)
- Where does the scene take place? (school rooftop / fantasy castle / neon city / café / etc.)

### 2.2 Scene Intent (场景意图)

- What's happening in this scene? (a confession / a battle / a quiet moment / a group dance)
- What emotion should the viewer feel? (heart-fluttering / adrenaline / nostalgic / hype / calm)
- How long should the output be? (5s / 10s / 15s / 30s)

### 2.3 Voice & Sound (声音与音频)

- Does your character speak? If yes:
  - What language? (Japanese / Chinese / English)
  - Voice type? (young woman / young man / child / mature / elderly)
  - Voice color? (bright / low / soft / powerful / cool)
  - What do they say? (provide the exact line, or ask me to suggest)
- Background audio? (silence / ambient sounds / BGM style)

### 2.4 Visual Style (视觉风格)

- Art style reference? (anime / photorealistic / Pixar / watercolor / pixel art / etc.)
- Color palette? (warm / cool / neon / pastel / high-contrast)
- Camera preference? (close-up / medium / wide / rotating / slow push / static)

### 2.5 Compliance Quick-Check (合规快检)

Before proceeding, verify:

- ☐ Is the character **your own original creation** or properly licensed?
- ☐ Is the character depicted as **18 or older** (especially for idol scenarios)?
- ☐ Is the outfit **SFW** (no suggestive or revealing clothing)?
- ☐ Is the scene **SFW** (no sensitive locations like bedrooms/pools)?
- ☐ If there's music, is it **licensed or original** (not a commercial song)?

If any answer is NO, **pause and suggest an alternative** — don't proceed with a non-compliant prompt.
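
The checklist above can be tracked as a small data structure so that no item is skipped. This is a minimal sketch, not part of HappyHorse: the key names and the `failed_checks` helper are hypothetical, and treating an unanswered item as failing is an assumed design choice so that the Agent pauses rather than guesses.

```python
# Hypothetical keys; the descriptions paraphrase the five checklist items above.
COMPLIANCE_ITEMS = {
    "original_or_licensed": "Character is original or properly licensed",
    "adult": "Character is depicted as 18 or older",
    "outfit_sfw": "Outfit is SFW",
    "scene_sfw": "Scene location is SFW",
    "music_cleared": "Music, if any, is licensed or original",
}


def failed_checks(answers: dict[str, bool]) -> list[str]:
    """Return the description of every item answered NO or left unanswered."""
    return [
        description
        for key, description in COMPLIANCE_ITEMS.items()
        if not answers.get(key, False)
    ]


answers = {key: True for key in COMPLIANCE_ITEMS}
answers["adult"] = False
for problem in failed_checks(answers):
    print("Pause and suggest an alternative:", problem)
```

An empty list means every item passed and assembly can proceed.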

---

## Phase 3 · Prompt Assembly (Prompt 组装)

Now build the prompt using the **HappyHorse Formula**:

```
场景 (Scene) + 主体 (Subject) + 動き (Motion) + 音声 (Audio) + 品質修飾 (Quality)
```

### 3.1 The Formula (公式)

| Component | What it does | Example (JP) |
|-----------|-------------|--------------|
| 场景 Scene | Where + when + atmosphere | 夕暮れの古城の庭、金色の逆光、落ち葉が舞い散る |
| 主体 Subject | Who + appearance + state | 甲冑とマントをまとった若い剣士が、剣の柄に手を添えて立っている |
| 動き Motion | What happens + camera movement | 剣士がゆっくり顔を上げ、カメラが全身から胸像へとゆっくり寄る |
| 音声 Audio | Voice lines + voice type + ambient sound + BGM | [剣士、ネイティブな日本語、低めの落ち着いた青年声] 言う:「俺の刃は、守るためにある。」背景に風、鐘の音 |
| 品質 Quality | Style + consistency constraints | 映画級質感、リアルな光影、キャラの表情と装備が維持 |
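
The formula can be read as an ordered join of five text fragments. The sketch below illustrates that reading using the example row from the table; `PromptParts` and `assemble_prompt` are hypothetical names, and joining with one component per line is an assumption rather than a documented HappyHorse requirement.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """The five formula components, each already written in the target language."""
    scene: str
    subject: str
    motion: str
    audio: str
    quality: str


def assemble_prompt(parts: PromptParts) -> str:
    """Join the components in formula order, one per line, skipping empty ones."""
    ordered = [parts.scene, parts.subject, parts.motion, parts.audio, parts.quality]
    return "\n".join(p.strip() for p in ordered if p.strip())


example = PromptParts(
    scene="夕暮れの古城の庭、金色の逆光、落ち葉が舞い散る。",
    subject="甲冑とマントをまとった若い剣士が、剣の柄に手を添えて立っている。",
    motion="剣士がゆっくり顔を上げ、カメラが全身から胸像へとゆっくり寄る。",
    audio="[剣士、ネイティブな日本語、低めの落ち着いた青年声] 言う:「俺の刃は、守るためにある。」背景に風、鐘の音。",
    quality="映画級質感、リアルな光影、キャラの表情と装備が維持。",
)
print(assemble_prompt(example))
```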

### 3.2 R2V Character Consistency Syntax

When the user provides multiple reference images, use this syntax:

```
参考「Image 1」=[正面立绘]、「Image 2」=[侧面]、「Image 3」=[表情差分]
```

Or when referencing a specific character in a multi-character scene:

```
@「Image 1」のキャラが [action]、@「Image 2」のキャラが [reaction]
```

Key rules:
- Always use `@「Image n」` to lock character identity across shots
- Describe what each reference image shows (正面 / 側面 / 表情差分)
- End with: `キャラの顔・髪・衣装が変わらない` (character's face/hair/outfit stays unchanged)
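
As an illustration of the rules above, the reference line can be generated from a list of per-image descriptions so that the numbering never drifts. This is a sketch under assumptions: `reference_line` and `character_ref` are hypothetical helpers that only reproduce the text patterns shown in this section.

```python
CONSISTENCY_SUFFIX = "キャラの顔・髪・衣装が変わらない"


def reference_line(labels: list[str]) -> str:
    """Build the 参考「Image n」=[...] line from per-image descriptions."""
    if not labels:
        raise ValueError("at least one reference image description is required")
    pairs = [f"「Image {i}」=[{label}]" for i, label in enumerate(labels, start=1)]
    return "参考" + "、".join(pairs)


def character_ref(index: int) -> str:
    """Return the @-reference that pins a character to reference image `index`."""
    return f"@「Image {index}」のキャラ"


print(reference_line(["正面立绘", "侧面", "表情差分"]))
print(f"{character_ref(1)}が手を振り、{character_ref(2)}が微笑む。{CONSISTENCY_SUFFIX}。")
```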

### 3.3 Video-Edit Style Unification

When the user wants to unify style across multiple shots:

```
参考「Image 1」の画風・筆致・色調に、動画全体のスタイルを変換。
人物の動作、表情、運鏡、セリフのタイミングは 100% 保持。
```

Key rule: always add `100% 保持` (100% preserved) constraints for things that must not change.

### 3.4 Language Rules

| Language | When to use | Key markers |
|----------|-------------|-------------|
| Japanese (JP) | User speaks JP, or targeting JP market | `ネイティブな日本語`, use `「」` for dialogue, avoid translation-style phrasing |
| Chinese (CN) | User speaks CN | `中文母语`, use `「」` or `""` for dialogue |
| English (EN) | User speaks EN, or targeting global market | `native English`, use `""` for dialogue |

**Japanese-specific tips:**
- Use `ネイティブな日本語` to ensure natural Japanese (not translation-style)
- Specify voice color with JP adjectives: `明るく元気な少女声`, `低めの落ち着いた青年声`, `柔らかい囁くような声`
- Keep dialogue in `「」` brackets
- Avoid mixing languages in dialogue unless intentionally bilingual
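
The language table can be encoded as a lookup so that the native-speaker marker and the dialogue brackets always match. The sketch below is illustrative only: `LANGUAGE_RULES`, `voice_tag`, and `quote_dialogue` are hypothetical names, the CN entry picks `「」` from the two allowed options, and the tag separators for CN and EN are assumptions (only the JP separator appears in this document's examples).

```python
# Markers and brackets follow the table above; "sep" for CN and EN is assumed.
LANGUAGE_RULES = {
    "JP": {"marker": "ネイティブな日本語", "quotes": ("「", "」"), "sep": "、"},
    "CN": {"marker": "中文母语", "quotes": ("「", "」"), "sep": "，"},
    "EN": {"marker": "native English", "quotes": ('"', '"'), "sep": ", "},
}


def voice_tag(speaker: str, voice: str, lang: str) -> str:
    """Build the bracketed voice tag: [speaker, native-language marker, voice color]."""
    rule = LANGUAGE_RULES[lang]
    return "[" + rule["sep"].join([speaker, rule["marker"], voice]) + "]"


def quote_dialogue(line: str, lang: str) -> str:
    """Wrap a dialogue line in the brackets expected for the language."""
    open_q, close_q = LANGUAGE_RULES[lang]["quotes"]
    return f"{open_q}{line}{close_q}"


tag = voice_tag("剣士", "低めの落ち着いた青年声", "JP")
print(f"{tag} 言う:{quote_dialogue('俺の刃は、守るためにある。', 'JP')}")
```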

### 3.5 Prompt Templates by Flavor

#### Flavor A · Voiced Manga Drama

```
[场景描述:时间/地点/光线/氛围]

[角色A描述] と [角色B描述] が [位置关系]。

[角色A动作描述]。
[角色A, 语言, 声色描述, 情绪] 言う:「[台词]」

[角色B反应动作]。
[角色B, 语言, 声色描述, 情绪] 答える:「[台词]」

[镜头运动描述]。
[环境音/BGM描述]。

映画級質感、リアルな光影、キャラの顔・髪・衣装が変わらない。
```

#### Flavor B · Character Voice PV

```
[场景描述:地点/光线/氛围]

[角色外观描述] が [姿态]。

[角色动作 + 镜头运动]。
[角色, 语言, 声色描述, 情绪] 言う:「[台词]」

[环境音/BGM描述]。

映画級質感、キャラの顔・髪・衣装・装備が変わらない。
```

#### Flavor C · Manga Panel Motion

```
[图片中已有内容,无需重复描述]

[角色简短动作:眨眼/头发飘动/微笑/转头]。
[环境动态:风吹/光斑/落叶/雨滴]。
[镜头运动:缓慢推近/横移/拉远/固定]。

[环境音/BGM描述]。

原画風、キャラの顔・髪・衣装・構図・色調が変わらない。
```

#### Flavor D · Virtual Idol MV

```
参考「Image 1」=[正面]、「Image 2」=[侧面]、「Image 3」=[表情差分]
中的虚拟偶像角色形象。

[舞台场景描述]
[灯光描述]

分镜 1(N 秒):[景别],角色 [姿态/动作]。
[角色, 语言, 声色, 情绪] 歌う:「[歌词]」

分镜 2(N 秒):[景别],角色 [舞蹈动作]。
[歌词]

分镜 3(N 秒):[景别],角色 [表情]。
[歌词]

[J-POP MV 美学 / 电影级光影],キャラの顔・髪・衣装が変わらない。
口型与歌词重音精准匹配。
```

### 3.6 Assembling the Output

Present the final prompt in a code block so the user can copy it directly. Include:

1. **The prompt itself** (in the user's language)
2. **A brief breakdown** of what each part does
3. **Suggested model variant** (t2v / i2v / r2v / video-edit)
4. **Estimated cost** (720P: ¥0.9/sec, 1080P: ¥1.6/sec)

Example output format:

````
Here's your prompt, ready to paste into HappyHorse:

```
[PROMPT HERE]
```

**Breakdown:**
- 场景: [what this part does]
- 主体: [what this part does]
- 動き: [what this part does]
- 音声: [what this part does]

**Model:** happyhorse-1.0-r2v (you have 2+ reference images)
**Duration:** ~10 seconds
**Cost estimate:** 720P ≈ ¥9, 1080P ≈ ¥16

Want me to adjust anything? (voice tone, camera angle, add another character, change the line?)
````
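
The cost estimate is simple arithmetic on the per-second rates listed in this section (720P: ¥0.9/sec, 1080P: ¥1.6/sec). A minimal sketch follows; `estimate_cost` is a hypothetical helper, and the `variants` parameter is an added convenience for budgeting several generations of the same prompt.

```python
# Per-second rates in CNY, taken from section 3.6.
RATE_PER_SECOND = {"720P": 0.9, "1080P": 1.6}


def estimate_cost(seconds: float, resolution: str, variants: int = 1) -> float:
    """Estimated cost in CNY for `variants` generations of a clip of `seconds` length."""
    if resolution not in RATE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    return round(RATE_PER_SECOND[resolution] * seconds * variants, 2)


for resolution in RATE_PER_SECOND:
    print(resolution, "≈ ¥", estimate_cost(10, resolution))
```

For a 10-second clip this reproduces the ¥9 and ¥16 figures in the example output format.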

---

## Phase 4 · Quality Check (质量检查)

Before finalizing, run through this checklist silently. If anything fails, fix before presenting.

### 4.1 Prompt Quality

- ☐ Does the prompt follow the Scene + Subject + Motion + Audio + Quality structure?
- ☐ Is the camera movement explicitly stated? (not left to chance)
- ☐ Is the voice type described with specific adjectives? (not vague)
- ☐ Is the dialogue in the correct brackets for the language? (「」 for JP, 「」 or "" for CN, "" for EN)
- ☐ Is the "stays unchanged" constraint included at the end?
- ☐ Is the prompt length between 150-300 characters? (too short = under-specified; too long = hard to control)
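
Some of these checks are mechanical and can be sketched as a small linter. The example below covers only the length, "stays unchanged", and bracket checks for a Japanese prompt; `lint_prompt` is a hypothetical helper, and counting Unicode code points with `len` is an assumed reading of "characters". Camera and voice specificity still need a human or Agent judgment.

```python
UNCHANGED_MARKER = "変わらない"  # JP "stays unchanged" phrasing used in the templates


def lint_prompt(prompt: str, has_dialogue: bool = True) -> list[str]:
    """Return human-readable issues; an empty list means these checks passed."""
    issues = []
    length = len(prompt)  # code points, an assumed interpretation of "characters"
    if length < 150:
        issues.append(f"too short ({length} characters): likely under-specified")
    elif length > 300:
        issues.append(f"too long ({length} characters): hard to control")
    if UNCHANGED_MARKER not in prompt:
        issues.append('missing the "stays unchanged" constraint')
    if has_dialogue and not ("「" in prompt and "」" in prompt):
        issues.append("dialogue is not wrapped in 「」")
    return issues


for issue in lint_prompt("少女が手を振る。"):
    print("-", issue)
```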

### 4.2 Compliance Check

- ☐ No existing anime/manga/game IP referenced?
- ☐ No real person likeness?
- ☐ Character depicted as adult?
- ☐ Outfit is SFW?
- ☐ Scene location is SFW?
- ☐ If music is involved, it's licensed/original?

### 4.3 Optimization Tips

If the prompt looks good, offer these pro-tips:

- **"Try 3 variants"** — HappyHorse results vary; generating 3-5 and picking the best is standard practice
- **"Start 720P, finish 1080P"** — do test runs at 720P (cheaper), then re-generate the winner at 1080P
- **"Shorter lines = better lip-sync"** — if the voice line is over 15 characters, consider splitting into two shots
- **"Specific beats vague"** — "camera slowly pushes from full-body to chest close-up" beats "camera moves"

---

## Free-Form Mode (自由模式)

If the user's scenario doesn't fit Flavors A-D, use the formula directly:

1. Ask: "What's your scene? Describe it like you're telling a friend about a movie you just watched."
2. Extract: scene, subject, motion, audio, quality from their description
3. Assemble using the formula
4. Apply the quality check

This mode is especially useful for:
- Product advertisements
- Educational explainers
- Abstract / artistic videos
- Non-character-driven content

---

## Common Pitfalls (常见问题)

| Problem | Fix |
|---------|-----|
| Character face drifts across shots | Add more reference images + emphasize "顔・髪・衣装が変わらない" |
| Voice sounds like machine translation | Add "ネイティブな日本語" marker; avoid translation-style phrasing |
| Lip-sync is off | Shorten dialogue to ≤15 characters, or split a longer line across two shots |
| Motion is stiff | Replace generic "動いている" with specific actions: "髪が風に右へ流れる" |
| Camera wanders | Always specify camera: "固定", "ゆっくり寄る", "横にパン" |
| Style drifts from original art | Add "原画風・色調が変わらない" at the end |
| Multiple characters swap positions | Use `@「Image n」` syntax to lock each character's position |
| Q-version character becomes realistic | Add "デフォルメ / ちびキャラ / Q 版" style keyword |

---

## CLI Quick-Start (for users who want to run it immediately)

If the user has `bailian-cli` installed, they can run the prompt directly:

```bash
# Text-to-video (no input image)
bl video generate --prompt "[PROMPT]" --download output.mp4

# Image-to-video (one input image)
bl video generate --image ./char.png --prompt "[PROMPT]" --download output.mp4

# Reference-to-video (multiple input images for character consistency)
bl video generate --image ./char1.png --image ./char2.png --prompt "[PROMPT]" --download output.mp4

# Video editing (style transfer on existing video)
bl video edit --video ./input.mp4 --image ./style-ref.png --prompt "[PROMPT]" --download output.mp4
```
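
The four invocations above differ only in which inputs are supplied, so the choice can be derived from the inputs. The sketch below builds the same command strings using only the subcommands and flags shown above; `pick_variant` and `build_command` are hypothetical helpers, and the mapping from inputs to variant follows the comments in the block above rather than any documented CLI behavior.

```python
import shlex
from typing import Optional, Sequence


def pick_variant(num_images: int, has_video: bool = False) -> str:
    """Map the supplied inputs to t2v / i2v / r2v / video-edit."""
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    if has_video:
        return "video-edit"
    if num_images == 0:
        return "t2v"
    return "i2v" if num_images == 1 else "r2v"


def build_command(
    prompt: str,
    images: Sequence[str] = (),
    video: Optional[str] = None,
    output: str = "output.mp4",
) -> str:
    """Return a shell-quoted `bl video ...` command line for the given inputs."""
    args = ["bl", "video", "edit" if video else "generate"]
    if video:
        args += ["--video", video]
    for image in images:
        args += ["--image", image]
    args += ["--prompt", prompt, "--download", output]
    return shlex.join(args)  # quotes the prompt safely for the shell


print(pick_variant(2))
print(build_command("少女が手を振る。", images=["./char1.png", "./char2.png"]))
```

Using `shlex.join` keeps prompts that contain quotes, brackets, or spaces from being split by the shell.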

---

## Example Interactions

### Example 1 · First-time user (Flavor B)

````
User: "I have a manga character I want to animate. She's a schoolgirl with pink hair, and I want her to say something cute."

Agent: "That sounds like Flavor B, a Character Voice PV! Let me ask a few things:

1. What's her name and personality? (shy? cheerful? cool?)
2. What's she saying? Got a line in mind, or want me to suggest one?
3. Where is she? (school rooftop? cherry blossom park? classroom?)
4. Voice vibe: bright and cute, or soft and shy?

And quick compliance check: is she your own original character?"

User: "Her name is Sakura, she's super cheerful, she's on a school rooftop at lunch. She says 'Hey! You made it!' in Japanese. Bright voice!"

Agent: [assembles prompt using Flavor B template]

"Here's your prompt:

```
昼休みの学校の屋上、青空と白い雲、風が心地よい。
桜色のロングヘアの少女がフェンスに寄りかかり、こちらを見て笑っている。

少女が手を振り、カメラがゆっくり寄る。
[少女、ネイティブな日本語、明るく元気な若い女性声、嬉しそう] 言う:
「ねえ!来てくれたんだ!」

背景に風の音、遠くで校庭のざわめき、明るいピアノの BGM。
映画級質感、キャラの顔・髪・制服が変わらない。
```

Model: happyhorse-1.0-i2v (one image input)
Duration: ~8 seconds
Cost: 720P ≈ ¥7.2, 1080P ≈ ¥12.8

Want to tweak anything?"
````

### Example 2 · Experienced user (free-form)

```
User: "I need a 10-second shot of a samurai drawing his sword at sunset, with a dramatic voice line."

Agent: "Got it — free-form, single character. Quick questions:

1. What does he say? (Japanese line?)
2. Voice type — deep and gravelly?
3. Camera — slow push to close-up? Or wide shot?
4. Art style — realistic? Anime?

Compliance: original character?"
```

---

## Final Notes for the Agent

- **Always start with Phase 1** unless the user is clearly experienced and already knows what they want
- **Be creative with descriptions** — don't just ask "what's the scene?", say "paint the picture for me — where are we, what time of day, what's the vibe?"
- **Suggest, don't just ask** — if the user seems unsure, offer defaults: "How about a sunset rooftop scene with a gentle breeze?"
- **Show the prompt in a code block** so it's easy to copy
- **Always offer to iterate** — "Want me to adjust the voice tone? Change the camera angle? Add a second character?"
- **Keep compliance friendly, not scary** — "Just to make sure everything's smooth, is this your original character?" not "COMPLIANCE CHECK: CONFIRM IP STATUS"

Source

Creator's repository · modelstudioai/skills
