|
---
name: image-portrait
version: 1.0.0
description: |
Identity-consistent portrait generation: professional headshots, dating photos, style transfers, themed portraits, photo series, avatars, ID photos.
Use when generating styled portraits from a reference photo (e.g. professional headshot, anime avatar, cyberpunk portrait, travel photo, dating profile photo, ID photo).
metadata:
starchild:
emoji: "πΈ"
skillKey: image-portrait
requires:
env: [FAL_KEY]
user-invocable: true
disable-model-invocation: false
---
# image-portrait
Use this skill for **all identity-consistent portrait generation requests** on Starchild.
Covers: professional headshots, dating/social photos, artistic style transfers, themed/holiday portraits, photo series, digital avatars, children/family photos, ID/passport photos.
**Core principle:** call the provided script. Do not re-implement proxy/billing plumbing.
---
## 1. Quick start β single portrait (most common)
```python
exec(open('skills/image-portrait/generate_portrait.py').read())
result = generate_portrait(
image_path="path/to/user/photo.jpg",
style="professional",
)
# result -> {"success": True, "images": [{"local_path": "output/images/..."}], ...}
```
The script reads the local file, base64-encodes it, and sends it to fal.ai as a data URI β no manual URL publishing needed.
## 2. Quick start β public URL
```python
exec(open('skills/image-portrait/generate_portrait.py').read())
result = generate_portrait(
face_image_url="https://example.com/photo.jpg",
style="anime",
)
```
## 3. Quick start β text-to-image (no reference photo)
```python
exec(open('skills/image-portrait/generate_portrait.py').read())
result = generate_portrait(
prompt="a young woman in cyberpunk armor, neon city background, rain",
model="nanopro",
)
```
When no `image_path` or `face_image_url` is provided, the script uses the text-to-image endpoint (no `/edit` suffix).
### Delivering the result to the user β IMPORTANT
**Never hand the user the raw fal.media URL.** fal serves files with restrictive CSP headers. The only reliable delivery path is the **already-downloaded local file**:
1. Use each image's `local_path` (e.g. `output/images/xxx.png`) β the script always downloads on success.
2. Tell the user the files are saved to `output/images/` and viewable in the workspace file panel.
3. On Web channel, embed inline so the user can preview in chat:
```markdown

```
4. On Telegram / WeChat: send via `send_to_telegram(file_path="output/images/...", message_type="image")` or `send_to_wechat(file_path="output/images/...", message_type="image")`.
---
## 4. Parameters
| Parameter | Required | Default | Description |
|-----------|----------|---------|-------------|
| `image_path` | no | β | Local workspace file path to the user's face photo |
| `face_image_url` | no | β | Public HTTPS URL of the user's face photo |
| `style` | no | `"professional"` | Preset style key (see Β§5) |
| `scene` | no | `None` | Custom scene description (appended to style prompt) |
| `prompt` | no | `None` | Fully custom prompt β overrides style+scene when set |
| `model` | no | `"nanopro"` | Model: `"nano2"` (fastest ~15s), `"nanopro"` (balanced ~25s, default), or `"gpt"` (best quality ~150s) |
| `count` | no | `1` | Number of images to generate (1β8) |
| `aspect_ratio` | no | `"1:1"` | Output ratio: `1:1`, `3:4`, `4:3`, `9:16`, `16:9` |
**Image input rules:**
- Provide `image_path` OR `face_image_url` for identity-consistent generation (edit mode).
- If both are given, `image_path` takes priority.
- Omit both for pure text-to-image generation (generate mode).
**Prompt priority:** `prompt` > `style + scene` > `style` > default (`professional`).
---
## 5. Style presets
### A: Identity-consistent character styles
| Style | Key | Best for |
|-------|-----|----------|
| Professional headshot | `professional` | LinkedIn, resume, corporate |
| Artistic portrait | `artistic` | Creative portfolio, gallery |
| Anime | `anime` | Social media, fun avatar |
| Cyberpunk | `cyberpunk` | Gaming profile, sci-fi fan |
| Oil painting | `oil_painting` | Art gift, classical look |
| Watercolor | `watercolor` | Soft artistic portrait |
| Vintage | `vintage` | Retro aesthetic, nostalgia |
| Casual lifestyle | `casual` | Social media, personal blog |
### B: Personal showcase / dating / social
| Style | Key | Best for |
|-------|-----|----------|
| Dating β cafe | `dating_cafe` | Dating app, warm vibe |
| Dating β beach | `dating_beach` | Dating app, summer vibe |
| Dating β city | `dating_city` | Dating app, urban vibe |
| Dating β restaurant | `dating_restaurant` | Dating app, elegant vibe |
| Travel β Europe | `travel_europe` | Travel blog, social media |
| Travel β Japan | `travel_japan` | Travel blog, cultural |
| Travel β tropical | `travel_tropical` | Vacation, resort |
| Sports β gym | `sports_gym` | Fitness profile |
| Sports β running | `sports_running` | Athletic profile |
| Social media | `social_media` | Instagram, TikTok |
| LinkedIn | `linkedin` | Professional networking |
| Personal brand | `personal_brand` | Entrepreneur, creator |
### D: Themed / scene portraits
| Style | Key | Best for |
|-------|-----|----------|
| Christmas | `christmas` | Holiday greeting, social |
| Halloween | `halloween` | Holiday fun |
| Graduation | `graduation` | Milestone celebration |
| Wedding | `wedding` | Wedding planning, save-the-date |
| Business speech | `business_speech` | Speaker profile |
| Musician | `musician` | Music promotion |
| Chef | `chef` | Food blog, restaurant |
| Outdoor adventure | `outdoor_adventure` | Adventure blog |
| Pet together | `pet_together` | Pet lover profile |
| Reading | `reading` | Book club, literary |
| Night city | `night_city` | Urban lifestyle |
| Hanfu (Chinese traditional) | `hanfu` | Cultural, cosplay |
### O: Digital avatar
| Style | Key | Best for |
|-------|-----|----------|
| 3D cartoon | `avatar_3d` | Social avatar, Pixar style |
| Gaming avatar | `avatar_gaming` | Game profile, RPG |
| VTuber | `avatar_vtuber` | Streaming, VTuber |
### T: Children & family
| Style | Key | Best for |
|-------|-----|----------|
| Child portrait | `child_portrait` | Family keepsake |
| Family photo | `family_photo` | Family portrait |
### U: ID / passport photos
| Style | Key | Best for |
|-------|-----|----------|
| ID photo (white bg) | `id_photo_white` | Passport, driver's license |
| ID photo (blue bg) | `id_photo_blue` | Visa, work permit |
---
## 6. Model selection guide
| Model | Key | Speed | Quality | Best for |
|-------|-----|-------|---------|----------|
| Nano Banana 2 | `nano2` | ~15s | Good | Quick drafts, fast iteration, bulk generation. |
| NanoPro | `nanopro` | ~25s | Better | Default for all requests. Balanced speed and quality. |
| GPT Image 2 | `gpt` | ~150s | Best | When user explicitly asks for "highest quality" or "best quality". Complex scenes. |
**Decision rules:**
1. **Default:** always use `nanopro` unless the user explicitly requests otherwise.
2. **Use `nano2` when:** user wants fastest results, is iterating on styles, generating many images, or says "quick", "draft", "fast".
3. **Use `gpt` when:** user says "highest quality", "best quality", "premium", or the scene is very complex with many specific details.
```python
# Default (fast)
result = generate_portrait(image_path="photo.jpg", style="anime")
# High quality (user requested)
result = generate_portrait(image_path="photo.jpg", style="anime", model="gpt")
```
---
## 7. Custom scene examples
```python
# Style + custom scene
result = generate_portrait(
image_path="uploads/my_photo.jpg",
style="professional",
scene="in a modern office with city skyline view",
)
# Custom scene only (defaults to professional style base)
result = generate_portrait(
image_path="uploads/my_photo.jpg",
scene="standing on a beach at sunset, golden hour lighting",
)
# Fully custom prompt (overrides everything)
result = generate_portrait(
image_path="uploads/my_photo.jpg",
prompt="portrait of a person as a medieval knight, full plate armor, castle background, dramatic lighting, oil painting style",
)
# Different aspect ratio
result = generate_portrait(
image_path="uploads/my_photo.jpg",
style="cyberpunk",
aspect_ratio="9:16",
)
# Multiple images
result = generate_portrait(
image_path="uploads/my_photo.jpg",
style="dating_cafe",
count=4,
)
```
---
## 8. Prompt engineering best practices
When the user's request doesn't match any preset style, or when you need to construct a custom `prompt`, follow these guidelines (derived from reference skills: ai-headshot-generation, ai-avatar-generation, style-transfer, portrait-enhancement, character-design-sheet, avatar-portrait, nano-banana-pro, pet-portrait-generation).
### Automatic likeness preservation
When a reference image is provided (edit mode), the script **automatically prepends** a likeness preservation instruction to every prompt. This ensures the generated portrait preserves the subject's facial identity. You do NOT need to add likeness instructions manually β the script handles it.
Exception: avatar styles (`avatar_3d`, `avatar_gaming`, `avatar_vtuber`) skip the likeness prefix because stylization takes priority over photographic likeness.
### The 7-element prompt structure
Every effective portrait prompt should include these elements (from nano-banana-pro skill):
```
[subject], [outfit/attire], [pose/action], [expression], [background/setting], [lighting], [style/quality modifiers]
```
### Key principles
1. **Likeness vs. style balance** (from avatar-portrait skill):
- Too photorealistic = ignores requested style
- Too stylized = loses resemblance to source person
- For stylized portraits: emphasize "stylized but maintains individual features"
- For photorealistic: emphasize "keep facial features recognizable"
2. **Lighting is critical** β always specify lighting type:
- Studio: "soft diffused studio lighting", "Rembrandt chiaroscuro lighting"
- Natural: "golden hour warm light", "dappled sunlight through trees"
- Dramatic: "dramatic rim lighting", "volumetric light beams", "neon glow"
- Flat: "even flat lighting with no shadows" (for ID photos)
3. **Background specificity** β vague backgrounds produce poor results:
- β "nice background"
- β
"blurred modern office with glass windows and city view"
- β
"clean neutral gray gradient studio background"
- β
"background style should match the character style" (for avatars)
4. **Lens/camera hints** β help the model understand framing:
- "85mm lens look, shallow depth of field" (portrait)
- "head and shoulders framing" (headshot)
- "full body, clean white background" (character design)
- "close-up face, portrait orientation" (expression/avatar)
5. **Quality anchors** β add style quality references:
- "professional photography quality", "magazine cover quality"
- "National Geographic photography style" (adventure)
- "League of Legends splash art style" (gaming)
- "Pixar and Disney animation style" (3D avatar)
- "Studio Ghibli inspired" (anime)
- "fine art watercolor painting look" (watercolor)
6. **Texture and material** β for artistic styles, specify medium:
- "visible impasto brushstrokes, canvas texture" (oil painting)
- "loose expressive watercolor style, soft edges, beautiful color bleeds and washes" (watercolor)
- "natural film grain, Kodak Portra emulation" (vintage)
- "cel-shaded, clean line art, bold outlines" (anime)
- "visible pixels but NOT a pixelated photo filter" (pixel art)
7. **Expression guidance** β be specific about mood:
- β "smiling"
- β
"warm genuine smile, confident approachable expression"
- β
"neutral calm expression with mouth closed" (ID photo)
- β
"passionate expression, energetic" (musician)
### Example: building a custom prompt
User: "I want a photo of me as a wizard in a magical forest"
```python
result = generate_portrait(
image_path="uploads/photo.jpg",
prompt=(
"fantasy wizard portrait, wearing mystical purple robes with glowing runes, "
"ancient wooden staff with crystal orb, wise powerful expression, "
"enchanted forest background with bioluminescent plants and floating particles, "
"dramatic magical lighting with ethereal glow, "
"high fantasy art style, detailed digital painting quality"
),
)
# Note: likeness prefix is auto-added because image_path is provided
```
### Example: pixel art avatar (from avatar-portrait skill)
User: "Make me a retro pixel art avatar"
```python
result = generate_portrait(
image_path="uploads/photo.jpg",
prompt=(
"retro 16-bit pixel art portrait, visible pixels with clean lines, "
"rich colors, consistent shading, stylized but maintains individual features, "
"warm sunset cityscape background in matching pixel art style, "
"head and shoulders, square format"
),
)
```
---
## 9. Photo series
Generate a coordinated set of themed portraits in one call. Pass a custom list of styles/scenes β the agent assembles the list based on the user's request.
```python
exec(open('skills/image-portrait/generate_portrait.py').read())
result = generate_series(
image_path="uploads/my_photo.jpg",
series=[
{"style": "professional"},
{"style": "casual", "scene": "at a rooftop bar, sunset"},
{"style": "anime"},
{"prompt": "portrait as a superhero, cape flowing, city skyline"},
],
)
# result -> {"success": True, "images": [...4 images...], "series": "custom"}
```
Each item in the list is a dict with optional keys:
- `style` β any style key from Β§7 (e.g. `"professional"`, `"anime"`, `"cyberpunk"`)
- `scene` β override the scene description (combined with the style template)
- `prompt` β fully custom prompt (ignores style/scene)
---
## 10. Intent recognition guide
Use this table to map user requests to the correct style/parameters:
| User says | Style | Notes |
|-----------|-------|-------|
| "professional photo", "headshot", "LinkedIn photo" | `professional` or `linkedin` | |
| "dating photo", "dating app", "Tinder photo" | `dating_cafe` / `dating_beach` / `dating_city` | Ask which vibe |
| "anime me", "anime version", "cartoon me" | `anime` | |
| "cyberpunk", "sci-fi portrait" | `cyberpunk` | |
| "oil painting", "classical portrait" | `oil_painting` | |
| "watercolor portrait" | `watercolor` | |
| "vintage photo", "retro" | `vintage` | |
| "casual photo", "lifestyle" | `casual` | |
| "travel photo in Paris/Europe" | `travel_europe` | |
| "travel photo in Japan/Tokyo/Kyoto" | `travel_japan` | |
| "beach photo", "tropical" | `travel_tropical` or `dating_beach` | |
| "gym photo", "fitness" | `sports_gym` | |
| "Christmas photo" | `christmas` | |
| "Halloween photo" | `halloween` | |
| "graduation photo" | `graduation` | |
| "wedding photo" | `wedding` | |
| "chef photo", "cooking" | `chef` | |
| "musician", "on stage" | `musician` | |
| "with my dog/pet" | `pet_together` | |
| "reading", "bookish" | `reading` | |
| "night city", "urban night" | `night_city` | |
| "hanfu", "Chinese traditional" | `hanfu` | |
| "3D avatar", "Pixar style" | `avatar_3d` | |
| "gaming avatar", "RPG character" | `avatar_gaming` | |
| "VTuber avatar" | `avatar_vtuber` | |
| "kid photo", "children's portrait" | `child_portrait` | |
| "family photo" | `family_photo` | |
| "passport photo", "ID photo" | `id_photo_white` | White bg default |
| "visa photo" | `id_photo_blue` | Blue bg |
| "photo series", "set of photos" | Use `generate_series()` | Assemble custom list from styles |
| "highest quality", "best quality" | Any style + `model="gpt"` | |
| Custom scene not in presets | Use `scene=` or `prompt=` | |
---
## 11. Provided scripts
| File | Purpose |
|------|---------|
| `generate_portrait.py` | Core script: submit β poll β download. Handles local files (base64) and URLs, all styles, custom scenes, three models (nano2/nanopro/gpt). |
| `exports.py` | Re-exports `generate_portrait`, `generate_series`, `STYLE_PROMPTS` for programmatic use by other skills. |
| `_cost_track.py` | Cost tracking helper β records per-call costs via sc-proxy headers. |
---
## 12. Local testing
Set `FAL_KEY` env var to call fal.ai directly (bypasses sc-proxy):
```bash
# Single portrait
FAL_KEY=your-fal-key python3 skills/image-portrait/generate_portrait.py photo.jpg anime 1 nanopro
# Args: <image_path_or_url> [style] [count] [model]
```
---
## 13. Troubleshooting
| Problem | Fix |
|---------|-----|
| `File not found: ...` | Check the workspace path; the file must exist |
| `Unsupported image format` | Use `.jpg`, `.jpeg`, `.png`, `.webp`, or `.bmp` |
| `Image too large` | Resize to under 10 MB before uploading |
| `face_image_url must be a public HTTP(S) URL` | Use `image_path` for local files, or provide a valid `https://` URL |
| `HTTP 402 insufficient_credits` | Top up balance; cost is pre-charged on submit |
| `HTTP 403 endpoint_not_allowed` | sc-proxy only allows approved fal endpoints; contact admin |
| Generation `FAILED` upstream | Simplify prompt, ensure face photo is clear and well-lit, retry |
| Job stuck `IN_PROGRESS` >10 min | Save `request_id`, retry later |
| Poor face consistency | Use a clear, front-facing photo with good lighting; avoid group photos |
| `gpt` model too slow | Switch to `nanopro` (default) for faster results |
---
## 14. Infrastructure (reference)
- Caller β `sc-proxy` β `queue.fal.run/{model}` β fal model providers
- All requests must include `Authorization: Key fake-falai-key-12345` (proxy injects the real `FAL_KEY`)
- Pre-charge happens at submit. Poll/result calls are free.
- Local files are base64-encoded as data URIs β no separate upload step needed.
- Final images live at `https://*.fal.media/...` β public CDN, no auth needed for download.
- Cost tracking via `_cost_track.py` β records `X-Credits-Used` from sc-proxy response headers.
### Model endpoints
| Model | Edit (with ref image) | Generate (text only) |
|-------|----------------------|---------------------|
| nano2 | `fal-ai/nano-banana-2/edit` | `fal-ai/nano-banana-2` |
| nanopro | `fal-ai/nano-banana-pro/edit` | `fal-ai/nano-banana-pro` |
| gpt | `openai/gpt-image-2/edit` | `openai/gpt-image-2` |
---
Creator's repository Β· starchild-ai-agent/official-skills