DreamAPI Skill

25 AI-powered tools for video generation, talking avatars, image editing, voice cloning, and more — powered by DreamAPI. Describe what you want and the agent...

installs

stars

karma

SkillRank score ↗

7.9/ 10

evaluated by implexa, claude-haiku-4-5 · 2026-05-29

dreamapi-skill wraps 24 ai media tools (avatars, image/video generation and editing, voice cloning, tts, translation) via python scripts with explicit execution rules, timeout recovery patterns, and time estimates per task type.

structure

9.0

trigger phrases

8.0

procedure

8.0

edge cases

7.0

documentation

8.0

strengths

view original SKILL.md from clawhubclick to expand

---
name: dreamapi-skill
description: "25 AI-powered tools for video generation, talking avatars, image editing, voice cloning, and more — powered by DreamAPI. Describe what you want and the agent handles the rest."
metadata:
  tags: dreamapi, avatar, lipsync, video, image, voice, tts, flux, wan2.1, ai, api, text2image, image2video, face-swap, remove-bg, video-translate, voice-clone
  requires:
    bins: [python3]
  primaryEnv: DREAMAPI_API_KEY
---

# DreamAPI Skill

> 25 AI tools powered by [DreamAPI](https://api.newportai.com/) — from Newport AI.

## Execution Rule

> **Always use the Python scripts in `scripts/`. Do NOT use `curl` or direct HTTP calls.**

## User-Facing Reply Rules

> **Every user-facing reply MUST follow ALL rules below.**

1. **Keep replies short** — give the result or next step directly.
2. **Use plain language** — no API jargon, no terminal references, no mentions of environment variables, polling, JSON, scripts, or auth flow.
3. **Never mention terminal details** — do not reference command output, logs, exit codes, file paths, config files, or any technical internals.
4. **Always send the login link directly** — when login is needed, provide the DreamAPI Dashboard link: `https://api.newportai.com/`
5. **Explain errors simply** — if a task fails, tell the user in one sentence what happened and ask if they want to retry.
6. **Be result-oriented** — after task completion, give the user the result (link, image, video) directly. Do not describe intermediate steps.
7. **Give time estimates** — after submitting a task, tell the user the estimated wait time from the table below.

**Estimated Generation Time**

| Task Type | Estimated Time |
|-----------|---------------|
| Avatar (LipSync / DreamAvatar / Dreamact) | ~2–5 min |
| Image Generation (Flux) | ~30s–1 min |
| Image Editing (Colorize / Enhance / etc.) | ~30s–1 min |
| Video Generation (Wan2.1) | ~3–5 min |
| Video Editing (Swap Face / Matting) | ~2–5 min |
| Video Translate | ~3–5 min |
| Voice Clone | ~30s–1 min |
| TTS (Common / Pro / Clone) | ~10–30s |
| Remove Background | ~10–30s |

**Required login message template**

When authentication is needed, send the user this message (match user's language):

```text
To get started, you need a DreamAPI API key.

1. Go to: https://api.newportai.com/
2. Sign in with Google or GitHub
3. Copy your API key from the Dashboard

Once you have your key, just tell me and I'll set it up for you.
```

中文模板：

```text
开始之前，你需要一个 DreamAPI 的 API Key。

1. 打开 https://api.newportai.com/
2. 用 Google 或 GitHub 登录
3. 在 Dashboard 页面复制你的 API Key

拿到 Key 后告诉我，我帮你设置好。
```

## Prerequisites

- **Python 3.8+**
- Authenticated — see [references/auth.md](references/auth.md)
- Credits available — see [references/user.md](references/user.md)

```bash
pip install -r {baseDir}/scripts/requirements.txt
```

## Agent Workflow Rules

> **These rules apply to ALL generation modules.**

1. **Always start with `run`** — it submits the task and polls automatically until done.
2. **Do NOT ask the user to check the task status themselves.** The agent polls until completion.
3. **Only use `query`** when `run` has already timed out and you have a `taskId` to resume.
4. **If `query` also times out**, increase `--timeout` and try again with the same `taskId`.
5. **Do not resubmit** unless the task has actually failed.

```
Decision tree:
  → New request?           use `run`
  → run timed out?         use `query --task-id <id>`
  → query timed out?       use `query --task-id <id> --timeout 1200`
  → task status=fail?      resubmit with `run`
```

**Task Status Codes:**

| Code | Status | Description |
|------|--------|-------------|
| 0-2 | Processing | Task is queued or running |
| 3 | Success | Task completed |
| 4 | Failed | Task failed |

## Modules

| Module | Script | Reference | Description |
|--------|--------|-----------|-------------|
| Auth | `scripts/auth.py` | [auth.md](references/auth.md) | API key management — login, status, logout |
| Avatar | `scripts/avatar.py` | [avatar.md](references/avatar.md) | LipSync, LipSync 2.0, DreamAvatar 3.0 Fast, Dreamact |
| Image Gen | `scripts/image_gen.py` | [image_gen.md](references/image_gen.md) | Flux Text-to-Image, Flux Image-to-Image |
| Image Edit | `scripts/image_edit.py` | [image_edit.md](references/image_edit.md) | Colorize, Enhance, Outpainting, Inpainting, Swap Face, Remove BG |
| Video Gen | `scripts/video_gen.py` | [video_gen.md](references/video_gen.md) | Text-to-Video, Image-to-Video, Head-Tail-to-Video (Wan2.1) |
| Video Edit | `scripts/video_edit.py` | [video_edit.md](references/video_edit.md) | Swap Face Video, Video Matting, Composite |
| Video Translate | `scripts/video_translate.py` | [video_translate.md](references/video_translate.md) | Video Translate 2.0 (en/zh/es) |
| Voice | `scripts/voice.py` | [voice.md](references/voice.md) | Voice Clone, TTS Clone, TTS Common, TTS Pro, Voice List |
| User | `scripts/user.py` | [user.md](references/user.md) | Credit balance |

> **Read individual reference docs for usage, options, and examples.**
> Local files (image/audio/video) are auto-uploaded when passed as arguments.

## Tool Selection Guide

```
What does the user need?
│
├─ A talking face synced to audio?
│  ├─ Has a video + audio → avatar.py lipsync / lipsync2
│  └─ Has a photo + audio → avatar.py dreamavatar
│
├─ A character performing actions from a driving video?
│  → avatar.py dreamact
│
├─ Generate an image from text?
│  → image_gen.py text2image
│
├─ Transform an existing image?
│  → image_gen.py image2image
│
├─ Edit an image?
│  ├─ Colorize B&W photo → image_edit.py colorize
│  ├─ Enhance quality → image_edit.py enhance
│  ├─ Extend borders → image_edit.py outpainting
│  ├─ Fill/replace region → image_edit.py inpainting
│  ├─ Replace face → image_edit.py swap-face
│  └─ Remove background → image_edit.py remove-bg
│
├─ Generate a video from text?
│  → video_gen.py text2video
│
├─ Animate an image into video?
│  → video_gen.py image2video
│
├─ Create transition between two frames?
│  → video_gen.py head-tail
│
├─ Edit a video?
│  ├─ Replace face → video_edit.py swap-face
│  ├─ Remove background → video_edit.py matting
│  └─ Replace background → video_edit.py matting + composite
│
├─ Translate video speech?
│  → video_translate.py
│
├─ Text-to-speech?
│  ├─ With cloned voice → voice.py clone + tts-clone
│  ├─ Standard quality → voice.py tts-common
│  └─ Premium quality → voice.py tts-pro
│
├─ Browse available voices?
│  → voice.py list
│
├─ Check credit balance?
│  → user.py credit
│
└─ Outside capabilities?
   → Tell user this isn't supported yet
```

## Quick Reference

| User says... | Script & Command |
|-------------|-----------------|
| "Make a talking face video with this audio" | `avatar.py lipsync run` |
| "Generate an avatar from this photo and audio" | `avatar.py dreamavatar run` |
| "Make this character do the dance in this video" | `avatar.py dreamact run` |
| "Generate an image of..." | `image_gen.py text2image run` |
| "Modify this image to..." | `image_gen.py image2image run` |
| "Colorize this old photo" | `image_edit.py colorize run` |
| "Enhance this blurry image" | `image_edit.py enhance run` |
| "Extend this image" | `image_edit.py outpainting run` |
| "Fill in this area of the image" | `image_edit.py inpainting run` |
| "Swap the face in this photo" | `image_edit.py swap-face run` |
| "Remove the background" | `image_edit.py remove-bg run` |
| "Generate a video about..." | `video_gen.py text2video run` |
| "Animate this image into a video" | `video_gen.py image2video run` |
| "Create a transition between these two images" | `video_gen.py head-tail run` |
| "Swap the face in this video" | `video_edit.py swap-face run` |
| "Remove the video background" | `video_edit.py matting run` |
| "Replace the video background with..." | `video_edit.py matting run` + `composite run` |
| "Translate this video to Chinese" | `video_translate.py run` |
| "Clone this voice" | `voice.py clone run` |
| "Read this text with the cloned voice" | `voice.py tts-clone run` |
| "Convert this text to speech" | `voice.py tts-common run` or `tts-pro run` |
| "What voices are available?" | `voice.py list` |
| "How many credits do I have?" | `user.py credit` |

## Agent Behavior Protocol

### During Execution

1. **Local files auto-upload** — scripts detect local paths and upload via DreamAPI Storage automatically
2. **Parallelize independent tasks** — independent generation tasks can run concurrently via `submit`
3. **Keep consistency** — when generating multiple related outputs, use consistent parameters

### After Execution

Show the result URL first, then key metadata. Keep it clean.

**Result template:**

```text
[type emoji] [task type] complete

Result: <OUTPUT_URL>
• [key metadata]

Not happy with the result? Let me know and I'll adjust.
```

### Error Handling

See [references/error_handling.md](references/error_handling.md) for error codes and recovery.

## Capability Boundaries

| Category | Tools | Count |
|----------|-------|-------|
| Avatar | LipSync, LipSync 2.0, DreamAvatar 3.0 Fast, Dreamact | 4 |
| Image Generation | Flux Text-to-Image, Flux Image-to-Image | 2 |
| Image Editing | Colorize, Enhance, Outpainting, Inpainting, Swap Face, Remove BG | 6 |
| Video Generation | Text-to-Video, Image-to-Video, Head-Tail-to-Video | 3 |
| Video Editing | Swap Face Video, Video Matting, Composite | 3 |
| Video Translate | Video Translate 2.0 | 1 |
| Voice | Voice Clone, TTS Clone, TTS Common, TTS Pro, Voice List | 5 |
| **Total** | | **24** |

> **Never promise capabilities that don't exist as modules.**

don't have the plugin yet? install it then click "run inline in claude" again.

added explicit 8-step procedure with inputs/outputs per step, expanded decision tree with full tool routing logic, clarified timeout recovery path, documented dreamapi external connection with auth/rate limits/file upload details, added edge cases (auth expiry, zero credits, network timeout, empty results), specified output json contracts for all response types, and enforced user-facing reply rules with examples.

DreamAPI Skill

25 AI tools powered by DreamAPI , from Newport AI.

intent

this skill wraps 25 DreamAPI tools across avatars, image generation, image editing, video generation, video editing, video translation, and voice cloning. use it whenever a user needs to generate or edit images, videos, or audio synthetically. the skill auto-handles polling, file uploads, and auth flow. expect 30 seconds to 5 minutes per task depending on type.

inputs

required environment

DREAMAPI_API_KEY: your DreamAPI account API key from https://api.newportai.com/. get it by signing in with google or github and copying from the dashboard.

python runtime

python 3.8 or higher
install dependencies: pip install -r {baseDir}/scripts/requirements.txt

external connection: dreamapi

service: DreamAPI (Newport AI)
auth method: API key (bearer token)
endpoints: https://api.newportai.com/ (portal), https://api.dreamapi.ai/ (api)
setup: env var DREAMAPI_API_KEY must be set before running any script. scripts will check auth status automatically.
rate limits: per-account credits system. each task consumes credits based on type. check balance with scripts/user.py credit.
file uploads: local files (images, videos, audio) auto-upload to DreamAPI storage when passed as script arguments. no manual upload needed.

external context (optional)

task ID from a previous timed-out run (for resume with query)
reference docs in references/ folder (auth.md, avatar.md, image_gen.md, etc.)

procedure

step 1: authenticate (once per session)

input: none (first run only)

run python3 scripts/auth.py status to check if API key is already set
if not set, show user the login message (see decision points below)
once user provides API key, set env var: export DREAMAPI_API_KEY=<key>
verify with python3 scripts/auth.py status again

output: authenticated session; DREAMAPI_API_KEY env var loaded

step 2: select tool by user intent (route to correct script)

input: user description of what they want (e.g. "make a talking face from this photo and audio")

match user request against tool selection guide (see decision points for full tree)
identify which module (avatar, image_gen, image_edit, video_gen, video_edit, video_translate, voice, user)
identify which command (e.g. lipsync, text2image, colorize, text2video, etc.)

output: selected script path and command name

step 3: submit task with run (new request)

input: script, command, user parameters (text prompt, file paths, voice settings, etc.)

assemble command: python3 scripts/{module}.py {command} run [options]
pass file paths directly (images, videos, audio). scripts auto-detect and upload
include user parameters (prompt, voice id, quality, language, etc.)
execute and wait for polling to complete (runs in foreground by default)

output: task ID, status, result URL (if complete within default timeout of 600 seconds)

step 4: handle timeout (if run exceeds 600s)

input: task ID from step 3, user's original request

if polling times out before completion, save the task ID
inform user of estimated wait time from table below
offer to resume with query (step 5) when user is ready

output: task ID for later query; message to user with next steps

step 5: resume with query (timeout recovery)

input: task ID from a previous run, original request context

run python3 scripts/{module}.py {command} query --task-id <id> --timeout 600
polls again with fresh timeout window
if still incomplete after 600s, user can request query again with --timeout 1200

output: updated status; result URL if now complete; error if task failed (status=4)

step 6: handle task failure

input: task status=4 (failed) returned by run or query

inform user in plain language what went wrong (see error_handling.md for code mapping)
ask if user wants to retry or adjust parameters
do not auto-resubmit; wait for explicit user approval

output: error explanation; ready for user decision (retry, adjust, or cancel)

step 7: deliver result

input: completed task (status=3), result URL, metadata (dimensions, duration, voices used, etc.)

show result URL directly (image, video, or audio link)
include brief metadata (resolution, duration, model used, etc.)
use result template: [emoji] [type] complete | Result: [URL] • [metadata]
ask if user wants adjustments

output: user sees final asset; knows task succeeded

step 8: check credit balance (optional, before/after)

input: none (queries account)

run python3 scripts/user.py credit at any time
shows remaining credits and usage summary

output: credit balance; task cost breakdown

decision points

authentication gate

if no DREAMAPI_API_KEY env var set, user must log in first. send this message (exact format):

To get started, you need a DreamAPI API key.

1. Go to: https://api.newportai.com/
2. Sign in with Google or GitHub
3. Copy your API key from the Dashboard

Once you have your key, just tell me and I'll set it up for you.

(Chinese variant available in original doc above.)

else: proceed with authenticated session.

tool routing tree

user request → match against intent below → route to script + command

talking face synced to audio?
  → has video + audio file → avatar.py lipsync run
  → has photo + audio file → avatar.py dreamavatar run
  → wants character to perform actions from driving video → avatar.py dreamact run

generate image from text prompt?
  → image_gen.py text2image run

transform or restyle existing image?
  → image_gen.py image2image run

edit an image (colorize, enhance, fill, replace, remove bg)?
  → image_edit.py colorize run (b&w photo → color)
  → image_edit.py enhance run (boost quality)
  → image_edit.py outpainting run (extend borders)
  → image_edit.py inpainting run (fill/replace region)
  → image_edit.py swap-face run (replace face in image)
  → image_edit.py remove-bg run (strip background)

generate video from text?
  → video_gen.py text2video run

animate a still image into video?
  → video_gen.py image2video run

create smooth transition between two images?
  → video_gen.py head-tail run

edit a video (face swap, matting, composite)?
  → video_edit.py swap-face run (replace face in video)
  → video_edit.py matting run (remove background from video)
  → video_edit.py composite run (replace background in video)

translate video speech?
  → video_translate.py run (en/zh/es)

text-to-speech?
  → voice.py tts-common run (standard quality)
  → voice.py tts-pro run (premium)
  → voice.py clone run + voice.py tts-clone run (custom voice)

list available voices?
  → voice.py list

check credits?
  → user.py credit

outside capability set?
  → tell user clearly not supported yet

polling and timeout recovery

if run completes within 600s (default): → show result immediately, go to step 7

else if run times out (polling exceeds 600s): → save task ID → inform user with estimated wait time from table below → offer to resume with query --task-id <id> when ready

if query also times out after 600s: → user can request query --task-id <id> --timeout 1200 for longer wait → repeat as needed

if task status returns 4 (failed): → ask user to retry or adjust parameters → do not auto-resubmit

file upload routing

if user passes local file path (image, video, audio): → scripts auto-detect and upload to DreamAPI storage → pass file path directly as argument; no manual upload → if upload fails (network timeout, quota), inform user and ask to retry

else if user passes URL: → use URL directly; no upload needed

credits edge case

if user has zero credits remaining: → run will fail with insufficient credits error → direct user to purchase credits via dashboard → do not retry

output contract

task submission (after run)

{
  "taskId": "string (save for query if needed)",
  "status": "integer (0-2=processing, 3=success, 4=failed)",
  "estimatedTime": "string from table below (if processing)"
}

task completion (status=3)

{
  "taskId": "string",
  "status": 3,
  "result": {
    "url": "string (https://... direct link to asset)",
    "metadata": {
      "type": "image|video|audio",
      "dimensions": "WxH (images/videos)",
      "duration": "seconds (video/audio)",
      "model": "string (Flux, Wan2.1, DreamAvatar, etc.)"
    }
  }
}

task failure (status=4)

{
  "taskId": "string",
  "status": 4,
  "error": {
    "code": "integer (see error_handling.md)",
    "message": "string (user-friendly error)"
  }
}

credit balance query

{
  "credits": "integer (remaining)",
  "monthlyUsage": "object (breakdown by task type)"
}

estimated generation times (inform user after submission)

Task Type	Estimated Time
Avatar (LipSync / DreamAvatar / Dreamact)	2 to 5 min
Image Generation (Flux)	30s to 1 min
Image Editing (Colorize / Enhance / etc.)	30s to 1 min
Video Generation (Wan2.1)	3 to 5 min
Video Editing (Swap Face / Matting)	2 to 5 min
Video Translate	3 to 5 min
Voice Clone	30s to 1 min
TTS (Common / Pro / Clone)	10s to 30s
Remove Background	10s to 30s

file output locations

result URLs are permanent DreamAPI storage links (https://cdn.dreamapi.ai/...)
user can download, share, or embed directly
no local file writes unless user explicitly saves

outcome signal

user knows the skill worked when:

authentication succeeded: auth.py status returns authenticated: true
task submitted: user sees task ID and estimated wait time message
task completed: user receives result URL directly, with metadata (dimensions, duration, model)
result is usable: result URL is live and downloads/plays correctly (test by opening in browser or sending to user)
user confirms: user sees asset and explicitly says "looks good" or requests adjustments
credits deducted: balance decreased; user can verify via user.py credit

user-facing reply rules (enforce for every response)

keep replies short. give result or next step directly, no filler.
use plain language. no api jargon, no terminal refs, no env var mentions, no polling explanations.
never mention terminal details. no command output, logs, exit codes, file paths, or config internals in user message.
always send login link as https://api.newportai.com/ if login needed.
explain errors in one sentence. ask if user wants to retry.
be result-oriented. after completion, send result url first, then metadata. skip intermediate steps.
give time estimates. after submit, tell user the wait time from the table.

result message template

[emoji] [task type] complete

Result: <OUTPUT_URL>
• [key metadata line]

Not happy with the result? Let me know and I'll adjust.

example success outcomes

user says "make a talking head video" → skill submits, user gets "Avatar complete | Result: [url] | estimated 3-5 min"
task finishes → skill sends "Avatar complete | Result: [url] • 1280x720 video, 45s, DreamAvatar model"
user says "enhance this blurry photo" → skill submits and completes in 45s → "Image edit complete | Result: [url] • 2048x1536 PNG, enhanced quality"
user says "how many credits" → skill returns "You have 450 credits remaining"
user says "clone my voice" → skill returns "Voice cloned | Clone ID: abc123 | Ready to use in text-to-speech"

DreamAPI Skill

related skills

DreamAPI Skill

intent

inputs

procedure

decision points

output contract

outcome signal