Asset preprocessing for HyperFrames compositions — text-to-speech narration (Kokoro), audio/video transcription (Whisper), and background removal for…
HyperFrames Media Preprocessing

Three CLI commands produce assets for compositions: tts (speech), transcribe (timestamps), and remove-background (transparent video). Each downloads a model on first run and caches it under ~/.cache/hyperframes/. Drop the output into the project, then reference it from the composition HTML; see the hyperframes skill for the audio/video element conventions.

Text-to-Speech (tts)

Generate speech audio locally with Kokoro-82M. No API key is required.

    npx hyperframes tts "Text here" --voice af_nova --output narration.wav
    npx hyperframes tts script.txt --voice bf_emma --output narration.wav
    npx hyperframes tts --list   # all 54 voices

Voice Selection

Match voice to content. Default is af_heart.
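When a project has several script files, the tts invocation shown above can be wrapped in a small helper. The following is a minimal sketch, not part of HyperFrames: the `narrate` function and the `DRY_RUN` variable are hypothetical names introduced here for illustration, and only the `--voice` and `--output` flags from the commands above are used. It assumes each script file ends in `.txt` and writes the audio next to it with a `.wav` extension.

```shell
#!/bin/sh
# narrate FILE [VOICE]
# Hypothetical helper: builds a tts command for one script file.
# With DRY_RUN=1 it prints the command instead of running it, so the
# command line can be inspected before any model is downloaded.
narrate() {
  input=$1
  voice=${2:-af_heart}          # default voice named in the text above
  output="${input%.txt}.wav"    # intro.txt -> intro.wav
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf 'npx hyperframes tts %s --voice %s --output %s\n' \
      "$input" "$voice" "$output"
  else
    npx hyperframes tts "$input" --voice "$voice" --output "$output"
  fi
}
```

Calling `DRY_RUN=1 narrate intro.txt bf_emma` prints the command that would run; dropping `DRY_RUN` runs it. Paths containing spaces are passed through safely when the command runs, but the dry-run output is unquoted and is meant for reading only.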