Item: SiliconFlow TTS Gen
Rating: 5.3
Author: Implexa

SiliconFlow TTS Gen

Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.

installs

stars

karma

SkillRank score ↗

5.3/ 10

evaluated by implexa, claude-haiku-4-5 · 2026-07-02

siliconflow-tts-gen wraps the cosyvoice2 api for speech synthesis across 8 preset voices, 4 languages, and chinese dialects. generates audio files with configurable speed and output format.

structure

3.0

trigger phrases

4.0

procedure

6.0

edge cases

2.0

documentation

6.0

strengths

SKILL.md

---
name: siliconflow-tts-gen
description: Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
env:
  - SILICONFLOW_API_KEY
files:
  config:
    - ~/.openclaw/openclaw.json
---

# SiliconFlow TTS Generation Skill

Text-to-Speech using SiliconFlow API with CosyVoice2 model. Supports 8 preset voices, multiple languages, and Chinese dialects.

## Features

- 🎙️ **8 Preset Voices**: 4 male + 4 female voices
- 🌍 **Multilingual**: Chinese, English, Japanese, Korean
- 🗣️ **Chinese Dialects**: Cantonese, Sichuan, Shanghai, Tianjin, Wuhan
- ⚡ **Ultra Low Latency**: 150ms first packet delay
- 🎵 **Voice Cloning**: 3-second rapid voice cloning
- 💾 **Auto Download**: Saves audio files locally

## Requirements

- **Environment Variable**: `SILICONFLOW_API_KEY`
- **Optional Config File**: `~/.openclaw/openclaw.json` (for auto-detect)

## Installation

```bash
npx clawhub install siliconflow-tts-gen
```

## Configuration

Set your SiliconFlow API key:

```bash
export SILICONFLOW_API_KEY="your-api-key"
```

## Usage

### List Available Voices

```bash
python3 scripts/generate.py --list-voices
```

### Generate Speech

```bash
# Basic usage (default voice: alex)
python3 scripts/generate.py "你好，世界"

# Specify voice
python3 scripts/generate.py "Hello World" --voice bella

# Adjust speed
python3 scripts/generate.py "你好" --voice claire --speed 0.9

# Save to file
python3 scripts/generate.py "欢迎收听" --output welcome.mp3

# Change format
python3 scripts/generate.py "Hello" --format wav
```

## Available Voices

### Male Voices
| ID | Name | Characteristic |
|----|------|----------------|
| alex | 沉稳男声 | Mature and steady |
| benjamin | 低沉男声 | Deep and low |
| charles | 磁性男声 | Magnetic |
| david | 欢快男声 | Cheerful |

### Female Voices
| ID | Name | Characteristic |
|----|------|----------------|
| anna | 沉稳女声 | Mature and elegant |
| bella | 激情女声 | Passionate |
| claire | 温柔女声 | Gentle and kind |
| diana | 欢快女声 | Sweet and happy |

## Parameters

| Parameter | Type | Default | Range | Description |
|-----------|------|---------|-------|-------------|
| `--voice` | string | alex | - | Voice ID |
| `--speed` | float | 1.0 | 0.25-4.0 | Speech speed |
| `--format` | string | mp3 | mp3/opus/wav/pcm | Output format |
| `--output` | string | output.mp3 | - | Output file path |

## Security Notes

- This skill requires an API key to call SiliconFlow services
- The script reads `~/.openclaw/openclaw.json` only to auto-detect API keys
- No sensitive data is transmitted except to `api.siliconflow.cn`
- Review the code at `scripts/generate.py` before providing credentials

## Author

MaxStorm Team

## License

MIT

don't have the plugin yet? install it then click "run inline in claude" again.

SiliconFlow TTS Gen

SKILL.md

related skills