Submit Dataify YouTube Audio File by URL Builder tasks. Use when the user wants the YouTube audio file collection tool, collect YouTube audio files, scrape Y...
---
name: dataify-youtube-audio-by-url
description: Submit Dataify YouTube Audio File by URL Builder tasks. Use when the user wants the YouTube audio file collection tool, collect YouTube audio files, scrape YouTube audio files, crawl YouTube audio media, collect audio files by URL, create a Dataify youtube_audio_by-url task, or asks in Chinese with meanings like "YouTube音频文件采集", "YouTube音频文件抓取", "YouTube音频采集", "YouTube音频抓取", "音频文件采集", "音频文件抓取", "音频采集", "音频抓取", or similar audio file noun plus collection/scraping action wording. Also use when receiving task_id/status, configuring DATAIFY_API_TOKEN, or troubleshooting this Dataify Builder request.
---
# Dataify YouTube Audio By URL
Submit YouTube audio file collection jobs through Dataify Builder by URL. After a successful submission, give the user the `task_id`, the returned or inferred status, and tell them to visit `https://dataify.com/dashboard/` to view results.
## API TOKEN Handling
Use `DATAIFY_API_TOKEN` as the long-term saved token name.
- If the user provides a token in the request, use it for this run.
- If no token is provided, first check whether `DATAIFY_API_TOKEN` is already saved locally in the environment.
- If `DATAIFY_API_TOKEN` is saved locally, use it without asking the user to re-enter the token.
- If no token is available locally, tell the user they need to provide a Dataify API TOKEN.
- If the user does not have an API TOKEN, tell them they can register or log in at `https://dataify.com/login` to get one.
- If the user already has an API TOKEN, tell them it is available in the top-right area of `https://dataify.com/dashboard/`.
- After the user provides an API TOKEN and no local `DATAIFY_API_TOKEN` is saved, ask whether they want to save it locally as `DATAIFY_API_TOKEN` for future use.
- If the user wants to save it, give the appropriate command for their shell and ask them to run it; do not silently persist tokens without confirmation.
- Do not call the Builder endpoint without a token.
- Always call it `API TOKEN` in user-facing instructions. Prefer the environment variable name `DATAIFY_API_TOKEN` for saved local use.
PowerShell examples for saving the token for the current session:
```powershell
$env:DATAIFY_API_TOKEN = "YOUR_DATAIFY_API_TOKEN"
```
For a persistent user-level variable on Windows:
```powershell
[Environment]::SetEnvironmentVariable("DATAIFY_API_TOKEN", "YOUR_DATAIFY_API_TOKEN", "User")
```
## Core Workflow
1. Before submitting, show the user the required values, shared values, optional values, and defaults listed in the Parameter Checklist.
2. For dropdown fields, show all allowed options as Markdown tables with both `Label` and `Value` columns. Use `scripts/submit_dataify_youtube_audio_by_url.py --list-options` to print the full dropdown tables.
3. Ask whether the user wants to change any value before running the task.
4. Ask whether the user wants to collect multiple YouTube audio files. If yes, ask for multiple `url` values.
5. Normalize the final `url` values into a list of `spider_parameters` objects.
6. Normalize `bitrate`, `audio_format`, `subtitles_language`, `kilohertz`, `is_subtitles`, and `selected_only` into one shared `spider_universal` object.
7. Resolve the Dataify token from explicit input or saved `DATAIFY_API_TOKEN`.
8. If no token is available, ask the user to enter their API TOKEN and ask whether to save it as `DATAIFY_API_TOKEN`.
9. Validate each URL, dropdown value, kilohertz direction, bitrate direction, and file name.
10. Submit a Builder request to create the task.
11. Read `data.task_id` from the Builder response and read `data.status` or `status` when present.
12. Stop after Builder succeeds.
13. Tell the user to visit `https://dataify.com/dashboard/` to view or manage results.
## Parameter Checklist
When the user invokes this skill, first tell them these values are used. Always display submitted parameters as a Markdown table; do not use a plain sentence or bullet list for the parameter confirmation.
| Field | Required | Default | Location | Notes |
| --- | --- | --- | --- | --- |
| `url` | Yes | `https://www.youtube.com/watch?v=_SdpvpvVrLY` | `spider_parameters` | YouTube URL to collect. |
| `kilohertz` | No | `48000` with `<=` | `spider_universal` | Dropdown-style shared parameter. Ask whether it should be greater than or equal to, or less than or equal to, the selected value. Submit as `<=48000` by default. |
| `audio_format` | No | `opus` | `spider_universal` | Dropdown-style shared parameter. Audio format: `opus` or `m4a`. |
| `bitrate` | No | `320` with `<=` | `spider_universal` | Dropdown-style shared parameter. Ask whether it should be greater than or equal to, or less than or equal to, the selected value. Submit as `<=320` by default. |
| `subtitles_language` | No | `ab` | `spider_universal` | Dropdown-style shared parameter. Subtitle language. |
| `is_subtitles` | No | `false` | `spider_universal` | Shared parameter. Whether to include subtitles. |
| `selected_only` | No | `false` | `spider_universal` | Dropdown-style shared parameter. Whether to use only selected specifications. |
| `file_name` | No | `{{TasksID}}` | Builder form field | Use the default when the user does not change it. |
Then ask: "Do you want to change any of these values before I submit the task?"
Also ask: "Do you want to collect multiple YouTube audio files? If yes, provide multiple `url` values."
For `kilohertz` and `bitrate`, ask whether the selected option should use `<=` or `>=`. Default to `<=` when the user does not specify a direction.
If the user has already provided some values, show those values in place of the defaults and only ask whether the remaining/defaulted values should be changed.
## Dropdown Options
Before asking the user to choose dropdown values, show all allowed options as Markdown tables with both `Label` and `Value` columns.
Use this command to print the complete tables:
```powershell
python3 ".\scripts\submit_dataify_youtube_audio_by_url.py" --list-options
```
If `python3` is not available, use the local Python 3 command for that machine, such as `python`.
The script prints:
- `kilohertz` options.
- `kilohertz_direction` options.
- `audio_format` options.
- `bitrate` options.
- `bitrate_direction` options.
- `subtitles_language` options, using the supplied `cn` value as `Label` and `typeValue` as `Value`.
- `is_subtitles` options.
- `selected_only` options.
## Parameter Handling
- `url` is required. If the user does not provide it, use the default `https://www.youtube.com/watch?v=_SdpvpvVrLY` only after showing it in the parameter confirmation table.
- Trim leading and trailing whitespace from `url`.
- `url` cannot be empty.
- `url` must start with `https://www.youtube.com/`.
- Multiple collection groups only repeat `url` inside `spider_parameters`.
- `bitrate`, `audio_format`, `subtitles_language`, `kilohertz`, `is_subtitles`, and `selected_only` are shared parameters. Send them in `spider_universal`, not inside each `spider_parameters` object.
- `kilohertz` defaults to `48000` and `kilohertz_direction` defaults to `<=`; submit `kilohertz` as `<=48000`.
- `bitrate` defaults to `320` and `bitrate_direction` defaults to `<=`; submit `bitrate` as `<=320`.
- `audio_format` defaults to `opus`. Allowed values are `opus` and `m4a`.
- `subtitles_language` defaults to `ab`.
- `is_subtitles` defaults to `false`.
- `selected_only` defaults to `false`.
- `file_name` defaults to `{{TasksID}}`. If the user changes it, submit the user-provided value.
- `file_name` cannot be empty.
Single-group example:
```json
spider_parameters=[{"url":"https://www.youtube.com/watch?v=_SdpvpvVrLY"}]
spider_universal={"bitrate":"<=320","audio_format":"opus","subtitles_language":"ab","kilohertz":"<=48000","is_subtitles":"false","selected_only":"false"}
```
Multi-group example:
```json
spider_parameters=[{"url":"https://www.youtube.com/watch?v=_SdpvpvVrLY"},{"url":"https://www.youtube.com/watch?v=_SdpvpvVrLY"}]
spider_universal={"bitrate":"<=320","audio_format":"opus","subtitles_language":"ab","kilohertz":"<=48000","is_subtitles":"false","selected_only":"false"}
```
## Dataify Builder Request
Use form fields rather than hand-built URL-encoded strings.
- URL: `https://scraperapi.dataify.com/builder?platform=1`
- Method: `POST`
- Authorization header: `Bearer DATAIFY_API_TOKEN`
- Content type: `application/x-www-form-urlencoded`
- Fixed fields:
- `spider_name=youtube.com`
- `spider_id=youtube_audio_by-url`
- `spider_errors=true`
- Default field:
- `file_name={{TasksID}}`
- Dynamic fields:
- `spider_parameters` must be a JSON string array of URL objects.
- `spider_universal` must be a JSON string object containing shared audio settings.
## Script
For stable execution, prefer `scripts/submit_dataify_youtube_audio_by_url.py` with Python 3.6 or newer instead of rewriting the Builder flow.
```powershell
python3 ".\scripts\submit_dataify_youtube_audio_by_url.py" --url "https://www.youtube.com/watch?v=_SdpvpvVrLY"
```
To override the saved environment token or default shared parameters for one run:
```powershell
python3 ".\scripts\submit_dataify_youtube_audio_by_url.py" --api-token "YOUR_DATAIFY_API_TOKEN" --url "https://www.youtube.com/watch?v=_SdpvpvVrLY" --kilohertz "48000" --kilohertz-direction "<=" --audio-format "opus" --bitrate "320" --bitrate-direction "<=" --subtitles-language "ab" --is-subtitles "false" --selected-only "false" --file-name "{{TasksID}}"
```
To submit multiple URLs:
```powershell
python3 ".\scripts\submit_dataify_youtube_audio_by_url.py" --params-json '[{"url":"https://www.youtube.com/watch?v=_SdpvpvVrLY"},{"url":"https://www.youtube.com/watch?v=_SdpvpvVrLY"}]'
```
The script prints a JSON summary with `task_id`, `status`, `parameters`, `spider_universal`, `file_name`, `dashboard_url`, and `message`.
## Troubleshooting
`Missing Dataify API TOKEN` means no explicit token was passed and `DATAIFY_API_TOKEN` is not saved locally. Tell the user they need to provide their Dataify API TOKEN, ask whether they want to save it as `DATAIFY_API_TOKEN`, or tell them they can register or log in at `https://dataify.com/login` to get one. If they already have a token, tell them it is in the top-right area of `https://dataify.com/dashboard/`.
`url cannot be empty` means the required YouTube URL is missing.
`url must start with https://www.youtube.com/` means the URL is outside the allowed YouTube domain.
`Unsupported kilohertz`, `Unsupported audio_format`, `Unsupported bitrate`, `Unsupported subtitles_language`, `Unsupported is_subtitles`, or `Unsupported selected_only` means the value must be one of the allowed dropdown values.
`Unsupported kilohertz_direction` or `Unsupported bitrate_direction` means the value must be `<=` or `>=`.
`File name cannot be empty` means no usable `file_name` was provided.
`Necessary parameters is empty!` usually means the Builder request was not submitted as form fields, `spider_parameters` was not a JSON string array, `spider_universal` was not a JSON string object, or one `spider_parameters` object is missing `url`.
Missing `task_id` usually means the authorization header, token, `spider_name`, `spider_id`, `spider_parameters`, or `spider_universal` is wrong.
## Guardrails
- Do not put `bitrate`, `audio_format`, `subtitles_language`, `kilohertz`, `is_subtitles`, or `selected_only` inside `spider_parameters`.
- Do not omit `spider_universal`.
- Do not poll for results after Builder succeeds.
- Use only `API TOKEN` and `DATAIFY_API_TOKEN` when referring to authentication.
- Do not hard-code local Python paths.
- Do not claim the Builder response contains YouTube audio files.
- Do not invent result fields.
- Always direct the user to `https://dataify.com/dashboard/` after successful task creation.
don't have the plugin yet? install it then click "run inline in claude" again.