Route AI agent compute tasks to the cheapest viable backend. Supports local inference (Ollama), cloud GPU (Vast.ai), and quantum hardware (Wukong 72Q). Use w...
---
name: edge-router
description: Route AI agent compute tasks to the cheapest viable backend. Supports local inference (Ollama), cloud GPU (Vast.ai), and quantum hardware (Wukong 72Q). Use when an agent needs to decide where to run a task, optimize compute costs, check backend availability, or execute workloads across edge/cloud/quantum infrastructure.
---
# Edge Router
Routes tasks to cheapest available backend: local (free) → cloud GPU ($0.01) → quantum ($0.10).
## API
Base: `https://edge-router.gpupulse.dev/api/v1` (or localhost:3825)
### Route (recommend)
```bash
curl -X POST "$BASE/route" -H "Content-Type: application/json" \
-d '{"task_type": "inference"}'
```
### Execute (route + run)
```bash
curl -X POST "$BASE/execute" -H "Content-Type: application/json" \
-d '{"task_type": "inference", "payload": {"model": "llama3.2:1b", "prompt": "hello"}}'
```
### Task Types
- `inference` → local first, cloud fallback
- `training` → cloud GPU
- `quantum` → Wukong 72Q
- `auto` → cheapest available
### Other
- `GET /backends` — list + status
- `GET /stats` — routing statistics
- `GET /health` — health check
don't have the plugin yet? install it then click "run inline in claude" again.