Run AI models on Replicate via predictions, webhooks, and streaming.
Docs
Reference: https://replicate.com/docs/llms.txt
OpenAPI schema: https://api.replicate.com/openapi.json
MCP server: https://mcp.replicate.com
Per-model docs: https://replicate.com/{owner}/{model}/llms.txt
Set Accept: text/markdown when requesting docs pages for Markdown responses.
Workflow
Choose the right model - Search with the API or ask the user.
Get model metadata - Fetch input and output schema via API.
Create prediction - POST to /v1/predictions.
Poll for results - GET prediction until status is "succeeded".
Return output - Usually URLs to generated content.
Three ways to get outputdon't have the plugin yet? install it then click "run inline in claude" again.