SKILL.md

Semantic search and RAG with Cloudflare Vectorize V2, supporting 5M vectors, 31ms latency, and async mutations.

Vectorize V2 (GA September 2024) brings 25× capacity increase (200K → 5M vectors), 18× faster queries (549ms → 31ms), and async mutations with breaking API changes including returnMetadata boolean → string enum

Metadata filtering with 10 indexes per collection, range operators ($gte, $lt, etc.), and cardinality-aware query optimization for RAG and semantic search workflows

Batch insert optimization: 5000-vector batches deliver 18× throughput improvement; individual inserts impractical at scale

Native Workers AI integration for embeddings (@cf/baai/bge-base-en-v1.5, 768 dims) plus OpenAI support; dimension limit of 1536 requires reduction for larger models

14 common errors prevented: dimension mismatches, metadata index timing, V2 async mutation handling, TypeScript type gaps, Windows dev registry issues, and topK conditional limits

Cloudflare Vectorize

Complete implementation guide for Cloudflare Vectorize - a globally distributed vector database for building semantic search, RAG (Retrieval Augmented Generation), and AI-powered applications with Cloudflare Workers.

Status: Production Ready ✅
Last Updated: 2026-01-21
Dependencies: cloudflare-worker-base (for Worker setup), cloudflare-workers-ai (for embeddings)
Latest Versions: wrangler@4.59.3, @cloudflare/workers-types@4.20260109.0
Token Savings: ~70%
Errors Prevented: 14
Dev Time Saved: ~4 hours

What This Skill Provides

don't have the plugin yet? install it then click "run inline in claude" again.

cloudflare-vectorize

SKILL.md

related skills