back
loading skill details...
|
Semantic search and RAG with Cloudflare Vectorize V2, supporting 5M vectors, 31ms latency, and async mutations. Vectorize V2 (GA September 2024) brings 25× capacity increase (200K → 5M vectors), 18× faster queries (549ms → 31ms), and async mutations with breaking API changes including returnMetadata boolean → string enum Metadata filtering with 10 indexes per collection, range operators ($gte, $lt, etc.), and cardinality-aware query optimization for RAG and semantic search workflows Batch insert optimization: 5000-vector batches deliver 18× throughput improvement; individual inserts impractical at scale Native Workers AI integration for embeddings (@cf/baai/bge-base-en-v1.5, 768 dims) plus OpenAI support; dimension limit of 1536 requires reduction for larger models 14 common errors prevented: dimension mismatches, metadata index timing, V2 async mutation handling, TypeScript type gaps, Windows dev registry issues, and topK conditional limits Cloudflare Vectorize Complete implementation guide for Cloudflare Vectorize - a globally distributed vector database for building semantic search, RAG (Retrieval Augmented Generation), and AI-powered applications with Cloudflare Workers. Status: Production Ready ✅ Last Updated: 2026-01-21 Dependencies: cloudflare-worker-base (for Worker setup), cloudflare-workers-ai (for embeddings) Latest Versions: wrangler@4.59.3, @cloudflare/workers-types@4.20260109.0 Token Savings: ~70% Errors Prevented: 14 Dev Time Saved: ~4 hours What This Skill Provides
don't have the plugin yet? install it then click "run inline in claude" again.