Vector Database Engineer is a data claude skill built by sickn33. Best for: ML engineers and backend developers implement production vector search systems with optimized retrieval performance..

What it does
Design and optimize vector databases, embeddings, and semantic search for RAG and recommendation systems.
Category
data
Created by
sickn33
Last updated
Claude Skilldata GitHub-backed CuratedadvancedClaude Code

Vector Database Engineer

Design and optimize vector databases, embeddings, and semantic search for RAG and recommendation systems.

Skill instructions


name: vector-database-engineer description: "Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar" risk: unknown source: community date_added: "2026-02-27"

Vector Database Engineer

Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similarity search. Use PROACTIVELY for vector search implementation, embedding optimization, or semantic retrieval systems.

Do not use this skill when

  • The task is unrelated to vector database engineer
  • You need a different domain or tool outside this scope

Instructions

  • Clarify goals, constraints, and required inputs.
  • Apply relevant best practices and validate outcomes.
  • Provide actionable steps and verification.
  • If detailed examples are required, open resources/implementation-playbook.md.

Capabilities

  • Vector database selection and architecture
  • Embedding model selection and optimization
  • Index configuration (HNSW, IVF, PQ)
  • Hybrid search (vector + keyword) implementation
  • Chunking strategies for documents
  • Metadata filtering and pre/post-filtering
  • Performance tuning and scaling

Use this skill when

  • Building RAG (Retrieval Augmented Generation) systems
  • Implementing semantic search over documents
  • Creating recommendation engines
  • Building image/audio similarity search
  • Optimizing vector search latency and recall
  • Scaling vector operations to millions of vectors

Workflow

  1. Analyze data characteristics and query patterns
  2. Select appropriate embedding model
  3. Design chunking and preprocessing pipeline
  4. Choose vector database and index type
  5. Configure metadata schema for filtering
  6. Implement hybrid search if needed
  7. Optimize for latency/recall tradeoffs
  8. Set up monitoring and reindexing strategies

Best Practices

  • Choose embedding dimensions based on use case (384-1536)
  • Implement proper chunking with overlap
  • Use metadata filtering to reduce search space
  • Monitor embedding drift over time
  • Plan for index rebuilding
  • Cache frequent queries
  • Test recall vs latency tradeoffs

Limitations

  • Use this skill only when the task clearly matches the scope described above.
  • Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
  • Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.

Use this skill

Most skills are portable instruction packages. Claude Code supports SKILL.md directly. Other agents can use adapted files like AGENTS.md, .cursorrules, and GEMINI.md.

Claude Code

Save SKILL.md into your Claude Skills folder, then restart Claude Code.

mkdir -p ~/.claude/skills/vector-database-engineer && curl -L "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/HEAD/skills/vector-database-engineer/SKILL.md" -o ~/.claude/skills/vector-database-engineer/SKILL.md

Installs to ~/.claude/skills/vector-database-engineer/SKILL.md.

Use cases

ML engineers and backend developers implement production vector search systems with optimized retrieval performance.

Reviews

No reviews yet. Be the first to review this skill.

No signup required

Stats

Installs0
GitHub Stars35.8k
Forks5869
LicenseMIT License
UpdatedMar 25, 2026