RAG and Retrieval Quality
Chunking strategies, hybrid retrieval, reranking, and citation-grounded response patterns.
About
Most AI features look great on demo day. I work on the part that comes after; making them reliable, measurable, and safe to keep in production a month later.
What pulled me toward this specifically was a pattern I kept seeing; AI features that looked great in the demo, then quietly became a trust problem once real users hit them. Nobody could explain why because nobody had been measuring. I built my whole workflow around making sure that doesn't happen.
Chunking strategies, hybrid retrieval, reranking, and citation-grounded response patterns.
Prompt contracts, function-calling, fallback behavior, and stable API outputs for apps.
Regression suites, faithfulness tracking, and trace-based debugging in pre-release workflows.
Stack
Python-first APIs and workflow systems; the tools I reach for when something needs to ship fast and stay maintainable.
Everything needed to make retrieval accurate, answers grounded, and quality measurable before release.