Design a production RAG (Retrieval-Augmented Generation) system for an enterprise that needs to answer questions over 1 million internal documents (PDFs, Word files, Slack messages, wikis).
Requirements:
- •Answer any question grounded in company knowledge
- •Responses must cite source documents with page numbers
- •Queries answered within 3 seconds end-to-end
- •System must handle 10,000 queries/day
- •Data must stay secure (no data sent to public APIs without encryption)
What you'll be assessed on
The interviewer will dig into chunking strategy, embedding model choice, hybrid search architecture, re-ranking, and how you evaluate quality at scale.