description: Tutorial 5 - RAG without tools. Use an indexed document as grounded context in a chat answer, with per-message retrieval traces from the Vector Database.
Time 5 min · Difficulty ★★☆ · Surfaces Agentic Chat, Vector Database
!!! abstract “Goal” Use the document you indexed in Tutorial 3 as grounded context in a chat answer - no tools yet, just retrieval-augmented generation.
qwen3.5:9b model already selected (from Tutorial 4 - it sticks until you change it).
① the indexed test-rag.pdf is selected - every prompt in this chat will retrieve relevant chunks from the document before the model answers.
① grounded prompt - the model will retrieve chunks first, then answer using their content rather than generic memory.
!!! warning “RAG only as good as your chunks” A great chat model can’t recover from poorly chunked content. If your document has tables or code blocks, look at the chunked output in Vector Database before relying on it in chat - the splitter may have cut at unhelpful boundaries.