title: Vector Database Observability description: RAG-side observability - query rate, top_k distribution, similarity threshold bands, per-DB returned-doc counts. The indicators that determine RAG quality.

Vector Database - Similarity threshold distribution buckets thresholds into 6 bands (0.0-0.1, 0.1-0.3, …, 0.9-1.0) so an over-strict threshold is visible without per-query inspection.
Purpose - RAG-side observability. Beyond simple “did the agent retrieve” questions, this tab surfaces retrieval parameters (top_k, similarity threshold) and per-DB returned-doc counts - the indicators that determine RAG quality.
top_k being set sensibly?” - top_k distribution (a spike at 1 or above 50 usually indicates an agent miscalibration).db.vector.client.operation spans.
All dashboards share the Observability global settings - time window, refresh interval, custom range. Vector Database has no tab-specific controls beyond those.
| Card | Shows | Source |
|---|---|---|
| Traces with RAG | Number of chat turns that included a vector query | TraceRecord.hasRag true count |
| Vector queries | Total vector store operations (query + add + delete) | db.vector.client.operation span count |
| Avg top_k | Mean of the requested top_k parameter |
db.vector.query.top_k attribute averaged |
| Avg query latency | Mean query duration | Span duration |
| Chart | Type | Reading |
|---|---|---|
| Operations / minute | Stacked bar (query / add / delete) | Add bursts during document ingestion; query bursts during chat |
| Query latency p50 / p95 / p99 | Multi-line, ms | Tail latency on remote vector stores can be ten-fold higher than SimpleVectorStore |
| top_k distribution | Histogram | Distribution of requested k values |
| Similarity threshold distribution | Histogram, 6 bands ([0.0, 0.1) … [0.9, 1.0]) |
Where the agent is anchoring relevance - strictness vs recall |
| DB systems | Donut by db.system |
One backend = single configuration; multiple = mixed stores |
| Avg returned docs by db | Horizontal bar | Combined with similarity threshold, this is the retrieval-recall signal |