Add low-latency raw search path separate from agentic answer synthesis

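The retrieval path in retrieval.py currently chains LLM tool selection, storage calls, and LLM synthesis. This favors answer quality but is latency-heavy, since every query pays for at least two LLM calls even when the caller only needs the ranked hits.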

Acceptance criteria:

- Add a /search fast path that returns ranked profile/summary/temporal/snippet/code hits without LLM synthesis.
- Make LLM answer generation optional, enabled only when answer=true is passed.
- Cache profile catalogs and retrieval plans.
- Track p50/p95/p99 latency per retrieval mode.
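
A minimal sketch of how the criteria above could fit together, not the actual retrieval.py code. Every name here (`Hit`, `TTLCache`, `LatencyTracker`, `search`, `make_plan`, `run_plan`, `synthesize`) is hypothetical and chosen for illustration; the real storage and LLM calls are assumed to be injected as plain callables, and only the retrieval plan is cached here.

```python
from __future__ import annotations

import time
from dataclasses import dataclass
from statistics import quantiles
from typing import Any, Callable, Optional


@dataclass
class Hit:
    kind: str    # e.g. "profile", "summary", "temporal", "snippet", "code"
    text: str
    score: float


class TTLCache:
    """Tiny time-based cache, e.g. for retrieval plans (hypothetical helper)."""

    def __init__(self, ttl_seconds: float, clock: Callable[[], float] = time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: dict[str, tuple[float, Any]] = {}

    def get_or_compute(self, key: str, compute: Callable[[], Any]) -> Any:
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]          # still fresh: skip recomputation
        value = compute()
        self._entries[key] = (now, value)
        return value


class LatencyTracker:
    """Collects latency samples per retrieval mode and reports p50/p95/p99."""

    def __init__(self) -> None:
        self._samples: dict[str, list[float]] = {}

    def record(self, mode: str, seconds: float) -> None:
        self._samples.setdefault(mode, []).append(seconds)

    def percentiles(self, mode: str) -> dict[str, float]:
        samples = self._samples.get(mode, [])
        if len(samples) < 2:
            raise ValueError(f"need at least two samples for mode {mode!r}")
        cuts = quantiles(samples, n=100, method="inclusive")  # 99 cut points
        return {"p50": cuts[49], "p95": cuts[94], "p99": cuts[98]}


def search(
    query: str,
    *,
    make_plan: Callable[[str], Any],        # assumed: builds a retrieval plan
    run_plan: Callable[[Any], list[Hit]],   # assumed: performs the storage calls
    plan_cache: TTLCache,
    tracker: LatencyTracker,
    synthesize: Optional[Callable[[str, list[Hit]], str]] = None,
    answer: bool = False,
) -> dict[str, Any]:
    """Return ranked hits; call the LLM synthesizer only when answer=True."""
    mode = "answer" if answer else "raw"
    start = time.perf_counter()
    plan = plan_cache.get_or_compute(query, lambda: make_plan(query))
    hits = sorted(run_plan(plan), key=lambda h: h.score, reverse=True)
    result: dict[str, Any] = {"hits": hits}
    if answer:
        if synthesize is None:
            raise ValueError("answer=True requires a synthesize callable")
        result["answer"] = synthesize(query, hits)
    tracker.record(mode, time.perf_counter() - start)
    return result
```

Recording latency under separate "raw" and "answer" modes keeps the fast-path percentiles from being skewed by synthesis time. A real implementation would likely bound the sample list (for example with a fixed-size window or a histogram) rather than keep every sample in memory.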

Add low-latency raw search path separate from agentic answer synthesis #163
