Question on Phase III latency + ablation without Agentic Verification/Query Rewriting

Hi EverMemOS team,

Thanks for sharing EverMemOS!

I have a question specifically about Phase III (Reconstructive Recollection). As described, it includes:
1. MemScene Selection
2. Episode and Foresight Filtering
3. Agentic Verification and Query Rewriting

The third step (Agentic Verification and Query Rewriting) may trigger multiple LLM calls (e.g., sufficiency check + query rewriting + additional retrieval rounds). In some real-time conversational systems, memory retrieval latency is tightly constrained (e.g., staying within ~200ms on the retrieval path), so I’m trying to understand the runtime implications of this module.

Could you please share:

**1) Retrieval latency measurements (if available)**
- Did you measure end-to-end retrieval latency for Phase III (p50/p95)?

**2) Ablation results without Agentic Verification and Query Rewriting**
- Did you run an ablation where Phase III is “one-shot retrieval” (MemScene Selection + Episode/Foresight Filtering only), i.e., removing the verification/rewriting loop entirely?
- If yes, could you share the performance impact on LoCoMo/LongMemEval/PersonaMem v2, and any notes on the trade-off between accuracy and latency?

This would be very helpful for practitioners who want to adopt EverMemOS in latency-sensitive settings.

Thanks in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on Phase III latency + ablation without Agentic Verification/Query Rewriting #31

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Question on Phase III latency + ablation without Agentic Verification/Query Rewriting #31

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions