Pilot Validation Results
Feb 2026We conducted a real-world Proof of Concept (PoC) on an agentic retrieval pipeline using XRM-SSD as the local vector/PQ storage backend and Supermemory as the application memory engine.
On a PCIe Gen5 NVMe SSD–based setup (1–4 nodes), the synergy architecture achieved:
- • ~55–65% reduction in end-to-end latency (Avg: 3.2 ms → 1.4 ms | P99: 4.4 ms → 1.7 ms)
- • ~3× improvement in retrieval throughput (~300 RPS → 850–900 RPS)
- • High context relevance (up to 92%) and low hallucination risk (15%)
This demonstrates that combining hardware-optimized local retrieval (XRM-SSD) with persistent application memory (Supermemory) transforms agentic RAG into a real-time, edge-native cognitive pipeline.
For detailed configuration or to discuss your own PoC, contact: [email protected]
| Metric | Baseline | XRM-SSD + Supermemory | Improvement |
|---|---|---|---|
| Avg Latency | ~3.2 ms | ~1.4 ms | 56% ↓ |
| P99 Latency | ~4.4 ms | ~1.7 ms | 61% ↓ |
| Throughput (RPS) | ~300 | ~850–900 | 2.8–3× ↑ |
SSD
PCIe Gen5 NVMe SSD (12–14 GB/s class)
Node Count
1–4 UALink Pods (or simulated UALink fabric)
CPU
x86 / ARM Edge Node
RAM
64–128 GB
Dataset Size
1M–10M vectors (PQ / Quantized)
Vector Dimension
384 / 768
Top-K
5 / 10
Supermemory Store
Local KV + Profile Memory
Avg / P99 Latency + Throughput Comparison
- Baseline
- XRM-SSD + Supermemory
Latency Distribution Histogram
Baseline shows long tail (high P99), Synergy concentrates around low peak
- Baseline
- Synergy
Latency Box Plot (Median, P99 Reduction)
Baseline
Min: 1.2 ms
Q1: 2.1 ms
Median: 3.0 ms
Q3: 3.8 ms
P99: 4.4 ms
XRM-SSD + Supermemory
Min: 0.8 ms
Q1: 1.1 ms
Median: 1.3 ms
Q3: 1.5 ms
P99: 1.7 ms
Pilot Validation Note
Results measured on representative Gen5 setup (Feb 2026). Full UALink hardware PoC in progress. For detailed metrics or to validate with your workload, contact: [email protected]
Interactive Comparison Mode
- Context Relevance
- Memory Retention
✅ XRM-SSD + Supermemory: Context relevance stays at 82% by round 5. Persistent memory maintains conversation coherence.
Simulation Note
This comparison mode uses real POC data from Feb 2026 pilot validation. For production deployment or custom workload testing, contact: [email protected]