---
id: kvquant-before
title: KVQuant before
tags: before, kvquant, memory
links: 
created_at: 2026-05-03T07:41:16.974729+00:00
updated_at: 2026-05-03T07:41:16.974756+00:00
source: 
---

Before: the model receives the raw prompt with no compression and no semantic retrieval, so the context carries more clutter.
