# Microsoft Taught Models to Forget Their Own Thoughts. The Ghost Traces Stayed.

- Date: 2026-04-10
- Category: Artificial Intelligence

Microsoft Research paper Memento teaches models to compress their own chain-of-thought mid-generation, cutting KV cache 2-3x and nearly doubling throughput. Key twist: erased reasoning blocks leave traces the model still uses.

---