Show HN: We matched full-context recall on ~1% of the tokens (open benchmark)
Category: library
Tags: llm, context-compression, benchmarks
Score: 6.5/10 (Innovation: 6, Technical: 6, Documentation: 8, Utility: 6)
Compresh-benchmarks provides open, reproducible benchmark results for Context Compression and Episodic Memory layers for LLM APIs, demonstrating competitive recall performance with fewer tokens read. Its use of independent judges, comparison across retrieval strategies, and transparent self-assessment make it interesting for advancing efficient LLM context handling.
Target audience: AI researchers, ML engineers, backend devs
Repository: https://github.com/compresh/compresh-benchmarks/blob/main/epbench/WRITEUP.md · Python · NOASSERTION
View on Hacker News