
One post tagged with "Haiku Protocol"

Posts about Haiku Protocol, the semantic compression system for token-efficient LLM context.

Context Windows Are a Lie (And Haiku Protocol Is My Coping Mechanism)

[Header image: Terminal showing a 128K context window shrinking to an effective 8K zone, with lost-in-the-middle degradation visualized as fading text]
~10 min read
Ryan Goodrich
Technical Writer, AI Enthusiast, and Developer Advocate

LLM vendors would like you to know that their latest model supports a 128,000-token context window. Some of them say 200,000. One of them, and I won't name names but their logo is a little sunset, says a million. A million tokens. That's an entire copy of War and Peace with room to spare, which is appropriate because trying to get useful work done at the far end of a million-token window is its own kind of Russian tragedy.

Here's what the marketing materials don't mention: the effective context window, the portion where the model actually pays reliable attention to what you put there, is dramatically smaller. Research from Stanford, Berkeley, and others has converged on a finding that would be funny if it weren't costing people real money: models struggle with information placed in the middle of long contexts. They're great at the beginning. They're decent at the end. The middle? The middle is where facts go to die quietly, unnoticed, like a footnote in a terms of service agreement.

This is the "Lost in the Middle" problem, and if you're building anything that retrieves information and feeds it to a language model (which, in 2026, is approximately everyone), it means the number on the tin is a fantasy. Your 128K window is functionally an 8K window with 120K tokens of expensive padding.
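The standard way to check this for yourself is a needle-in-a-haystack probe: bury one distinctive fact at varying depths inside a long wall of filler, ask the model to retrieve it, and watch recall sag in the middle. Here's a minimal sketch of the prompt-construction half in Python; `FILLER`, `NEEDLE`, and `build_probe` are illustrative names I made up, and the per-sentence token estimate is a rough assumption, not a real tokenizer.

```python
# Needle-in-a-haystack probe construction (sketch).
# Bury one key fact (the "needle") at a chosen depth inside filler text;
# in a real experiment, each prompt would be sent to the model along with
# a question like "What is the secret deployment code?" and scored.

FILLER = "The quick brown fox jumps over the lazy dog. "
NEEDLE = "The secret deployment code is HAIKU-7."

def build_probe(total_tokens: int, depth: float,
                tokens_per_filler: int = 10) -> str:
    """Build a prompt of roughly `total_tokens`, with the needle placed
    at `depth` (0.0 = very start of the context, 1.0 = very end)."""
    n_fillers = max(1, total_tokens // tokens_per_filler)
    insert_at = int(n_fillers * depth)
    chunks = [FILLER] * n_fillers
    chunks.insert(insert_at, NEEDLE + " ")
    return "".join(chunks)

# One probe per depth; each would be a separate model call in a real run.
for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
    prompt = build_probe(total_tokens=128_000, depth=depth)
    position = prompt.index(NEEDLE) / len(prompt)
    print(f"depth={depth:.2f} -> needle at {position:.0%} of prompt")
```

Run that against your model of choice at a few context sizes and plot recall by depth. If your curve looks like a U, with high accuracy at the ends and a crater in the middle, congratulations: you've reproduced the finding on your own dime.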

I know this because I ran the experiment. Accidentally. Three times.