Introduction
The lost-in-the-middle problem describes how language models retrieve information far more reliably from the beginning or end of a long context than from the middle: when the relevant information sits in the middle, accuracy drops significantly. The effect was documented in "Lost in the Middle" (Liu et al., 2024).
The Phenomenon
When asked to retrieve a specific fact from a long document:
Position of fact → Retrieval accuracy
Beginning: ~90%
End: ~90%
Middle: ~60-70%
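The measurement behind these numbers can be sketched as a "needle in a haystack" prompt builder: place a known fact at a chosen position inside filler text, then ask the model to retrieve it. The filler sentence, the needle, and the question below are all illustrative placeholders, not the stimuli from the paper.

```python
# Hypothetical sketch: construct retrieval prompts with a known fact
# ("needle") placed at the beginning, middle, or end of filler context.
FILLER = ["The weather was unremarkable and nothing notable happened."] * 400
NEEDLE = "The secret code is 7412."

def build_prompt(position: str) -> str:
    """Insert NEEDLE into the filler sentences at the requested position."""
    sentences = list(FILLER)
    idx = {"beginning": 0,
           "middle": len(sentences) // 2,
           "end": len(sentences)}[position]
    sentences.insert(idx, NEEDLE)
    context = " ".join(sentences)
    return context + "\n\nQuestion: What is the secret code?"

prompt = build_prompt("middle")
```

Scoring each position is then a matter of checking whether the model's answer contains the needle's fact.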
Why Does This Happen?
1. Attention Flow
Information must flow through many transformer layers, and the representations of middle-context tokens may not be preserved all the way to the later layers that drive the output.
2. Positional Bias
Models trained with positional encodings may favor beginning/end positions.
3. Attention Dilution
As context grows, middle positions receive less distinct attention.
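The dilution intuition can be made concrete with a toy calculation (an assumption for illustration, not a measurement of any real model's internals): under near-uniform attention, the weight on any single middle token shrinks like 1/n as the context length n grows.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array of logits."""
    e = np.exp(x - x.max())
    return e / e.sum()

# Uniform logits -> uniform attention: each of n tokens gets weight 1/n,
# so a middle token's share of attention vanishes as context grows.
for n in (1_000, 8_000, 32_000):
    weights = softmax(np.zeros(n))
    print(n, weights[n // 2])
```

Boundary tokens, by contrast, often receive extra mass from positional biases, which compounds the disadvantage of the middle.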
Experimental Evidence
Researchers inserted a random fact into documents of varying lengths and asked models to retrieve it:
| Context Length | Beginning | Middle | End |
|---|---|---|---|
| 1K tokens | 95% | 90% | 95% |
| 8K tokens | 85% | 60% | 85% |
| 32K tokens | 80% | 45% | 80% |
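Restating the table as data makes the trend explicit: the gap between edge and middle accuracy widens as the context grows.

```python
# The table above, reproduced as data, to quantify the degradation.
table = {
    1_000:  {"beginning": 0.95, "middle": 0.90, "end": 0.95},
    8_000:  {"beginning": 0.85, "middle": 0.60, "end": 0.85},
    32_000: {"beginning": 0.80, "middle": 0.45, "end": 0.80},
}

for length, acc in table.items():
    edge = (acc["beginning"] + acc["end"]) / 2
    gap = edge - acc["middle"]
    print(f"{length:>6} tokens: middle trails edges by {gap:.0%}")
```

At 1K tokens the middle trails the edges by only 5 points, but by 32K tokens the gap reaches 35 points.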
Implications
- RAG systems: May miss information in middle of retrieved docs
- Long document QA: Middle information may be ignored
- Code understanding: Important code in middle may be missed
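For RAG systems, one commonly suggested mitigation is to reorder retrieved documents so the most relevant ones land at the beginning and end of the prompt, pushing weaker matches into the middle. The function below is a minimal sketch of that idea, not any particular library's API.

```python
# Sketch of a long-context reordering mitigation: alternate top-ranked
# documents between the front and back of the prompt, so the least
# relevant ones end up in the (poorly attended) middle.
def reorder_for_long_context(docs_by_relevance):
    """docs_by_relevance: list sorted most-relevant first.
    Returns the docs with the strongest ones at both edges."""
    front, back = [], []
    for i, doc in enumerate(docs_by_relevance):
        (front if i % 2 == 0 else back).append(doc)
    return front + back[::-1]

print(reorder_for_long_context(["d1", "d2", "d3", "d4", "d5"]))
```

With five documents ranked d1 (best) to d5 (worst), the result places d1 first and d2 last, leaving d4 and d5 near the middle.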