Nucleus (Top-p) Sampling (Medium) | AI Code Lab

Nucleus (Top-p) Sampling

Available as a decoding option (top_p) for GPT-4, Claude, and most modern LLMs. Selects the smallest set of highest-probability tokens whose cumulative probability reaches or exceeds the threshold p.

1. Sort tokens by probability (descending)

2. Accumulate probabilities in that order until the running sum ≥ p

3. Return the indices of the tokens accumulated so far
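
The three steps above can be sketched in plain Python. This is a minimal sketch under stated assumptions, not a reference solution: the name nucleus_indices and the p keyword are taken from the example call on this page, while the eps tolerance for floating-point rounding is an assumption added here.

```python
def nucleus_indices(probs, p, eps=1e-12):
    """Return indices of the smallest top-probability set whose mass reaches p."""
    # Step 1: indices ordered by probability, highest first (ties keep input order).
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    selected = []
    cumulative = 0.0
    for idx in order:
        # Step 2: add tokens until the running sum reaches p.
        selected.append(idx)
        cumulative += probs[idx]
        if cumulative >= p - eps:  # eps: assumed guard against float rounding
            break
    # Step 3: indices of the nucleus, in descending-probability order.
    return selected


print(nucleus_indices([0.5, 0.3, 0.15, 0.05], p=0.8))  # [0, 1]
```

If p is larger than the total probability mass, the loop never breaks and every index is returned.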

The size of the candidate set adapts to the model's confidence: it keeps fewer tokens when the model is confident and more when it is uncertain.
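
To make this adaptive behaviour concrete, the sketch below counts how many tokens are kept for a peaked and a flat distribution at the same p. The helper nucleus_size and both distributions are hypothetical, made up purely for illustration.

```python
def nucleus_size(probs, p):
    """Count how many top tokens are needed for their total probability to reach p."""
    cumulative = 0.0
    for count, prob in enumerate(sorted(probs, reverse=True), start=1):
        cumulative += prob
        if cumulative >= p:
            return count
    return len(probs)  # p was never reached; keep every token


# Illustrative, made-up distributions over a 5-token vocabulary.
confident = [0.92, 0.04, 0.02, 0.01, 0.01]  # model strongly prefers one token
uncertain = [0.22, 0.21, 0.20, 0.19, 0.18]  # model is close to uniform

print(nucleus_size(confident, p=0.9))  # 1
print(nucleus_size(uncertain, p=0.9))  # 5
```

With the same threshold, the peaked distribution needs a single token while the near-uniform one needs all five.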

nucleus_indices([0.5, 0.3, 0.15, 0.05], p=0.8)
# cumsum: 0.5, 0.8 → stop after 2 tokens
→ [0, 1]  (indices sorted by probability)