Language Model Sampling

Beam Search

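Greedy decoding picks the single most probable token at each step, so it can miss sequences whose overall probability is higher. Beam search instead keeps the top-k partial hypotheses at each step, ranked by cumulative log-probability. This is still an approximate search, but it explores more of the space than greedy decoding (which is the k = 1 case).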
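
A minimal sketch of the idea, not a production decoder. The names `toy_next_probs` and `beam_search` and the toy probability table are hypothetical, made up for illustration; a real model would supply the next-token distribution. The table is chosen so that the best first token does not lead to the best full sequence.

```python
import math
from typing import Callable, Dict, List, Tuple

Prefix = Tuple[str, ...]


def toy_next_probs(prefix: Prefix) -> Dict[str, float]:
    """Hypothetical toy model: next-token probabilities for a given prefix.

    An empty dict means the prefix is complete (nothing left to generate).
    """
    table = {
        (): {"a": 0.6, "b": 0.4},
        ("a",): {"x": 0.55, "y": 0.45},
        ("b",): {"x": 0.9, "y": 0.1},
    }
    return table.get(prefix, {})


def beam_search(
    next_probs: Callable[[Prefix], Dict[str, float]],
    beam_width: int,
    max_len: int,
) -> List[Tuple[Prefix, float]]:
    """Return up to beam_width hypotheses as (tokens, log-probability), best first."""
    beams: List[Tuple[Prefix, float]] = [((), 0.0)]
    for _ in range(max_len):
        candidates: List[Tuple[Prefix, float]] = []
        for prefix, score in beams:
            dist = next_probs(prefix)
            if not dist:
                # Finished hypothesis: carry it forward unchanged.
                candidates.append((prefix, score))
                continue
            for token, p in dist.items():
                if p > 0.0:
                    # Log-probabilities add where probabilities multiply.
                    candidates.append((prefix + (token,), score + math.log(p)))
        # Keep only the top-k partial hypotheses by cumulative log-probability.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams


greedy = beam_search(toy_next_probs, beam_width=1, max_len=2)
beam = beam_search(toy_next_probs, beam_width=2, max_len=2)
print("greedy:", greedy[0][0], round(math.exp(greedy[0][1]), 2))
print("beam-2:", beam[0][0], round(math.exp(beam[0][1]), 2))
```

With this table, greedy (k = 1) commits to "a" and ends at ("a", "x") with probability 0.6 × 0.55 = 0.33, while k = 2 keeps "b" alive and finds ("b", "x") with probability 0.4 × 0.9 = 0.36. This sketch applies no length normalization, which practical decoders often add when hypotheses differ in length.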