Language Model Sampling
Topics
Notes
Linked
Beam Search
Greedy decoding misses high-probability sequences. Maintain top-k partial hypotheses at each step for better approximate search.
Greedy decoding misses high-probability sequences. Maintain top-k partial hypotheses at each step for better approximate search.