Understanding Text Generation Parameters in Transformers


This post is divided into seven parts; they are:

- Core Text Generation Parameters
- Experimenting with Temperature
- Top-K and Top-P Sampling
- Controlling Repetition
- Greedy Decoding and Sampling
- Parameters for Specific Applications
- Beam Search and Multiple Sequences Generation

Let’s pick the GPT-2 model as an example.
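Before turning to GPT-2, here is a rough intuition for three of the parameters named above: temperature, top-k, and top-p. The sketch below applies them to a made-up list of four logits using only the Python standard library. It does not use GPT-2 or the transformers library. The function names (`softmax`, `filter_top_k_top_p`, `sample`) and the `toy_logits` values are hypothetical and chosen only for illustration.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn logits into probabilities; lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def filter_top_k_top_p(probs, top_k=0, top_p=1.0):
    """Keep the top_k most likely tokens (0 disables the limit), then the smallest
    prefix of them whose cumulative probability reaches top_p, and renormalize."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k > 0:
        ranked = ranked[:top_k]
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}

def sample(logits, temperature=1.0, top_k=0, top_p=1.0, rng=random):
    """Draw one token index after applying temperature, top-k, and top-p."""
    probs = softmax(logits, temperature)
    candidates = filter_top_k_top_p(probs, top_k, top_p)
    tokens = list(candidates)
    weights = [candidates[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]

# Toy logits for a four-token vocabulary (illustrative values, not model output).
toy_logits = [2.0, 1.0, 0.5, -1.0]
print(softmax(toy_logits, temperature=1.0))
print(softmax(toy_logits, temperature=0.5))
print(sample(toy_logits, temperature=0.8, top_k=3, top_p=0.9))
```

With `top_k=1` the sampler always returns the most likely token, which is the same choice greedy decoding would make. Raising the temperature above 1.0 flattens the distribution and makes unlikely tokens more probable.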
