DefaultPromptHandler
* Set model_max_length in tokenizer in prompt handler
* Add release note
use_fast
model_kwargs