AttributeError: 'GPT2TokenizerFast' object has no attribute 'max_len'

11k Views Asked by m.b At 14 April 2021 at 10:20

I am using the Hugging Face transformers library and get the following message when running run_lm_finetuning.py: AttributeError: 'GPT2TokenizerFast' object has no attribute 'max_len'. Has anyone else run into this problem, or does anyone have an idea how to fix it? Thanks!

My full experiment run:

    mkdir experiments

    for epoch in 5
    do
        python run_lm_finetuning.py \
            --model_name_or_path distilgpt2 \
            --model_type gpt2 \
            --train_data_file small_dataset_train_preprocessed.txt \
            --output_dir experiments/epochs_$epoch \
            --do_train \
            --overwrite_output_dir \
            --per_device_train_batch_size 4 \
            --num_train_epochs $epoch
    done


There are 2 best solutions below

Wiktor Stribiżew On 14 April 2021 at 10:27 BEST ANSWER

The GitHub issue "AttributeError: 'BertTokenizerFast' object has no attribute 'max_len'" contains the fix:

The run_language_modeling.py script is deprecated in favor of language-modeling/run_{clm, plm, mlm}.py.

If not, the fix is to change max_len to model_max_length.
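For a script that has to run against both older and newer library versions, a small helper like the one below might work. This is only a sketch: the function name tokenizer_max_length is made up for illustration, and the two stand-in objects with their 1024 and 512 values are hypothetical placeholders for real tokenizers, not values taken from the library.

```python
from types import SimpleNamespace


def tokenizer_max_length(tokenizer):
    """Return the maximum sequence length, whichever attribute the tokenizer has."""
    # Newer tokenizers expose `model_max_length` (the rename described above).
    if hasattr(tokenizer, "model_max_length"):
        return tokenizer.model_max_length
    # Fall back to the old `max_len` attribute for tokenizers that only have that.
    return tokenizer.max_len


# Hypothetical stand-ins for real tokenizer objects, used only for this demo.
new_style = SimpleNamespace(model_max_length=1024)
old_style = SimpleNamespace(max_len=512)

print(tokenizer_max_length(new_style))
print(tokenizer_max_length(old_style))
```

In the old script, each place that reads tokenizer.max_len could then call the helper instead, or simply be edited to tokenizer.model_max_length if only a recent version needs to be supported.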

Alternatively, pinning the library to an older release with pip install transformers==3.0.2 might fix the issue; some people have reported that it works.

white On 09 May 2022 at 09:08

I used this command to solve it:

pip install transformers==3.0.2
