gpt2_base_prefix_682k/src/seriguela/data/preprocess_data.py
GPT-2 Base trained on prefix dataset (682K)
c082aa2 verified
# Script to preprocess data (raw -> processed)