gpt2_large_prefix_682k/src/seriguela/data/preprocess_data.py
GPT-2 Large trained on prefix dataset (682K)
# Script to preprocess data (raw -> processed)