Dataset Details

#11
by joaogui1 - opened

Will there be any details on how the subset of The Stack was selected? What kind of filtering was used, etc?

Replit org

We are working on a technical report which will contain more info also about data sampling and processing. Stay tuned!

Thanks!

Sign up or log in to comment