---
title: README
emoji: 
colorFrom: gray
colorTo: red
sdk: static
pinned: false
---


BigCode is an open scientific collaboration working on the responsible training of large language models for coding applications.

You can find more information on the main website at https://www.bigcode-project.org. You can also follow BigCode on Twitter at https://twitter.com/BigCodeProject.

In this organization, you can find The Stack, a 3.1 TB dataset of source code in 30 programming languages, along with its near-deduplicated version and a small subset.
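
As a rough sketch of how these datasets might be loaded with the Hugging Face `datasets` library: the repository IDs (`bigcode/the-stack`, `bigcode/the-stack-dedup`, `bigcode/the-stack-smol`) and the `data/<language>` directory layout are assumptions, so please check each dataset card for the exact names. The helper names `stack_data_dir` and `stream_stack` are hypothetical. Access may also require accepting the dataset terms and logging in to the Hub.

```python
# Assumed repository IDs for the three variants; verify them on the dataset cards.
STACK_REPOS = {
    "full": "bigcode/the-stack",
    "dedup": "bigcode/the-stack-dedup",
    "small": "bigcode/the-stack-smol",
}


def stack_data_dir(language: str) -> str:
    """Build the per-language sub-directory (assumed layout: data/<language>)."""
    return f"data/{language.lower()}"


def stream_stack(language: str, variant: str = "full"):
    """Return a streaming iterator over one language of the chosen variant.

    Streaming avoids downloading the whole dataset before reading any files.
    """
    # Imported lazily so the helpers above work without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(
        STACK_REPOS[variant],
        data_dir=stack_data_dir(language),
        split="train",
        streaming=True,
    )
```

For example, `next(iter(stream_stack("python", variant="dedup")))` would fetch a single record from the near-deduplicated version, assuming the layout above holds.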

If you want to access the models trained on these datasets, please send a request to contact@bigcode-project.org.