Research interests

None defined yet.

Team members 263

Organization Card
About org cards


BigCode is an open scientific collaboration working on responsible training of large language models for coding applications.

You can find more information on the main website at You can also follow Big Code on Twitter at

In this organization, you can find The Stack, a 6.4TB of source code in 358 programming languages from permissive licenses. You can also find SantaCoder, a strong 1.1B code generation model trained on Java, JavaScript & Python. In addition to some data governance tools.