Research interests

None defined yet.

Team members 263

Organization Card
About org cards

drawing

BigCode is an open scientific collaboration working on responsible training of large language models for coding applications.

You can find more information on the main website at https://www.bigcode-project.org. You can also follow Big Code on Twitter at https://twitter.com/BigCodeProject.

In this organization, you can find The Stack, a 6.4TB of source code in 358 programming languages from permissive licenses. You can also find SantaCoder, a strong 1.1B code generation model trained on Java, JavaScript & Python. In addition to some data governance tools.