--- title: README emoji: ✨ colorFrom: gray colorTo: red sdk: static pinned: false ---

drawing

BigCode is an open scientific collaboration working on responsible training of large language models for coding applications. You can find more information on the main website at https://www.bigcode-project.org. You can also follow Big Code on Twitter at https://twitter.com/BigCodeProject. In this organization, you can find The Stack, a 6.4TB of source code in 358 programming languages from permissive licenses. You can also find SantaCoder, a strong 1.1B code generation model trained on Java, JavaScript & Python. In addition to some data governance tools.