
cerebras/Cerebras-GPT-13B
•
Updated
•
9.01k
•
616
None defined yet.
Cerebras is the inventor of the Wafer-Scale Engine – the revolutionary processor at the heart of our Cerebras CS-2 system. Our co-designed hardware/software stack is designed to train large language models upward of 1 trillion parameters using only data parallelism. This is a collection of models we trained on Cerebras CS-2 systems.
Join the Cerebras Discord to discuss our work and research!