Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -23,10 +23,18 @@ High quality datasets are the powerhouse of LLMs and require special care and sk
|
|
23 |
|
24 |
The datatsets and training recipes of most state-of-the-art models are not released. We build cutting-edge models and release the full training pipeline as well fostering more innovation and reproducibility, such as [Zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta), [StarCoder2](https://huggingface.co/bigcode/starcoder2-15b), or [SmolLM2](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct).
|
25 |
|
|
|
|
|
|
|
|
|
26 |
### βοΈ Infrastructre
|
27 |
|
28 |
The research team is organized in small teams with typically <4 people and the science cluster consists of 96 x 8xH100 nodes as well as an auto-scalable CPU cluster for dataset processing. In this setup, even a small research team can build and push out impactful artifacts.
|
29 |
|
|
|
|
|
|
|
|
|
30 |
### π Releases
|
31 |
|
32 |
This is the release timeline of 2024 so far (you can click on each element!):
|
|
|
23 |
|
24 |
The datatsets and training recipes of most state-of-the-art models are not released. We build cutting-edge models and release the full training pipeline as well fostering more innovation and reproducibility, such as [Zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta), [StarCoder2](https://huggingface.co/bigcode/starcoder2-15b), or [SmolLM2](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct).
|
25 |
|
26 |
+
### πΈ Collaborations
|
27 |
+
|
28 |
+
Research and collaboration go hand in hand. That's why we like to organize and participate in large open collaborations such as [BigScience](https://bigscience.huggingface.co) and [BigCode](https://www.bigcode-project.org).
|
29 |
+
|
30 |
### βοΈ Infrastructre
|
31 |
|
32 |
The research team is organized in small teams with typically <4 people and the science cluster consists of 96 x 8xH100 nodes as well as an auto-scalable CPU cluster for dataset processing. In this setup, even a small research team can build and push out impactful artifacts.
|
33 |
|
34 |
+
### π Educational material
|
35 |
+
|
36 |
+
Besides writing tech reports of research projects we also like to write more educational content to help newcomers get started to the field or practitioners. We built for example the [alignment handbook](https://github.com/huggingface/alignment-handbook), the [pretraining tutorial](https://www.youtube.com/watch?v=2-SPH9hIKT8), or the [FineWeb blog](https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1).
|
37 |
+
|
38 |
### π Releases
|
39 |
|
40 |
This is the release timeline of 2024 so far (you can click on each element!):
|