rdiehlmartinez
committed on
Commit e948f36
1 Parent(s): 35ed80d
Update README.md
README.md
CHANGED
@@ -1,10 +1,52 @@
 ---
 title: README
-emoji:
+emoji: 🎯
 colorFrom: red
 colorTo: yellow
 sdk: static
 pinned: false
 ---
 
-
+# 🎯 Pico: Tiny Language Models for Learning Dynamics Research
+
+Pico is a framework for training and analyzing small language models, designed for clarity and educational use. Built on a LLaMA-style architecture, Pico makes it easy to experiment with and understand transformer-based language models.
+
+## Key Features
+
+- **Simple Architecture**: Clean, modular implementation of core transformer components
+- **Educational Focus**: Well-documented code with clear references to academic papers
+- **Research Ready**: Built-in tools for analyzing model learning dynamics
+- **Efficient Training**: Pre-tokenized dataset and optimized training loop
+- **Modern Stack**: Built with PyTorch Lightning, Wandb, and HuggingFace integrations
+
+## Core Components
+
+- **RMSNorm** for stable layer normalization
+- **Rotary Positional Embeddings (RoPE)** for position encoding
+- **Multi-head attention** with KV-cache support
+- **SwiGLU activation** function
+- **Residual connections** throughout
+
+## References
+
+Our implementation draws inspiration from and builds upon:
+- [LLaMA](https://arxiv.org/abs/2302.13971)
+- [RoPE](https://arxiv.org/abs/2104.09864)
+- [SwiGLU](https://arxiv.org/abs/2002.05202)
+
+## Contributing
+
+We welcome contributions, including:
+- Adding new features
+- Improving documentation
+- Fixing bugs
+- Sharing experimental results
+
+## License
+
+Apache 2.0 License
+
+## Contact
+
+- GitHub: [rdiehlmartinez/pico](https://github.com/rdiehlmartinez/pico)
+- Author: Richard Diehl Martinez
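To illustrate three of the components named under Core Components in the README added by this commit (RMSNorm, RoPE, and the SwiGLU gate), here is a minimal pure-Python sketch. It is not Pico's implementation, which according to the README builds on PyTorch Lightning. The function names `rms_norm`, `rope`, `silu`, and `swiglu_gate`, the list-based vectors, and the defaults `eps=1e-6` and `base=10000.0` are assumptions chosen for readability. The learned linear projections of a full SwiGLU feed-forward layer are omitted, so only the elementwise gating step is shown.

```python
import math


def rms_norm(x, gain, eps=1e-6):
    """Scale x by the reciprocal of its root mean square, then by a per-feature gain."""
    mean_square = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_square + eps)
    return [v * inv_rms * g for v, g in zip(x, gain)]


def rope(x, position, base=10000.0):
    """Rotate each consecutive pair (x[i], x[i+1]) by the angle position * base**(-i/d)."""
    d = len(x)
    if d % 2 != 0:
        raise ValueError("RoPE needs an even number of features")
    out = []
    for i in range(0, d, 2):
        theta = position * base ** (-i / d)  # lower frequency for later pairs
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        out.append(x[i] * cos_t - x[i + 1] * sin_t)
        out.append(x[i] * sin_t + x[i + 1] * cos_t)
    return out


def silu(v):
    """SiLU (Swish) activation: v * sigmoid(v)."""
    return v / (1.0 + math.exp(-v))


def swiglu_gate(gate, value):
    """Elementwise SwiGLU gating: silu(gate) * value (projections omitted)."""
    return [silu(g) * v for g, v in zip(gate, value)]


if __name__ == "__main__":
    print(rms_norm([3.0, 4.0], gain=[1.0, 1.0]))
    print(rope([1.0, 0.0, 1.0, 0.0], position=1))
    print(swiglu_gate([1.0, -1.0], [2.0, 2.0]))
```

Because `rope` applies a pure rotation to each pair, the dot product between a rotated query and a rotated key depends only on the difference of their positions, which is the property that lets attention scores encode relative position.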