rdiehlmartinez committed on
Commit e948f36 • 1 Parent(s): 35ed80d

Update README.md

Files changed (1)
  1. README.md +44 -2
README.md CHANGED
@@ -1,10 +1,52 @@
  ---
  title: README
- emoji: 😻
  colorFrom: red
  colorTo: yellow
  sdk: static
  pinned: false
  ---
 
- Edit this `README.md` markdown file to author your organization card.

  ---
  title: README
+ emoji: 🎯
  colorFrom: red
  colorTo: yellow
  sdk: static
  pinned: false
  ---
 
+ # 🎯 Pico: Tiny Language Models for Learning Dynamics Research
+
+ Pico is a framework for training and analyzing small language models, designed with clarity and educational purposes in mind. Built on a LLaMA-style architecture, Pico makes it easy to experiment with and understand transformer-based language models.
+
+ ## 🔑 Key Features
+
+ - **Simple Architecture**: Clean, modular implementation of core transformer components
+ - **Educational Focus**: Well-documented code with clear references to academic papers
+ - **Research Ready**: Built-in tools for analyzing model learning dynamics
+ - **Efficient Training**: Pre-tokenized dataset and optimized training loop
+ - **Modern Stack**: Built with PyTorch Lightning, Wandb, and HuggingFace integrations
+
+ ## 🏗️ Core Components
+
+ - **RMSNorm** for stable layer normalization
+ - **Rotary Positional Embeddings (RoPE)** for position encoding
+ - **Multi-head attention** with KV-cache support
+ - **SwiGLU activation** function
+ - **Residual connections** throughout
+
+ ## 📚 References
+
+ Our implementation draws inspiration from and builds upon:
+ - [LLaMA](https://arxiv.org/abs/2302.13971)
+ - [RoPE](https://arxiv.org/abs/2104.09864)
+ - [SwiGLU](https://arxiv.org/abs/2002.05202)
+
+ ## 🤝 Contributing
+
+ We welcome contributions, including:
+ - Adding new features
+ - Improving documentation
+ - Fixing bugs
+ - Sharing experimental results
+
+ ## 📝 License
+
+ Apache 2.0 License
+
+ ## 📫 Contact
+
+ - GitHub: [rdiehlmartinez/pico](https://github.com/rdiehlmartinez/pico)
+ - Author: Richard Diehl Martinez