File size: 413 Bytes
6e2f86a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
library_name: nanotron
---
# ⚙️ Nano-Mistral
Modeling code for Mistral to use with [Nanotron](https://github.com/huggingface/nanotron/)
## 🚀 Quickstart
```python
# Generate a config file
python config_tiny_mistral.py
# Run training
export CUDA_DEVICE_MAX_CONNECTIONS=1 # important for some distributed operations
torchrun --nproc_per_node=8 run_train.py --config-file config_tiny_mistral.yaml
``` |