Update README.md (PR #2) — opened by samforeman

File changed: README.md
```diff
@@ -1,6 +1,9 @@
 ---
 library_name: transformers
-
+datasets:
+- allenai/dolma
+base_model:
+- argonne-private/AuroraGPT-7B
 ---
 
 # Model Card for Model ID
@@ -39,5 +42,4 @@ Trained on 32 nodes of Polaris supercomputer using pytorch FSDP with Hybrid-shar
 * LR = 5x10^-5
 * per-gpu batch size = 1
 * Gradient accumulation = 6
-* Global batch size = 768
-
+* Global batch size = 768
```