Spaces: Dovakiins/qwerrwe
Revision: 71a43f8
Path: qwerrwe/configs
100 contributors
History: 39 commits
Latest commit: winglian, "use pythia-12b, neox-20b is flaky" (3961902), over 1 year ago
Name                          Size        Last commit                                                               Updated
accelerate                    -           quickstart instructions for starting from runpod (#5)                     over 1 year ago
cerebras_1_3B_alpaca.yml      939 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
galactica_1_3B.yml            820 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
llama_13B_alpaca.yml          835 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
llama_65B_alpaca.yml          1.06 kB     swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
llama_7B_4bit.yml             981 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
llama_7B_alpaca.yml           937 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
llama_7B_jeopardy.yml         1.15 kB     swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
pythia_1_2B_alpaca.yml        983 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
quickstart.yml                976 Bytes   swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
sample.yml                    3.67 kB     swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
stability_3b.yml              1.11 kB     swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago
vicuna_13B_4bit_reflect.yml   1.05 kB     swap batch size for gradient accumulation steps to decouple from num gpu  over 1 year ago