Spaces:

Dovakiins
/

qwerrwe

Build error

App Files Files Community

qwerrwe / examples

100 contributors

History: 107 commits

jrc's picture

jrc

Add shifted sparse attention (#973) [skip-ci]

1d70f24 unverified about 1 year ago

cerebras
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
code-llama
Add shifted sparse attention (#973) [skip-ci] about 1 year ago
falcon
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
gptj
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
jeopardy-bot
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
llama-2
Add shifted sparse attention (#973) [skip-ci] about 1 year ago
mamba
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
mistral
Set eval_sample_packing to false in mistral config.yaml (#1003) about 1 year ago
mpt-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
openllama-3b
Add shifted sparse attention (#973) [skip-ci] about 1 year ago
phi
pin model_revision for phi2 (#1123) about 1 year ago
pythia-12b
Feat(wandb): Refactor to be more flexible (#767) about 1 year ago
pythia
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
qwen
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
redpajama
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
replit-3b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
tiny-llama
streaming multipack for pretraining dataset (#959) about 1 year ago
xgen-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 1 year ago
yi-34B-chat
Add an example config for finetuning a 34B model on a 24GB GPU (#1000) about 1 year ago