Spaces: Dovakiins/qwerrwe
8430db2
qwerrwe/tests
100 contributors
History: 108 commits
Latest commit: jinwonkim93, Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273), 8430db2 (unverified), 9 months ago
| Name | Scan | Size | Last commit | Updated |
|---|---|---|---|---|
| core/ | | | add gptneox embeddings, fix phi2 inputs, also fix the casting (#1083) | 11 months ago |
| e2e/ | | | relora: magnitude pruning of the optimizer (#1245) | 10 months ago |
| fixtures/ | | | Respect sequence_len in config for `type: llama2_chat` (#926) | 12 months ago |
| monkeypatch/ | | | support for true batches with multipack (#1230) | 10 months ago |
| prompt_strategies/ | | | Feat/chatml add system message (#1117) | 10 months ago |
| utils/ | | | Add shifted sparse attention (#973) [skip-ci] | 10 months ago |
| test_data.py | Safe | 2.23 kB | Fix pretraining with iterable/streaming Dataset (#556) | about 1 year ago |
| test_dict.py | Safe | 3.17 kB | fix DefaultDict.__or__ | over 1 year ago |
| test_expand_mask.py | Safe | 1.43 kB | Attention mask and position id fixes for packing (#285) | over 1 year ago |
| test_normalize_config.py | Safe | 3 kB | set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] | 10 months ago |
| test_packed_batch_sampler.py | Safe | 3.32 kB | support for true batches with multipack (#1230) | 10 months ago |
| test_packed_dataset.py | Safe | 2.28 kB | Attention mask and position id fixes for packing (#285) | over 1 year ago |
| test_packed_pretraining.py | Safe | 2.7 kB | Pretrain transforms (#1261) | 10 months ago |
| test_prompt_tokenizers.py | Safe | 16.3 kB | fix mistral prompt assembly (#982) | 11 months ago |
| test_prompters.py | Safe | 4.39 kB | Attention mask and position id fixes for packing (#285) | over 1 year ago |
| test_schedulers.py | Safe | 1.72 kB | Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273) | 9 months ago |
| test_tokenizers.py | Safe | 2.51 kB | Support for additional_special_tokens (#1221) [skip ci] | 10 months ago |
| test_validation.py | Safe | 22.8 kB | Peft lotfq (#1222) | 10 months ago |