Datasets used for training
#63 opened 2 days ago
by
saransh03sharma
fp16 normal weights
#62 opened 4 days ago
by
gioaca
GREAT MODEL
#61 opened 5 days ago
by
doberst
Sliding window = 2047?
#60 opened 5 days ago
by
skyshine102
Recent change on the rstrip property on special tokens
1
#59 opened 6 days ago
by
xxhansh
fix: update tokenizer config to support `add_generation_prompt=True` and clarify content
#57 opened 8 days ago
by
lamhieu
Fix auto-casting with SFT sometimes only up-casting keys and not queries
#56 opened 9 days ago
by
roborovski
Help with merging LoRA layers back onto Phi3
#55 opened 10 days ago
by
SHIMURA0321
Changed instruction/chat template
#54 opened 12 days ago
by
ofirzaf
Phi-3 support for GPU training with MPS on Mac
1
#52 opened 17 days ago
by
chaishirin
System prompts ignored in chat completions
10
#51 opened 18 days ago
by
joshuaturner
Fine-tuning is not improving the domain knowledge? it is very complicated, could you help?
5
#50 opened 18 days ago
by
aaditya
the phi3 model not learning from training data using orpo.
3
#44 opened 22 days ago
by
Imran1
Leaking Training Data
3
#40 opened 23 days ago
by
qunitindk
Instruct model needs work
8
#35 opened 24 days ago
by
Nafnlaus