Jetson Nano
#74 opened about 19 hours ago
by
idotr7
fixed generation_args in Sample inference code
#73 opened 7 days ago
by
dkleine
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
#72 opened 9 days ago
by
Kenkentron
TFLite conversion
#71 opened 13 days ago
by
henrywang0314
Why is EOS=<|endoftext|> and not </s> when <s> is used as BOS?
1
#70 opened 17 days ago
by
shiyue
Fine-tuning with a structured dataset
#68 opened 22 days ago
by
don412
Questions about chat template format (whether to escape newline characters and special characters or not)
#67 opened 25 days ago
by
king-crab
Datasets used for training
#63 opened about 1 month ago
by
saransh03sharma
fp16 normal weights
#62 opened about 1 month ago
by
gioaca
GREAT MODEL
#61 opened about 1 month ago
by
doberst
Sliding window = 2047?
#60 opened about 1 month ago
by
skyshine102
Recent change on the rstrip property on special tokens
1
#59 opened about 1 month ago
by
xxhansh
fix: update tokenizer config to support `add_generation_prompt=True` and clarify content
#57 opened about 1 month ago
by
lamhieu
Fix auto-casting with SFT sometimes only up-casting keys and not queries
#56 opened about 1 month ago
by
roborovski
Help with merging LoRA layers back onto Phi3
#55 opened about 1 month ago
by
SHIMURA0321
Changed instruction/chat template
#54 opened about 1 month ago
by
ofirzaf
Phi-3 support for GPU training with MPS on Mac
1
#52 opened about 2 months ago
by
chaishirin
System prompts ignored in chat completions
13
#51 opened about 2 months ago
by
joshuaturner
Fine-tuning is not improving domain knowledge. It is very complicated; could you help?
5
#50 opened about 2 months ago
by
aaditya
The Phi-3 model is not learning from training data using ORPO.
3
#44 opened about 2 months ago
by
Imran1
Leaking Training Data
3
#40 opened about 2 months ago
by
qunitindk
Instruct model needs work
8
#35 opened about 2 months ago
by
Nafnlaus