Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
69
8
464
wing lian
PRO
winglian
Follow
mftorrey's profile picture
EduardG's profile picture
Archid's profile picture
2212 followers
·
14 following
winglian
winglian
AI & ML interests
None yet
Organizations
winglian
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
axolotl-ai-co/romulus-mistral-nemo-12b-simpo
about 1 month ago
Update README.md
#2 opened about 1 month ago by
CombinHorizon
New activity in
deepseek-ai/DeepSeek-Prover-V1.5-Base
2 months ago
the config class and config.json uses DeepseekConfig, not v2
1
#5 opened 2 months ago by
winglian
Match the config class name to what the modeling code expects
1
#4 opened 2 months ago by
winglian
New activity in
microsoft/Phi-3.5-mini-instruct
2 months ago
trust_remote_code=True
1
#9 opened 2 months ago by
winglian
New activity in
NousResearch/Hermes-2-Pro-Llama-3-8B
6 months ago
add axolotl tag
#1 opened 6 months ago by
winglian
New activity in
mattshumer/Llama-3-8B-16K
6 months ago
add axolotl tag
#3 opened 6 months ago by
winglian
New activity in
cognitivecomputations/dolphin-2.9-llama3-8b
7 months ago
add axolotl tag
#12 opened 7 months ago by
winglian
New activity in
openbmb/Eurus-RM-7b
7 months ago
Enable flash_attention_2 support since the underlying Mistral model supports it
#3 opened 7 months ago by
winglian
New activity in
meta-llama/Meta-Llama-3-8B
7 months ago
Rename original/tokenizer.model to tokenizer.model
3
#6 opened 7 months ago by
winglian
commented
a paper
7 months ago
Octopus v2: On-device language model for super agent
Paper
•
2404.01744
•
Published
Apr 2
•
57
•
8
New activity in
PrunaAI/dbrx-base-bnb-4bit
7 months ago
invalid weights doesn't match modeling code
1
#3 opened 7 months ago by
winglian
New activity in
SinclairSchneider/dbrx-base-quantization-fixed
7 months ago
reduce verbosity of logging
#1 opened 7 months ago by
winglian
New activity in
databricks/dbrx-instruct
7 months ago
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
31
#10 opened 7 months ago by
tdrussell
New activity in
LnL-AI/dbrx-base-converted-v2
7 months ago
reduce logging verbosity
1
#3 opened 7 months ago by
winglian
New activity in
SinclairSchneider/dbrx-instruct-quantization-fixed
7 months ago
dbrx-base
1
#2 opened 7 months ago by
winglian
New activity in
ai21labs/Jamba-v0.1
7 months ago
finetuning issues
2
#9 opened 7 months ago by
winglian
Fix bias logic to enable QLoRA finetuning
3
#5 opened 7 months ago by
winglian
New activity in
cerebras/SlimPajama-627B
11 months ago
Trouble with streaming
7
#5 opened about 1 year ago by
andersonbcdefg
New activity in
open-llm-leaderboard/open_llm_leaderboard
11 months ago
latest commit breaks ability to submit mistral finetunes
4
#410 opened 11 months ago by
winglian
New activity in
Open-Orca/Mistral-7B-OpenOrca
11 months ago
Can you share the training configuration of Axolotl?
3
#24 opened 11 months ago by
timlim123
Load more