Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
17
Kailai Yang
klyang
Follow
dark-pen's profile picture
ai-dufflebags's profile picture
ruffibuddy's profile picture
4 followers
·
1 following
https://stevekgyang.github.io/
SteveKGYang
AI & ML interests
Natural language processing
Recent Activity
upvoted
a
paper
3 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
authored
a paper
24 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
liked
a model
about 1 month ago
allenai/OLMo-2-1124-7B
View all activity
Organizations
None yet
Papers
6
arxiv:
2501.13629
arxiv:
2403.06765
arxiv:
2402.12659
arxiv:
2401.08508
Expand 6 papers
models
5
Sort: Recently updated
klyang/deepseek-math-7b-sft
Updated
Nov 4, 2024
•
8
klyang/MentaLLaMA-chat-7B-hf
Text Generation
•
Updated
Jan 11, 2024
•
172
•
3
klyang/MentaLLaMA-33B-lora
Updated
Oct 31, 2023
•
4
klyang/MentaLLaMA-chat-7B
Text Generation
•
Updated
Sep 28, 2023
•
3.05k
•
18
klyang/MentaLLaMA-chat-13B
Text Generation
•
Updated
Sep 27, 2023
•
206
•
16
datasets
None public yet