Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
14
YigeYuan
1t4chi
Follow
TTTXXX01's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
8 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
published
a model
8 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
updated
a model
23 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT
View all activity
Organizations
None yet
1t4chi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
8 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
Updated
8 days ago
•
5
published
a model
8 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
Updated
8 days ago
•
5
updated
a model
23 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT
Text Generation
•
Updated
23 days ago
•
300
published
a model
23 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT
Text Generation
•
Updated
23 days ago
•
300
published
a model
24 days ago
1t4chi/Qwen2.5-Math-7B-QwQMath6K-SFT
Updated
24 days ago
updated
a model
27 days ago
1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532
Updated
27 days ago
•
19
published
a model
27 days ago
1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532
Updated
27 days ago
•
19
published
a model
29 days ago
1t4chi/Qwen2.5-Math-7B-4GPU-Nothink-KL0.00005
Updated
29 days ago
published
a model
30 days ago
1t4chi/Qwen2.5-Math-7B-HJX8k-4GPU-Nothink-KL0.0-FindData
Updated
30 days ago
published
a model
about 1 month ago
1t4chi/mistral-7b-base-simper
Updated
Feb 21
liked
a Space
3 months ago
Running
350
350
Reward Bench Leaderboard
📐
Explore and analyze RewardBench leaderboard data
liked
a model
4 months ago
RLHFlow/RewardModel-Mistral-7B-for-DPA-v1
Text Classification
•
Updated
May 23, 2024
•
599
•
3
liked
3 models
5 months ago
allenai/tulu-v2.5-dpo-13b-hh-rlhf
Text Generation
•
Updated
Jun 14, 2024
•
4
•
1
allenai/tulu-2-dpo-13b
Text Generation
•
Updated
May 17, 2024
•
2.68k
•
20
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
May 9, 2024
•
32
•
10
liked
3 datasets
5 months ago
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
Oct 18, 2024
•
164k
•
4.64k
•
133
PKU-Alignment/PKU-SafeRLHF-10K
Viewer
•
Updated
Jul 20, 2023
•
10k
•
315
•
63
unalignment/toxic-dpo-v0.2
Viewer
•
Updated
Jan 9, 2024
•
541
•
836
•
126
liked
2 models
5 months ago
ChenmieNLP/Zephyr-7B-Beta-Helpful
Text Generation
•
Updated
Oct 10, 2024
•
4
•
1
HelpingAI/HelpingAI-9B
Text Generation
•
Updated
Oct 31, 2024
•
130
•
25
Load more