mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
commented on
a paper
26 days ago
TransMLA: Multi-head Latent Attention Is All You Need
commented on
a paper
26 days ago
TransMLA: Multi-head Latent Attention Is All You Need
updated
a collection
28 days ago
CLOVER-Commonsense-148k
Organizations
None yet
Collections
8
models
55
fxmeng/PiSSA-llama-7b-commonsense-148k
Updated
•
20
fxmeng/PiSSA-Llama-3-8b-commonsense-148k
Updated
•
15
fxmeng/PiSSA-Llama-2-7b-commonsense-148k
Updated
•
16
fxmeng/PiSSA-llama-13b-commonsense-148k
Updated
•
22
fxmeng/CLOVER-llama-3-8b-commonsense-148k
Updated
•
13
fxmeng/CLOVER-llama-2-7b-commonsense-148k
Updated
•
18
fxmeng/CLOVER-llama-13b-commonsense-148k
Updated
•
18
fxmeng/CLOVER-llama-7b-commonsense-148k
Updated
•
15
fxmeng/TransMLA_qwen2.5_0.5b_instruct
Updated
fxmeng/TransMLA_llama3.2_1b_instruct
Updated
datasets
9
fxmeng/pissa-dataset
Viewer
•
Updated
•
844k
•
852
•
2
fxmeng/big-bench-hard-continue-finetuning
Viewer
•
Updated
•
10.3k
•
242
fxmeng/commonsense_filtered
Viewer
•
Updated
•
170k
•
269
•
1
fxmeng/MetaMath-GSM240K
Viewer
•
Updated
•
240k
•
77
•
1
fxmeng/MetaMath-MATH155K
Viewer
•
Updated
•
155k
•
58
fxmeng/CodeFeedback-Python105K
Viewer
•
Updated
•
105k
•
664
•
5
fxmeng/llava_finetune_336x336
Preview
•
Updated
•
65
fxmeng/llava_pretrain_336x336
Preview
•
Updated
•
58
fxmeng/WizardLM_evol_instruct_V2_143k
Viewer
•
Updated
•
143k
•
93
•
2