Training data and models of SliME
Yi-Fan Zhang
yifanzhang114
AI & ML interests
Yi-Fan Zhang is currently a third-year PhD student at the State Key Laboratory of Pattern Recognition, University of Chinese Academy of Sciences, working under the guidance of Prof. Tieniu Tan on robust and reliable deep learning systems and large pretrained models.
Recent Activity
Updated a model 4 days ago: yifanzhang114/v2t_1B_mllm_siglip_lr3e-4_sft
Updated a model 4 days ago: yifanzhang114/v2t_1B_mllm_siglip_lr3e-4
Updated a model 4 days ago: yifanzhang114/v2t_1B_mllm_clip_3e-4_sft
Organizations
None yet
Collections (2)
Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
- MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? • Paper • 2408.13257 • 25
- yifanzhang114/MME-RealWorld • Preview • 301 • 13
- yifanzhang114/MME-RealWorld-Base64 • Viewer • 11.5k • 149 • 1
- yifanzhang114/MME-RealWorld-CN-Lmms-eval • Viewer • 5.89k • 400 • 1
Models (8)
yifanzhang114/v2t_1B_mllm_siglip_lr3e-4_sft • Text Generation • 30
yifanzhang114/v2t_1B_mllm_siglip_lr3e-4 • Text Generation • 21
yifanzhang114/v2t_1B_mllm_clip_3e-4_sft • Text Generation • 16
yifanzhang114/v2t_1B_mllm_clip_3e-4 • Text Generation • 212
yifanzhang114/SliME-Llama3-8B • Image-Text-to-Text • 132 • 2
yifanzhang114/SliME-vicuna-7B • Image-Text-to-Text • 17 • 2
yifanzhang114/SliME-Llama3-8B-lora • Image-Text-to-Text • 14
yifanzhang114/SliME-vicuna-13B • Image-Text-to-Text • 20 • 2
Datasets (9)
yifanzhang114/MME-RealWorld-Base64 • Viewer • 11.5k • 149 • 1
yifanzhang114/MME-RealWorld-Lite • Preview • 21
yifanzhang114/MME-RealWorld-lite-lmms-eval • Viewer • 1.92k • 73 • 1
yifanzhang114/MME-RealWorld • Preview • 301 • 13
yifanzhang114/AMBER_base64 • Viewer • 14.2k • 64
yifanzhang114/MMSft • Viewer • 498k • 7
yifanzhang114/MME-RealWorld-Lmms-eval • Viewer • 23.1k • 338 • 1
yifanzhang114/MME-RealWorld-CN-Lmms-eval • Viewer • 5.89k • 400 • 1
yifanzhang114/SMR • Viewer • 558k • 95 • 4