Ramesh
rameshch
·
AI & ML interests
None yet
Recent Activity
new activity
3 days ago
Qwen/Qwen2.5-Omni-7B:Thank You for Open-Sourcing Your Model & Feedback
liked
a model
4 days ago
Qwen/Qwen2.5-VL-32B-Instruct
new activity
4 days ago
google/gemma-3-27b-it:Tokens generated per second
Organizations
rameshch's activity
Thank You for Open-Sourcing Your Model & Feedback
3
#8 opened 3 days ago
by
rameshch
Tokens generated per second
2
#39 opened 5 days ago
by
rameshch
Thank You for Open-Sourcing Your Model & Feedback
1
#4 opened 5 days ago
by
rameshch
How do we use it with Transformers? can you give some sample code ?
8
#22 opened 11 days ago
by
rameshch
Can you share sample inference code?
#1 opened 28 days ago
by
rameshch
Merging "Qwen2.5-7B-Instruct" text adapter into "Qwen2.5-VL-7B-Instruct" model
1
#9 opened about 2 months ago
by
rameshch
Video Inference - TypeError: process_vision_info() got an unexpected keyword argument 'return_video_kwargs'
3
#8 opened about 2 months ago
by
rameshch
Merging Text and Image adapters
#9 opened 4 months ago
by
rameshch
Error(s) in loading state_dict for PeftModelForCausalLM:
2
#23 opened 6 months ago
by
rameshch
Flash Attention Support
2
#41 opened 6 months ago
by
rameshch
Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE
10
#68 opened 8 months ago
by
rameshch
llava-Onevision-projector for LLama-3.1-8B Model
1
#4 opened 8 months ago
by
rameshch
RuntimeError: only Tensors of floating point dtype can require gradients
1
#69 opened 7 months ago
by
rameshch
Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE
10
#68 opened 8 months ago
by
rameshch
RuntimeError: only Tensors of floating point dtype can require gradients
1
#69 opened 7 months ago
by
rameshch
llava-Onevision-projector for LLama-3.1-8B Model
1
#4 opened 8 months ago
by
rameshch