33 75 206

dame rajee

damerajee

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

liked a model about 10 hours ago

PowerInfer/SmallThinker-3B-Preview

liked a model 3 days ago

luodian/OTTER-Video-LLaMA7B-DenseCaption

View all activity

Organizations

damerajee's activity

upvoted a paper about 9 hours ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published 4 days ago • 11

upvoted a paper 3 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 7 days ago • 27

upvoted 2 papers 5 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 6 days ago • 37

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published 12 days ago • 12

upvoted 2 papers 6 days ago

TRecViT: A Recurrent Video Transformer

Paper • 2412.14294 • Published 12 days ago • 11

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 12 days ago • 12

upvoted an article 6 days ago

Article

Deriving DPO's Loss

•

7 days ago

• 20

upvoted a paper 12 days ago

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Paper • 2412.13194 • Published 13 days ago • 12

upvoted 3 papers 13 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 53

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 15 days ago • 25

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 7

upvoted 2 papers 15 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 17 days ago • 132

JuStRank: Benchmarking LLM Judges for System Ranking

Paper • 2412.09569 • Published 18 days ago • 19

upvoted a paper 22 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 24 days ago • 121

upvoted a paper 25 days ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published 25 days ago • 57

upvoted a paper about 1 month ago

Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages

Paper • 2411.12240 • Published Nov 19 • 6

upvoted 3 articles 2 months ago

Article

Deploying Speech-to-Speech on Hugging Face

Oct 22

• 35

Article

ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models

•

Oct 18

• 16

Article

Fixing Gradient Accumulation

Oct 16

• 44

upvoted a paper 2 months ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9 • 43