firas snake

abol3z

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

echo840/ocrbench-leaderboard

liked a Space about 1 month ago

smolagents/smolagents-leaderboard

liked a Space 2 months ago

galileo-ai/agent-leaderboard

View all activity

Organizations

None yet

abol3z's activity

liked 2 Spaces about 1 month ago

148

Ocrbench Leaderboard

🏆

Display OCRBench leaderboard for model evaluations

124

smolagents LLM leaderboard

🏆

A leaderboard for LLMs powering smolagents

liked a Space 2 months ago

278

Agent Leaderboard

💬

Ranking of LLMs for agentic tasks

liked a dataset 2 months ago

galileo-ai/agent-leaderboard

Viewer • Updated Feb 11 • 1.28k • 225 • 24

upvoted 3 papers 2 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 55

liked 2 Spaces 2 months ago

369

Open Medical-LLM Leaderboard

🥇

Browse and submit LLM evaluations

Berkeley Function Calling Leaderboard

🏃

liked a dataset 3 months ago

Eladio/emrqa-msquad

Viewer • Updated Mar 10, 2024 • 164k • 189 • 4

upvoted 3 papers 3 months ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28 • 38

IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems

Paper • 2501.11067 • Published Jan 19 • 13

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385

liked a model 3 months ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 20.5k • 543

liked 2 Spaces 4 months ago

Open Universal Arabic Asr Leaderboard

🥇

A benchmark for open-source multi-dialect Arabic ASR models

708

TTS Arena

🏆

Vote on the latest TTS models!

liked 3 Spaces 6 months ago

CAMEL-Bench Leaderboard

🥇

Learderboard to Evaluate Arabic Multimodal Models

13k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

4.31k

Chatbot Arena Leaderboard

🏆

Display chatbot leaderboard and statistics