Andrei Alexandru

inwaves

inwaves

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Atla Selene Mini: A General Purpose Evaluation Model

upvoted a paper about 1 month ago

Atla Selene Mini: A General Purpose Evaluation Model

liked a Space about 1 month ago

AtlaAI/selene-1-mini-tech-report

View all activity

Organizations

inwaves's activity

authored a paper about 1 month ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published Jan 27 • 33

upvoted a paper about 1 month ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published Jan 27 • 33

liked a Space about 1 month ago

Selene 1 Mini Tech Report

🧠

Selene 1 Mini: Technical Report

upvoted an article about 1 month ago

Article

Selene 1 Mini: the best small language model-as-a-judge

and 10 others •

Jan 29

• 12

published an article about 1 month ago

Article

Selene 1 Mini: the best small language model-as-a-judge

and 10 others •

Jan 29

• 12

liked a model about 1 month ago

AtlaAI/Selene-1-Mini-Llama-3.1-8B

Text Generation • Updated 21 days ago • 13.7k • 74

upvoted a paper about 1 month ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 63

updated a model about 2 months ago

AtlaAI/Selene-1-Mini-Llama-3.1-8B

Text Generation • Updated 21 days ago • 13.7k • 74

liked 2 models 2 months ago

maldv/Qwentile2.5-32B-Instruct

Text Generation • Updated Jan 9 • 712 • 32

failspy/Llama-3-8B-Instruct-MopeyMule

Text Generation • Updated May 30, 2024 • 126 • 78

liked a model 3 months ago

microsoft/Phi-3.5-mini-instruct

Text Generation • Updated 7 days ago • 313k • • 835

updated 2 datasets 4 months ago

inwaves/magpie_ultra_length_tail

Viewer • Updated Nov 14, 2024 • 22.8k • 55

inwaves/SkyworkReward-v0.2-PromptExtracted

Viewer • Updated Nov 13, 2024 • 73.2k • 56

upvoted a paper 4 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 46

upvoted an article 4 months ago

Article

Experimenting with different training objectives for an AI evaluator

and 1 other •

Oct 31, 2024

• 2

published an article 4 months ago

Article

Experimenting with different training objectives for an AI evaluator

and 1 other •

Oct 31, 2024

• 2

liked a Space 4 months ago

344

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data

upvoted a paper 5 months ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 17

liked a model 6 months ago

meta-llama/Llama-3.1-70B-Instruct

Text Generation • Updated Dec 15, 2024 • 751k • • 792

upvoted an article 6 months ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

•

Apr 24, 2024

• 62