Anthony Ivan S

anthonyivn

anthonyivn2

AI & ML interests

None yet

Recent Activity

liked a Space 29 days ago

smolagents/smolagents-leaderboard

updated a model about 1 month ago

anthonyivn/ModernBERT-Base-llm-router

published a model about 1 month ago

anthonyivn/ModernBERT-Base-llm-router

Organizations

None yet

anthonyivn's activity

upvoted 2 papers about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 221

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 149

upvoted an article 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted a paper 3 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 42

upvoted 2 articles 3 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

• 95

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 170

upvoted a paper 3 months ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

upvoted a paper 5 months ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 68

upvoted 3 papers 7 months ago

upvoted a paper 9 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 138

upvoted a collection 9 months ago

InternLM2.5

Collection

14 items • Updated Feb 11 • 71

upvoted 3 papers 10 months ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25, 2024 • 23

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 96

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

upvoted an article 10 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 85