Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Paper • 2404.08801 • Published Apr 2024 • 55
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 2024 • 55
Jamba: A Hybrid Transformer-Mamba Language Model Paper • 2403.19887 • Published Mar 2024 • 96
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Paper • 2404.02893 • Published Apr 2024 • 19
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Paper • 2403.12968 • Published Mar 19, 2024 • 20
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14, 2024 • 53
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12, 2024 • 37
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7, 2024 • 15
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6, 2024 • 170
A Critical Evaluation of AI Feedback for Aligning Large Language Models Paper • 2402.12366 • Published Feb 19, 2024 • 3
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 560
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 87
Orca-Math: Unlocking the potential of SLMs in Grade School Math Paper • 2402.14830 • Published Feb 16, 2024 • 23
GPTVQ: The Blessing of Dimensionality for LLM Quantization Paper • 2402.15319 • Published Feb 23, 2024 • 19
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning Paper • 2402.04833 • Published Feb 7, 2024 • 6
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6, 2024 • 102
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29, 2024 • 45
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models Paper • 2312.06585 • Published Dec 11, 2023 • 26
Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses Paper • 2312.00763 • Published Dec 1, 2023 • 18
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 38
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention Paper • 2311.15786 • Published Nov 27, 2023 • 7
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 16
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 23
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery Paper • 2310.18356 • Published Oct 24, 2023 • 22
A Simple and Effective Pruning Approach for Large Language Models Paper • 2306.11695 • Published Jun 20, 2023 • 2
NEFTune: Noisy Embeddings Improve Instruction Finetuning Paper • 2310.05914 • Published Oct 9, 2023 • 13
Sparse Finetuning for Inference Acceleration of Large Language Models Paper • 2310.06927 • Published Oct 10, 2023 • 14
FlashDecoding++: Faster Large Language Model Inference on GPUs Paper • 2311.01282 • Published Nov 2, 2023 • 30
The Impact of Depth and Width on Transformer Language Model Generalization Paper • 2310.19956 • Published Oct 30, 2023 • 8
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks Paper • 2309.17410 • Published Sep 29, 2023 • 4
Representation Engineering: A Top-Down Approach to AI Transparency Paper • 2310.01405 • Published Oct 2, 2023 • 4
Efficient Streaming Language Models with Attention Sinks Paper • 2309.17453 • Published Sep 29, 2023 • 13
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Paper • 2309.00398 • Published Sep 1, 2023 • 18