fpreiss (Fabian A. Preiß)

upvoted an article 8 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

• 33

upvoted a paper 9 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 90

upvoted 2 papers 10 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 69

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 36

upvoted 3 papers 11 months ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 128

upvoted an article 11 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 234

upvoted a collection 11 months ago

fuck quadratic attention

Collection

11 items • Updated Apr 24, 2024 • 23

upvoted 5 papers 11 months ago

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Paper • 2310.06770 • Published Oct 10, 2023 • 5

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19, 2024 • 17

upvoted 2 collections 12 months ago

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 141

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 234

upvoted 2 collections about 1 year ago

Leaderboards and benchmarks ✨

Collection

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated 14 days ago • 98

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 11 items • Updated Jan 26, 2024 • 36

Fabian A. Preiß

AI & ML interests

Organizations

fpreiss's activity

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

ChatQA: Building GPT-4 Level Conversational QA Models

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Fine-tune Llama 3 with ORPO

fuck quadratic attention

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Advancing LLM Reasoning Generalists with Preference Trees

ReFT: Representation Finetuning for Language Models

Lost in the Middle: How Language Models Use Long Contexts

MoEs papers reading list

Model Merging

Leaderboards and benchmarks ✨

Tiny Series