MoritzLaurer (Moritz Laurer)

upvoted an article 2 days ago

Article

Space secrets security update

3 days ago

• 36

upvoted an article 4 days ago

Article

Benchmarking Text Generation Inference

5 days ago

• 17

upvoted a paper 4 days ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published 5 days ago • 12

upvoted an article 6 days ago

Article

SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model

By

•

Apr 19

• 3

upvoted an article 11 days ago

Article

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

13 days ago

• 8

upvoted a paper 13 days ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published 18 days ago • 73

upvoted a collection 20 days ago

NuNerZero - Zero Shot NER

Collection

The best compact Zero-Shot NER models with MIT license • 4 items • Updated 23 days ago • 13

upvoted an article 22 days ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 46

upvoted a paper 26 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 115

upvoted a paper 27 days ago

What matters when building vision-language models?

Paper • 2405.02246 • Published about 1 month ago • 87

upvoted 2 articles 27 days ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 53

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 134

upvoted 2 collections about 1 month ago

PDF Document / OCR Datasets

Collection

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 37

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 99

upvoted a collection about 2 months ago

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 27 days ago • 83

upvoted a paper about 2 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 58

upvoted an article about 2 months ago

Article

Total noob’s intro to Hugging Face Transformers

Mar 22

• 21

upvoted 2 papers 3 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 125

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Paper • 2402.12366 • Published Feb 19 • 3

upvoted 2 collections 3 months ago

Reward models on the hub

Collection

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 24

🤗 Spaces Helper

Collection

5 items • Updated Mar 19 • 2

upvoted a collection 4 months ago

⛔️🔦 Provenance, Watermarking & Deepfake Detection

Collection

Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1 • 36

upvoted 2 papers 4 months ago

Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8 • 16

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 135

upvoted a collection 5 months ago

Universal token classification

Collection

Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks. • 9 items • Updated 3 days ago • 9

upvoted 2 papers 5 months ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 73

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1 • 21

upvoted 2 papers 6 months ago

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Paper • 2311.14904 • Published Nov 25, 2023 • 3

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 78

upvoted a collection 6 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 126

upvoted a collection 7 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 80

upvoted 3 papers 7 months ago

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 31

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 116

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Paper • 2310.15941 • Published Oct 24, 2023 • 6

upvoted 3 papers 8 months ago

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 50

tasksource: Structured Dataset Preprocessing Annotations for Frictionless Extreme Multi-Task Learning and Evaluation

Paper • 2301.05948 • Published Jan 14, 2023 • 3

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 33

Moritz Laurer

AI & ML interests

Articles

Synthetic data: save money, time and carbon with open source

Organizations

MoritzLaurer's activity

Space secrets security update

Benchmarking Text Generation Inference

SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

Improving Prompt Consistency with Structured Generations

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Total noob’s intro to Hugging Face Transformers