Florian Zimmermeister's picture

Florian Zimmermeister

flozi00

·

AI & ML interests

ASR, German LLM

Organizations

$A\\Ware's profile picture$

flozi00's activity

upvoted a paper 10 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 12 days ago • 166

upvoted an article about 1 month ago

Article

The Large Language Model Course

By

•

Jan 16

• 96

upvoted a paper about 1 month ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26

upvoted a paper 2 months ago

Transformers Can Navigate Mazes With Multi-Step Prediction

Paper • 2412.05117 • Published Dec 6, 2024 • 5

upvoted an article 2 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13, 2024

• 439

upvoted 2 articles 3 months ago

Article

Releasing the largest multilingual open pretraining dataset

By

and 2 others •

Nov 13, 2024

• 98

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

By

•

Nov 9, 2024

• 9

upvoted a paper 3 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 48

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 219

upvoted a paper 4 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

upvoted an article 4 months ago

Article

Welcome, Gradio 5

Oct 9, 2024

• 124

upvoted 2 papers 5 months ago

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Paper • 2407.10960 • Published Jul 15, 2024 • 12

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13

upvoted a collection 5 months ago

Strong German fp8 LLM's

Strong Large Language Models for the german language in fp8 format • 6 items • Updated Sep 24, 2024 • 3

upvoted 3 papers 5 months ago

Parameter-Efficient Fine-Tuning with Discrete Fourier Transform

Paper • 2405.03003 • Published May 5, 2024 • 8

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11, 2024 • 32

upvoted a collection 5 months ago

INT4 LLMs for vLLM

Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 18 items • Updated Sep 26, 2024 • 8

upvoted a collection 6 months ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 65

upvoted a collection 7 months ago

Research projects on top of vLLM

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29, 2024 • 12