David Golchinfar's picture

David Golchinfar PRO

DavidGF

·

https://vago-solutions.ai

DavidGFar

dgolchin

AI & ML interests

finetune llms, improve german language understanding and generated text of llms

Organizations

DavidGF's activity

upvoted a collection about 1 month ago

ablation-models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated 3 days ago • 20

upvoted 2 articles about 1 month ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 48

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 193

upvoted a collection about 1 month ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 557

upvoted a collection about 2 months ago

🇩🇪German SFT and DPO datasets

Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 30 items • Updated 5 days ago • 6

upvoted 2 papers 2 months ago

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20 • 17

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 567

upvoted a paper 4 months ago

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Paper • 2401.17377 • Published Jan 30 • 32

upvoted a paper 5 months ago

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 55