diwank (Diwank Tomer)

upvoted a collection about 1 hour ago

Yi-1.5 (2024/05)

Collection

6 items • Updated 6 days ago • 61

upvoted 4 papers 8 days ago

upvoted an article 9 days ago

Article

Introducing the Open Chain of Thought Leaderboard

26 days ago

• 20

upvoted a collection 11 days ago

Granite Code Models

Collection

A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 10 items • Updated 6 days ago • 117

upvoted 4 papers 14 days ago

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Paper • 2404.13208 • Published 29 days ago • 37

Capabilities of Gemini Models in Medicine

Paper • 2404.18416 • Published 20 days ago • 21

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published 16 days ago • 44

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 16 days ago • 92

upvoted a collection 15 days ago

GreenBitAI MLX LLM

Collection

GreenBitAI's Low-bit LLMs in MLX format • 69 items • Updated 12 days ago • 4

upvoted 3 papers 16 days ago

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

Paper • 2402.03216 • Published Feb 5 • 2

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15 • 3

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12 • 10

upvoted 4 papers 17 days ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published 18 days ago • 91

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published 18 days ago • 41

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published 18 days ago • 61

Extending Llama-3's Context Ten-Fold Overnight

Paper • 2404.19553 • Published 18 days ago • 28

upvoted a paper 18 days ago

Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2 • 13

upvoted a collection 19 days ago

🦢SWIM-IR Dataset

Collection

29 million Synthetic Wikipedia-based Multilingual Retrieval Training Pairs. • 4 items • Updated 20 days ago • 6

upvoted an article 21 days ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

22 days ago

• 54

upvoted a paper 25 days ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published 26 days ago • 230

upvoted a collection 25 days ago

Phi-3

Collection

Phi-3 family of models • 7 items • Updated 1 day ago • 200

upvoted 2 articles 26 days ago

Article

Fine-tune Llama 3 with ORPO

26 days ago

• 177

Article

On Coding Your First Attention

27 days ago

• 7

upvoted 2 collections 27 days ago

Eurus

Collection

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Apr 15 • 22

Ultra Series

Collection

UltraLM, UltraRM and UltraCM. • 8 items • Updated Apr 1 • 5

upvoted a collection 28 days ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 89

upvoted a collection 30 days ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated about 1 month ago • 522

upvoted 8 papers about 1 month ago

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1 • 21

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 20

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11 • 15

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 42

Compression Represents Intelligence Linearly

Paper • 2404.09937 • Published Apr 15 • 27

Audio Dialogues: Dialogues dataset for audio and music understanding

Paper • 2404.07616 • Published Apr 11 • 14

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 79

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 61

upvoted 2 collections about 1 month ago

LLM evaluation datasets

Collection

32 items • Updated Sep 8, 2023 • 6

Leaderboards and benchmarks ✨

Collection

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 61 items • Updated 4 days ago • 59

upvoted 5 papers about 1 month ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9 • 62

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 57

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

Paper • 2404.04125 • Published Apr 4 • 26

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 79

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 92

upvoted a collection about 1 month ago

CodeGemma Release

Collection

16 items • Updated 4 days ago • 58

upvoted an article about 1 month ago

Article

Public Policy at Hugging Face

Apr 8

• 16

upvoted a paper about 1 month ago

TimeGPT-1

Paper • 2310.03589 • Published Oct 5, 2023 • 3

upvoted an article about 1 month ago

Article

Data is better together

Mar 4

• 4

upvoted 2 papers about 1 month ago

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

Paper • 2310.01801 • Published Oct 3, 2023 • 3

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7 • 43

upvoted a collection about 1 month ago

🤖 Agents

Collection

16 items • Updated 24 days ago • 20

upvoted a paper about 1 month ago

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22 • 24

upvoted a collection about 1 month ago

Dataset generation

Collection

112 items • Updated 21 days ago • 16

upvoted 5 papers about 1 month ago

Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18 • 30

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 71

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4 • 21

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3 • 46

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

upvoted a collection about 1 month ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 76

Diwank Tomer PRO

