Haris Jabbar's picture

Haris Jabbar PRO

maveriq

·

AI & ML interests

Tokenization, language generation, normalizing flows, language modeling, document ai

Recent Activity

updated a dataset 18 days ago

spd-dev/codetest

published a dataset 18 days ago

spd-dev/codetest

View all activity

Organizations

maveriq's activity

upvoted an article about 2 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

• 70

upvoted a paper about 2 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 149

upvoted an article 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted a collection 4 months ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted a paper 4 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 62

upvoted a collection 4 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 26 days ago • 78

upvoted a collection 7 months ago

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 15

upvoted 2 papers 9 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 138

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Paper • 2305.11738 • Published May 19, 2023 • 8

upvoted a collection about 1 year ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 84

upvoted a paper about 1 year ago

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 116

upvoted a paper over 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

upvoted 2 collections over 1 year ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24

⭐ StarCoder

All models, datasets, and demos related to StarCoder! • 11 items • Updated Feb 27, 2024 • 23