abdullah (Abdullah Abdelrhim)

upvoted a paper 2 days ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published 4 days ago • 27

upvoted a collection 7 days ago

Wikimedia Datasets

Wikimedia datasets, across languages and modalities, from different Wikimedia projects, on the hub. Not all tested. • 19 items • Updated 7 days ago • 9

upvoted an article 8 days ago

Article

Introducing the Open Arabic LLM Leaderboard

10 days ago

• 45

upvoted a paper 12 days ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published 16 days ago • 6

upvoted a paper 20 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published 25 days ago • 110

upvoted 2 papers 22 days ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published 23 days ago • 41

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published 23 days ago • 61

upvoted an article 24 days ago

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

24 days ago

• 26

upvoted a paper 24 days ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 25

upvoted a collection 27 days ago

Text-to-text Generation Models (LLMs, Llama, GPT, ...)

Collection

748 items • Updated about 8 hours ago • 7

upvoted an article 27 days ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

By

•

27 days ago

• 55

upvoted 2 papers about 1 month ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 235

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 51

upvoted an article about 1 month ago

Article

Fine-tune Llama 3 with ORPO

By

•

about 1 month ago

• 181

upvoted a collection about 2 months ago

Multilingual Models Chat Spaces

Collection

Here you find Chat spaces to interact and test multilingual models but the goal here is to test on Arabic • 2 items • Updated Apr 6 • 1

upvoted 5 papers about 2 months ago

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5 • 8

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 20

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 101

Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition

Paper • 2404.00565 • Published Mar 31 • 6

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 53

upvoted a collection about 2 months ago

A little guide to building Large Language Models in 2024

Collection

Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 19 items • Updated Apr 1 • 13

upvoted 2 papers about 2 months ago

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29 • 34

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26 • 25

upvoted 2 collections 2 months ago

🔮 Mixture of Experts

Collection

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 13 items • Updated Mar 22 • 21

Preference Datasets for KTO

Collection

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Mar 19 • 10

upvoted 4 papers 2 months ago

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11 • 52

upvoted a collection 2 months ago

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11 • 38

upvoted a paper 2 months ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 59

upvoted 11 papers 3 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 61

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 173

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Paper • 2403.02775 • Published Mar 5 • 11

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 92

PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22 • 22

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21 • 104

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 73

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 15

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Paper • 2402.10963 • Published Feb 13 • 9

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16 • 41

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 90

upvoted a collection 3 months ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 11 days ago • 173

upvoted a paper 3 months ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 33

upvoted a collection 3 months ago

Arabic-LLMs

Collection

25 items • Updated 11 days ago • 1

upvoted 9 papers 4 months ago

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Paper • 2402.01622 • Published Feb 2 • 30

Efficient Exploration for LLMs

Paper • 2402.00396 • Published Feb 1 • 18

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Paper • 2401.17377 • Published Jan 30 • 32

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 39

Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30 • 15

Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon

Paper • 2401.03462 • Published Jan 7 • 25

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 38

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29 • 46

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 62

upvoted a collection 4 months ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 11 items • Updated Jan 26 • 31

upvoted 4 papers 4 months ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19 • 50

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 135

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18 • 32

Asynchronous Local-SGD Training for Language Modeling

Paper • 2401.09135 • Published Jan 17 • 9

upvoted a collection 4 months ago

Hermes

Collection

Nous' Flagship LLM Series • 21 items • Updated 8 days ago • 85

Abdullah Abdelrhim

AI & ML interests

Organizations

abdullah's activity

Introducing the Open Arabic LLM Leaderboard

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Fine-tune Llama 3 with ORPO