59 18 54

David Adelani

Davlan

https://dadelani.github.io/

AI & ML interests

Low resource NLP

Recent Activity

upvoted a collection 6 days ago

GemmaX2

liked a model 13 days ago

ymoslem/ModernBERT-base-long-context-qe-v1

new activity 30 days ago

Davlan/afro-xlmr-large-76L:Adding `safetensors` variant of this model

View all activity

Organizations

Davlan's activity

upvoted a collection 6 days ago

GemmaX2

Collection

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated 4 days ago • 2

upvoted a collection about 2 months ago

Multilingual LLM Evaluation

Collection

Multilingual Evaluation Benchmarks • 6 items • Updated Dec 13, 2024 • 10

upvoted a collection 8 months ago

IrokoBench

Collection

a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 18

upvoted a paper 10 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

upvoted an article 10 months ago

Article

Fine-tune Llama 3 with ORPO

•

Apr 22, 2024

• 232

upvoted a paper 10 months ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 21

upvoted 2 papers 11 months ago

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

Paper • 2311.08849 • Published Nov 15, 2023 • 5

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 185

upvoted 2 papers 12 months ago

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 53

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

Paper • 2402.14086 • Published Feb 21, 2024 • 9

upvoted 6 papers about 1 year ago

SpiRit-LM: Interleaved Spoken and Written Language Model

Paper • 2402.05755 • Published Feb 8, 2024 • 14

SeaLLMs -- Large Language Models for Southeast Asia

Paper • 2312.00738 • Published Dec 1, 2023 • 24

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

upvoted 2 papers over 1 year ago

Prompting Large Language Models with Speech Recognition Abilities

Paper • 2307.11795 • Published Jul 21, 2023 • 17

Less is More: Parameter-Free Text Classification with Gzip

Paper • 2212.09410 • Published Dec 19, 2022 • 3