Kashif Rasul's picture

Kashif Rasul

kashif

·

krasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Articles

Constitutional AI with Open LLMs

Patch Time Series Transformer in Hugging Face

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

upvoted a collection 17 days ago

LLaMA-3-8B-SFR-Instruct-R

3 items • Updated about 21 hours ago • 13

upvoted 2 papers 2 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 58

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 18

upvoted a collection 2 months ago

Moirai-1.0-R models

3 items • Updated 21 days ago • 21

upvoted a collection 3 months ago

Chronos Models

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 6 items • Updated Mar 18 • 25

upvoted a collection 4 months ago

datasets-SPIN

Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 10

upvoted a paper 5 months ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 11

upvoted 4 papers 6 months ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 116

NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval

Paper • 2310.14282 • Published Oct 22, 2023 • 5

Diffusion Model Alignment Using Direct Preference Optimization

Paper • 2311.12908 • Published Nov 21, 2023 • 47

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 172

upvoted 2 papers 7 months ago

Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 26

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 75

upvoted a collection 7 months ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 24

upvoted a paper 7 months ago

Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 39