Kashif Rasul's picture

Kashif Rasul

kashif

·

krasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Articles

🧨 Diffusers welcomes Stable Diffusion 3

Constitutional AI with Open LLMs

Patch Time Series Transformer in Hugging Face

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

upvoted 4 articles 10 days ago

Article

Putting RL back in RLHF

27 days ago

• 53

Article

🧨 Diffusers welcomes Stable Diffusion 3

27 days ago

• 71

Article

The Annotated Diffusion Model

Jun 7, 2022

• 45

Article

Welcome Gemma 2 - Google's new open LLM

12 days ago

• 86

upvoted a paper 13 days ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published 14 days ago • 73

upvoted a paper 15 days ago

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published 25 days ago • 18

upvoted a paper 20 days ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published 28 days ago • 9

upvoted 2 papers 4 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 59

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 18

upvoted 2 collections 4 months ago

Moirai-1.0-R models

6 items • Updated 16 days ago • 25

Chronos Models & Datasets

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 8 items • Updated 11 days ago • 26

upvoted a collection 5 months ago

datasets-SPIN

Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 10

upvoted 3 papers 7 months ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 11

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 119

NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval

Paper • 2310.14282 • Published Oct 22, 2023 • 5

upvoted 4 papers 8 months ago

Diffusion Model Alignment Using Direct Preference Optimization

Paper • 2311.12908 • Published Nov 21, 2023 • 47

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 176

Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 26

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 76

upvoted a collection 8 months ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 24

upvoted a paper 9 months ago

Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 39