Piyush Maharana's picture

Piyush Maharana

catastropiyush

·

https://catastropiyush.github.io/

catastropiyush

AI & ML interests

LLMs for scientific data extraction, Solid State Hydrogen Storage,Machine Learning

Recent Activity

updated a model about 10 hours ago

catastropiyush/llama3_1_GRPO

published a model about 10 hours ago

catastropiyush/llama3_1_GRPO

upvoted an article 1 day ago

The N Implementation Details of RLHF with PPO

View all activity

Organizations

catastropiyush's activity

upvoted an article 1 day ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

• 33

upvoted an article 8 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

By

•

9 days ago

• 23

upvoted an article 9 days ago

Article

We now support VLMs in smolagents!

15 days ago

• 71

upvoted an article 11 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

16 days ago

• 119

upvoted an article 12 days ago

Article

Getting Started With Embeddings

Jun 23, 2022

• 49

upvoted an article 15 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

15 days ago

• 59

upvoted 2 papers 17 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 22 days ago • 36

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 22 days ago • 67

upvoted a collection 23 days ago

Tools 4 Agents

This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools • 5 items • Updated 24 days ago • 4

upvoted a collection 24 days ago

mechanistic interpretability with sparse autoencoders

A collection of papers that I found useful for learning about using Sparse Autoencoders for finding interpretable features in language models • 9 items • Updated Sep 3, 2024 • 1

upvoted an article 25 days ago

Article

Mastering Tensor Dimensions in Transformers

By

•

26 days ago

• 42

upvoted an article about 1 month ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

and 5 others •

Dec 23, 2024

• 18

upvoted a paper about 1 month ago

Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Paper • 2411.15221 • Published Nov 20, 2024 • 28

upvoted an article about 1 month ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

By

•

Aug 26, 2024

• 49

upvoted a collection about 2 months ago

timm tiny test models

A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. • 13 items • Updated Oct 2, 2024 • 5

upvoted 2 collections 2 months ago

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 3 days ago • 35

Unsloth 4-bit Dynamic Quants

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 20 items • Updated 1 day ago • 39

upvoted 2 articles 6 months ago

Article

Open LLM Leaderboard: DROP deep dive

Dec 1, 2023

• 6

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 112

upvoted a collection 6 months ago

4bit Instruct Models

18 items • Updated 3 days ago • 28