Shyam Sudhakaran

shyamsn97

AI & ML interests

Reinforcement Learning, Open-Ended Algorithms, Neural Cellular Automata

Recent Activity

liked a dataset 15 days ago

nvidia/OpenCodeReasoning

liked a model 16 days ago

Qwen/QwQ-32B

updated a dataset 20 days ago

shyamsn97/cube

shyamsn97's activity

upvoted a collection 4 months ago

3D Modelization

48 items • Updated 9 days ago • 9

upvoted a paper 7 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 47

upvoted 2 collections 8 months ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated Sep 4, 2024 • 11

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Feb 20 • 51

upvoted 2 collections 11 months ago

Mixture-of-preference-reward-modeling

The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29, 2024 • 3

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted a paper 12 months ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 2 collections about 1 year ago

Fine-Tuned

41 items • Updated Feb 4 • 7

Merges

Experimental LLM merging • 1292 items • Updated Feb 4 • 7

upvoted a paper over 1 year ago

Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11, 2024 • 38

upvoted a collection over 1 year ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 42

upvoted a paper over 1 year ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 13

upvoted a collection over 1 year ago

🚂 SD-XL Training Suite

All the steps to train your own SD-XL custom model • 9 items • Updated Feb 14 • 22

upvoted a paper almost 2 years ago

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 51