Seriny's picture

26 11

Seriny

JIMIY

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

upvoted a paper 24 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper 24 days ago

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

View all activity

Organizations

None yet

JIMIY's activity

upvoted a paper 1 day ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 2 days ago • 61

upvoted 19 papers 24 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 27 days ago • 183

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

Paper • 2402.10211 • Published Feb 15, 2024 • 14

DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization

Paper • 2402.09812 • Published Feb 15, 2024 • 16

GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering

Paper • 2402.10128 • Published Feb 15, 2024 • 18

Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15, 2024 • 25

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion

Paper • 2402.10009 • Published Feb 15, 2024 • 22

BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 22

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 35

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15, 2024 • 42

MPIrigen: MPI Code Generation through Domain-Specific Language Models

Paper • 2402.09126 • Published Feb 14, 2024 • 15

Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers

Paper • 2402.08958 • Published Feb 14, 2024 • 6

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

Paper • 2402.08714 • Published Feb 13, 2024 • 14

GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency

Paper • 2402.08855 • Published Feb 13, 2024 • 14

Computing Power and the Governance of Artificial Intelligence

Paper • 2402.08797 • Published Feb 13, 2024 • 15

Tandem Transformers for Inference Efficient LLMs

Paper • 2402.08644 • Published Feb 13, 2024 • 10

NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs

Paper • 2402.08622 • Published Feb 13, 2024 • 6

IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation

Paper • 2402.08682 • Published Feb 13, 2024 • 14

ChatCell: Facilitating Single-Cell Analysis with Natural Language

Paper • 2402.08303 • Published Feb 13, 2024 • 13

Learning Continuous 3D Words for Text-to-Image Generation

Paper • 2402.08654 • Published Feb 13, 2024 • 12