1 9 3

Yifan Zeng

yokey

https://xhmy.github.io/

AI & ML interests

Large Language Model, Agentic AI, Deep Learning

Recent Activity

upvoted a paper about 1 month ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

new activity about 1 month ago

google/gemma-2-9b:RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.

upvoted a paper about 2 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

View all activity

Organizations

None yet

yokey's activity

upvoted a paper about 1 month ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21 • 57

New activity in google/gemma-2-9b about 1 month ago

RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.

#24 opened 5 months ago by

saireddy

upvoted 2 papers about 2 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28 • 77

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25 • 17

authored a paper 2 months ago

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Paper • 2410.16033 • Published Oct 18

liked a model 2 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25 • 165k • 1.93k

commented a paper 2 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17 • 3 •

authored 2 papers 2 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17 • 3

LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking

Paper • 2406.00231 • Published May 31

upvoted a paper 2 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17 • 3

updated a collection 2 months ago

LLM

Collection

19 items • Updated Oct 17

liked a model 2 months ago

openai-community/gpt2

Text Generation • Updated Feb 19 • 10.5M • • 2.46k

upvoted a paper 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

updated 2 collections 3 months ago

LLM

Collection

19 items • Updated Oct 17

AI4Sci

Collection

1 item • Updated Sep 14