Chih-Kai Yang

zenyn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

upvoted a paper about 9 hours ago

Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

upvoted a paper about 9 hours ago

Could Thinking Multilingually Empower LLM Reasoning?

View all activity

Organizations

zenyn's activity

upvoted 5 papers about 9 hours ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published 5 days ago • 38

Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Paper • 2504.13816 • Published 4 days ago • 14

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published 7 days ago • 23

EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models

Paper • 2504.15133 • Published 1 day ago • 13

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

Paper • 2504.14655 • Published 2 days ago • 12

upvoted 12 papers 5 days ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published 14 days ago • 103

A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Paper • 2504.07086 • Published 13 days ago • 20

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published 14 days ago • 39

SAEs Can Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs

Paper • 2504.08192 • Published 12 days ago • 4

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published 15 days ago • 11

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 11 days ago • 53

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 8 days ago • 39

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 12 days ago • 45

SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning

Paper • 2504.09081 • Published 11 days ago • 16

upvoted a paper 18 days ago

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published 19 days ago • 27

updated a dataset 24 days ago

Morioh/livingroom

Updated Mar 4 • 290

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385