Zhi-Yi Chin

joycenerd

https://joycenerd.github.io/

AI & ML interests

Trustworthy AI, Generative Model, Self-supervised Learning

joycenerd's activity

upvoted 2 papers 3 months ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 30

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 12

upvoted a paper 4 months ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published Dec 18, 2024 • 24

upvoted a paper 6 months ago

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Paper • 2410.02762 • Published Oct 3, 2024 • 9

upvoted 3 papers 8 months ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5, 2024 • 29

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 61

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1, 2024 • 14

upvoted 2 papers 9 months ago

Self-Recognition in Language Models

Paper • 2407.06946 • Published Jul 9, 2024 • 26

A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Paper • 2407.02551 • Published Jul 2, 2024 • 9

upvoted a collection 10 months ago

P4D Red-teamer

Resources for ICML 2024 paper "Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts" • 2 items • Updated Aug 27, 2024 • 2

upvoted 3 papers 10 months ago

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published May 30, 2024 • 36

EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

Paper • 2405.18991 • Published May 29, 2024 • 12

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published May 29, 2024 • 21

upvoted 3 papers 11 months ago

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19, 2024 • 57

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 122

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 17

upvoted 2 papers 12 months ago

MoDE: CLIP Data Experts via Clustering

Paper • 2404.16030 • Published Apr 24, 2024 • 14

A Multimodal Automated Interpretability Agent

Paper • 2404.14394 • Published Apr 22, 2024 • 21

upvoted 2 papers about 1 year ago

Measuring Style Similarity in Diffusion Models

Paper • 2404.01292 • Published Apr 1, 2024 • 17

TextCraftor: Your Text Encoder Can be Image Quality Controller

Paper • 2403.18978 • Published Mar 27, 2024 • 15