Zhimeng Guo

zhimeng

https://zhimeng.page

AI & ML interests

Machine Learning

Recent Activity

updated a model 3 days ago

zhimeng/Qwen2.5-1.5B-Open-R1-Code-GRPO

liked a dataset 6 days ago

hkust-nlp/SimpleRL-Zoo-Data

updated a model about 1 month ago

zhimeng/Qwen-2.5-7B-Simple-RL

View all activity

Organizations

zhimeng's activity

upvoted a paper 5 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

upvoted a paper 12 months ago

PointInfinity: Resolution-Invariant Point Diffusion Models

Paper • 2404.03566 • Published Apr 4, 2024 • 16

upvoted 12 papers about 1 year ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 64

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Paper • 2403.05438 • Published Mar 8, 2024 • 21

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22, 2024 • 30

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 31

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Paper • 2402.08017 • Published Feb 12, 2024 • 27

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Paper • 2402.07456 • Published Feb 12, 2024 • 44

Model Editing with Canonical Examples

Paper • 2402.06155 • Published Feb 9, 2024 • 13

upvoted a collection about 1 year ago

OLMo Suite

Collection

Artifacts for the first set of OLMo models. • 18 items • Updated 22 days ago • 71

upvoted 3 papers about 1 year ago

Scavenging Hyena: Distilling Transformers into Long Convolution Models

Paper • 2401.17574 • Published Jan 31, 2024 • 17

Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach

Paper • 2401.02987 • Published Jan 2, 2024 • 10

Instruct-Imagen: Image Generation with Multi-modal Instruction

Paper • 2401.01952 • Published Jan 3, 2024 • 31

upvoted a paper over 1 year ago

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 37