zeng xian's picture

zeng xian

themez

·

themez

AI & ML interests

None yet

Organizations

themez's activity

upvoted 2 papers 8 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 67

Generative Photomontage

Paper • 2408.07116 • Published Aug 13, 2024 • 21

upvoted a paper 10 months ago

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17, 2024 • 32

upvoted 3 papers about 1 year ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12, 2024 • 68

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 28

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Paper • 2401.13919 • Published Jan 25, 2024 • 31

upvoted a collection about 1 year ago

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。 • 32 items • Updated Feb 18, 2024 • 54

upvoted a paper about 1 year ago

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Paper • 2401.15688 • Published Jan 28, 2024 • 11

upvoted 12 papers over 1 year ago

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 20

DREAM: Diffusion Rectification and Estimation-Adaptive Models

Paper • 2312.00210 • Published Nov 30, 2023 • 17

FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting

Paper • 2312.00451 • Published Dec 1, 2023 • 12

Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

Paper • 2311.12092 • Published Nov 20, 2023 • 23

Make Pixels Dance: High-Dynamic Video Generation

Paper • 2311.10982 • Published Nov 18, 2023 • 69

Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text

Paper • 2311.07446 • Published Nov 13, 2023 • 29

GOAT: GO to Any Thing

Paper • 2311.06430 • Published Nov 10, 2023 • 16

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 80

CodePlan: Repository-level Coding using LLMs and Planning

Paper • 2309.12499 • Published Sep 21, 2023 • 78

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Paper • 2309.11674 • Published Sep 20, 2023 • 31

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 23