L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published Oct 2024 • 10
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens Paper • 2401.17377 • Published Jan 30, 2024 • 34
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Paper • 2401.17093 • Published Jan 30, 2024 • 18
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 23
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 69
Teaching Language Models to Self-Improve through Interactive Demonstrations Paper • 2310.13522 • Published Oct 20, 2023 • 11
Small-scale proxies for large-scale Transformer training instabilities Paper • 2309.14322 • Published Sep 25, 2023 • 19
Multimodal Foundation Models: From Specialists to General-Purpose Assistants Paper • 2309.10020 • Published Sep 18, 2023 • 40
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch Paper • 2309.10706 • Published Sep 19, 2023 • 16
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models Paper • 2309.09506 • Published Sep 18, 2023 • 14