-
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Paper • 2402.01391 • Published • 41 -
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper • 2402.13753 • Published • 104 -
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Paper • 2404.08801 • Published • 61 -
TransformerFAM: Feedback attention is working memory
Paper • 2404.09173 • Published • 42
gunasekar
GunA-SD
AI & ML interests
None yet
Organizations
None yet