Gorantla Narendra

narrinddhar

narrinddhar's activity

upvoted a paper 7 months ago

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

Paper • 2305.05176 • Published May 9, 2023 • 6

upvoted a paper 12 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 258

upvoted an article 12 months ago

CodeGemma - an official Google release for code LLMs

Article • Published Apr 9, 2024 • 100

upvoted 3 papers about 1 year ago

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3, 2024 • 69

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1, 2024 • 12

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Paper • 2403.17804 • Published Mar 26, 2024 • 18

upvoted 2 collections about 1 year ago

Papers to read

106 items • Updated Feb 11 • 7

Text to Image

5 items • Updated Apr 1, 2024 • 1

upvoted 11 papers about 1 year ago

Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

Paper • 2403.16990 • Published Mar 25, 2024 • 25

WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs

Paper • 2403.07944 • Published Mar 10, 2024 • 1

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

Paper • 2403.14148 • Published Mar 21, 2024 • 20

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

Paper • 2403.12706 • Published Mar 19, 2024 • 18

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Paper • 2403.12015 • Published Mar 18, 2024 • 67

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Paper • 2403.05438 • Published Mar 8, 2024 • 21

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7, 2024 • 40

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Paper • 2403.00522 • Published Mar 1, 2024 • 46

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23, 2024 • 38

Subobject-level Image Tokenization

Paper • 2402.14327 • Published Feb 22, 2024 • 18

The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 21