L's picture

L

abunchofrandomwords

·

AI & ML interests

None yet

Organizations

None yet

abunchofrandomwords's activity

upvoted a paper 4 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

upvoted a paper 7 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 85

upvoted an article 9 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 191

upvoted 2 papers 10 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 93

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 76

upvoted a collection 12 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 37

upvoted 13 papers over 1 year ago

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Paper • 2309.04269 • Published Sep 8, 2023 • 33

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 76

FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

Paper • 2308.04556 • Published Aug 8, 2023 • 9

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

Paper • 2308.04729 • Published Aug 9, 2023 • 32

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 32

PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers

Paper • 2308.05732 • Published Aug 10, 2023 • 9

Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI

Paper • 2308.05221 • Published Aug 9, 2023 • 10

Flexible Isosurface Extraction for Gradient-Based Mesh Optimization

Paper • 2308.05371 • Published Aug 10, 2023 • 11

Follow Anything: Open-set detection, tracking, and following in real-time

Paper • 2308.05737 • Published Aug 10, 2023 • 12

OpenProteinSet: Training data for structural biology at scale

Paper • 2308.05326 • Published Aug 10, 2023 • 11

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Paper • 2308.05374 • Published Aug 10, 2023 • 28

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

Paper • 2308.01390 • Published Aug 2, 2023 • 33