Mia Hawthorne

MiaHawthorne

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

NexaAIDev/OmniAudio-2.6B

liked a model about 1 month ago

tencent/HunyuanVideo

liked a model about 1 month ago

meta-llama/Llama-3.2-1B

Organizations

None yet

MiaHawthorne's activity

liked 3 models about 1 month ago

liked 3 models about 2 months ago

NexaAIDev/Qwen2-Audio-7B-GGUF

Audio-Text-to-Text • Updated Nov 25, 2024 • 6.86k • 133

mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 2.32M • 3.53k

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated 10 days ago • 43.7k • 304

New activity in NexaAIDev/OmniVLM-968M 2 months ago

about ocr

#1 opened 2 months ago by MiaHawthorne

liked a model 2 months ago

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 1.1k • 497

upvoted 10 papers 2 months ago

Analyzing The Language of Visual Tokens

Paper • 2411.05001 • Published Nov 7, 2024 • 23

M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models

Paper • 2411.04075 • Published Nov 6, 2024 • 16

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Paper • 2411.04952 • Published Nov 7, 2024 • 28

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Paper • 2411.05005 • Published Nov 7, 2024 • 13

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published Nov 7, 2024 • 70

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 50

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Paper • 2411.05000 • Published Nov 7, 2024 • 21

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published Nov 7, 2024 • 22

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published Nov 7, 2024 • 49

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 113

liked 2 models 2 months ago

stabilityai/stable-diffusion-3.5-medium

Text-to-Image • Updated Oct 31, 2024 • 95.3k • 534

tencent/Hunyuan3D-1

Text-to-3D • Updated Nov 23, 2024 • 974 • 262