atayloraerospace's picture

atayloraerospace

Taylor658

·

atayloraerospace

AI & ML interests

Multimodal Gen AI 🤖 | Agentic AI 🧠🤖 | Computer Vision 🔭 | AI in Healthcare 🩺 | AI in Aerospace 🚀

Recent Activity

new activity 3 days ago

Taylor658/Electrohydrodynamics:Update README.md

updated a model 3 days ago

Taylor658/Electrohydrodynamics

new activity 3 days ago

Taylor658/Titan-Hohmann:Update README.md

View all activity

Organizations

Taylor658's activity

upvoted 14 papers 3 days ago

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Paper • 2412.11815 • Published 6 days ago • 26

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published 9 days ago • 33

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published 12 days ago • 35

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 10 days ago • 69

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published 5 days ago • 40

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published 5 days ago • 39

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 5 days ago • 82

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published 4 days ago • 19

GUI Agents: A Survey

Paper • 2412.13501 • Published 5 days ago • 17

AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities

Paper • 2412.14123 • Published 4 days ago • 11

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Paper • 2412.12953 • Published 5 days ago • 11

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

Paper • 2412.14015 • Published 4 days ago • 11

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 4 days ago • 43

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 4 days ago • 41

upvoted 6 papers 6 days ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published 10 days ago • 35

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published 10 days ago • 43

FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Paper • 2412.09611 • Published 10 days ago • 9

ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation

Paper • 2412.08645 • Published 11 days ago • 11

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 10 days ago • 83

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 9 days ago • 130