VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper β’ 2501.05874 β’ Published 4 days ago β’ 41
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper β’ 2501.06186 β’ Published 3 days ago β’ 33
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper β’ 2501.04689 β’ Published 5 days ago β’ 15 β’ 5
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published 5 days ago β’ 207 β’ 27
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published 5 days ago β’ 207
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper β’ 2501.04689 β’ Published 5 days ago β’ 15 β’ 5
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper β’ 2501.04689 β’ Published 5 days ago β’ 15
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Paper β’ 2501.03271 β’ Published 9 days ago β’ 9
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Paper β’ 2501.00599 β’ Published 13 days ago β’ 40
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Paper β’ 2412.18176 β’ Published 21 days ago β’ 15
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations β’ 3 items β’ Updated Dec 13, 2024 β’ 20
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Paper β’ 2412.04440 β’ Published Dec 5, 2024 β’ 19
view post Post 1223 π Your AI toolkit just got a major upgrade! I updated the Journalists on Hugging Face community's collection with tools for investigative work, content creation, and data analysis.Sharing these new additions with the links in case itβs helpful:- @wendys-llc 's excellent 6-part video series on AI for investigative journalism https://www.youtube.com/playlist?list=PLewNEVDy7gq1_GPUaL0OQ31QsiHP5ncAQ- @jeremycaplan 's curated AI Spaces on HF https://wondertools.substack.com/p/huggingface- @Xenova 's Whisper Timestamped (with diarization!) for private, on-device transcription Xenova/whisper-speaker-diarization & Xenova/whisper-word-level-timestamps- Flux models for image gen & LoRAs autotrain-projects/train-flux-lora-ease- FineGrain's object cutter finegrain/finegrain-object-cutter and object eraser (this one's cool) finegrain/finegrain-object-eraser- FineVideo: massive open-source annotated dataset + explorer HuggingFaceFV/FineVideo-Explorer- Qwen2 chat demos, including 2.5 & multimodal versions (crushing it on handwriting recognition) Qwen/Qwen2.5 & Qwen/Qwen2-VL- GOT-OCR integration stepfun-ai/GOT_official_online_demo- HTML to Markdown converter maxiw/HTML-to-Markdown- Text-to-SQL query tool by @davidberenstein1957 for HF datasets davidberenstein1957/text-to-sql-hub-datasetsThere's a lot of potential here for journalism and beyond. Give these a try and let me know what you build! You can also add your favorite ones if you're part of the community!Check it out: https://huggingface.co/JournalistsonHF#AIforJournalism #HuggingFace #OpenSourceAI π 5 5 π 4 4 + Reply
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning Paper β’ 2407.20806 β’ Published Jul 30, 2024 β’ 1
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper β’ 2410.19168 β’ Published Oct 24, 2024 β’ 19
CogVLM2: Visual Language Models for Image and Video Understanding Paper β’ 2408.16500 β’ Published Aug 29, 2024 β’ 57 β’ 5
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities Paper β’ 2401.12168 β’ Published Jan 22, 2024 β’ 26
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper β’ 2409.12183 β’ Published Sep 18, 2024 β’ 37
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge Paper β’ 2102.03315 β’ Published Feb 5, 2021 β’ 1