SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis Paper • 2411.16173 • Published 4 days ago • 4
VideoGPT+ Collection VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding • 10 items • Updated Jun 11 • 3
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models Paper • 2406.01920 • Published Jun 4 • 1
What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models Paper • 2403.13513 • Published Mar 20 • 1