DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published 24 days ago • 63
Hardware Acceleration of LLMs: A comprehensive survey and comparison Paper • 2409.03384 • Published about 1 month ago • 1
Attention Heads of Large Language Models: A Survey Paper • 2409.03752 • Published about 1 month ago • 86
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold Paper • 2408.14608 • Published Aug 26 • 7
Geometry on the Wasserstein space over a compact Riemannian manifold Paper • 2104.00910 • Published Apr 2, 2021 • 1
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM Paper • 2408.12076 • Published Aug 22 • 11
Controllable Text Generation for Large Language Models: A Survey Paper • 2408.12599 • Published Aug 22 • 61
MixUp as Locally Linear Out-Of-Manifold Regularization Paper • 1809.02499 • Published Sep 7, 2018 • 1
RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness Paper • 2206.14502 • Published Jun 29, 2022 • 1
Fixup Initialization: Residual Learning Without Normalization Paper • 1901.09321 • Published Jan 27, 2019 • 1
PowerNorm: Rethinking Batch Normalization in Transformers Paper • 2003.07845 • Published Mar 17, 2020 • 1
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering Paper • 2408.09174 • Published Aug 17 • 51
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 154
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands Paper • 2408.11048 • Published Aug 20 • 3
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20 • 40
Persistent homology of the cosmic web. I: Hierarchical topology in ΛCDM cosmologies Paper • 2011.12851 • Published Nov 25, 2020 • 1
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models Paper • 2408.08926 • Published Aug 15 • 4
Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds Paper • 2307.09259 • Published Jul 18, 2023 • 1
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16 • 96
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning Paper • 2408.07931 • Published Aug 15 • 18
LASER: LLM Agent with State-Space Exploration for Web Navigation Paper • 2309.08172 • Published Sep 15, 2023 • 11
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search Paper • 2408.08152 • Published Aug 15 • 51
LieTransformer: Equivariant self-attention for Lie Groups Paper • 2012.10885 • Published Dec 20, 2020 • 1
TopoBenchmarkX: A Framework for Benchmarking Topological Deep Learning Paper • 2406.06642 • Published Jun 9 • 1
OpenResearcher: Unleashing AI for Accelerated Scientific Research Paper • 2408.06941 • Published Aug 13 • 29
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 115
Poincaré Embeddings for Learning Hierarchical Representations Paper • 1705.08039 • Published May 22, 2017 • 1
A geometric framework for asymptotic inference of principal subspaces in PCA Paper • 2209.02025 • Published Sep 5, 2022 • 1
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction Paper • 1802.03426 • Published Feb 9, 2018 • 1
Manifold Learning by Mixture Models of VAEs for Inverse Problems Paper • 2303.15244 • Published Mar 27, 2023 • 1
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters Paper • 2408.04093 • Published Aug 7 • 4
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models Paper • 2408.04840 • Published Aug 9 • 31
A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA Paper • 2312.03732 • Published Nov 28, 2023 • 7
LLaVA-OneVision Collection a model good at arbitrary types of visual input • 15 items • Updated about 16 hours ago • 20
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation Paper • 2408.03588 • Published Aug 7 • 6
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Paper • 2408.03695 • Published Aug 7 • 11
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases Paper • 2408.03910 • Published Aug 7 • 15
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6 • 85
D-Bot: Database Diagnosis System using Large Language Models Paper • 2312.01454 • Published Dec 3, 2023 • 1
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 56