Post 2097
🔦 What? The Hub as a vector search backend!
Code: https://gist.github.com/davidberenstein1957/f0157a471ec59d9dd44ae6957f1d52ec
Built on DuckDB: https://huggingface.co/docs/hub/en/datasets-duckdb
Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception Paper • 2410.12788 • Published Oct 16, 2024 • 24
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published Oct 16, 2024 • 30
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance Paper • 2410.18889 • Published Oct 24, 2024 • 15
VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images Paper • 2408.16176 • Published Aug 28, 2024 • 8
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 35