Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing Paper • 2407.04180 • Published Jul 4, 2024
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification Paper • 2410.05057 • Published Oct 7, 2024 • 7
PITCH: AI-assisted Tagging of Deepfake Audio Calls using Challenge-Response Paper • 2402.18085 • Published Feb 28, 2024
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30, 2025 • 19
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published 19 days ago • 14
nyu-dice-lab/Llama-3-8B-WildChat-250k-qwen2-72b-mmlu-personahub_math_v5_regen_149960 Text Generation • Updated Jan 10
nyu-dice-lab/Llama-3-8B-WildChat-250k-qwen2-72b-mmlu-open_math_2_gsm8k_50k Text Generation • Updated Jan 9
nyu-dice-lab/Llama-3-8B-WildChat-250k-qwen2-72b-mmlu-personas-math-grade Text Generation • Updated Jan 8
nyu-dice-lab/Llama-3-8B-WildChat-dpo-qwen2572b-athene70b-jdg-Llama3-Harmlessness Text Generation • Updated Jan 8
nyu-dice-lab/Llama-3-8B-WildChat-250k-c4ai-command-r-plus-08-2024 Text Generation • Updated Jan 2 • 2