Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM • Paper • 2511.23119 • Published Nov 28, 2025