U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 3 items • Updated 13 days ago • 15
Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published 20 days ago • 10
Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers Paper • 2409.08916 • Published Sep 13 • 3
Plant foundation models Collection A collection of pre-trained DNA models for plant genomes. • 19 items • Updated Oct 23 • 4
Malaysian synthetic dataset Collection Use LLM to generate Malaysian context synthetic dataset. • 33 items • Updated 3 days ago • 1
RedCode: Risky Code Execution and Generation Benchmark for Code Agents Paper • 2411.07781 • Published Nov 12 • 1
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models Paper • 2410.20771 • Published Oct 28 • 3
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20 • 41
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17 • 2
PlantCaduceus (512bp len) Collection https://plantcaduceus.github.io • 8 items • Updated 7 days ago • 2
Larimar: Large Language Models with Episodic Memory Control Paper • 2403.11901 • Published Mar 18 • 32
Article Recommendation to Revisit the Diffuser Default LoRA Parameters By alvdansen • Jun 21 • 11
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs Paper • 2406.10209 • Published Jun 14 • 8
A Fine-tuning Dataset and Benchmark for Large Language Models for Protein Understanding Paper • 2406.05540 • Published Jun 8 • 3