Orca 2: Teaching Small Language Models How to Reason Paper • 2311.11045 • Published Nov 18, 2023 • 69
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29 • 46
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 26 days ago • 55
Article Unleashing the Power of Logprobs in Language Models: A Practical Guide By Andyrasika • Jan 12 • 1
Article 💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data By alvarobartt • Dec 1, 2023 • 1
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • 28 days ago • 45
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers Paper • 2405.10936 • Published 5 days ago • 1
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities Paper • 2404.17790 • Published 25 days ago • 2
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 23 days ago • 63
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Paper • 2404.14723 • Published 29 days ago • 9
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29 • 49
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models Paper • 2404.08763 • Published Apr 12 • 1
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Paper • 2404.08801 • Published Apr 12 • 62
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Paper • 2404.12318 • Published Apr 18 • 14
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18 • 35
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning Paper • 2301.09626 • Published Jan 23, 2023 • 2
InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory Paper • 2402.04617 • Published Feb 7 • 4
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence Paper • 2404.05892 • Published Apr 8 • 28
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11 • 39
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models Paper • 2403.00417 • Published Mar 1 • 1
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models Paper • 2012.15613 • Published Dec 31, 2020 • 1
Getting the most out of your tokenizer for pre-training and domain adaptation Paper • 2402.01035 • Published Feb 1 • 1
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3 • 46
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Paper • 2403.17919 • Published Mar 26 • 15
Orca-Math: Unlocking the potential of SLMs in Grade School Math Paper • 2402.14830 • Published Feb 16 • 23
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare Paper • 2403.13313 • Published Mar 20 • 1
On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial Paper • 2403.14380 • Published Mar 21 • 1
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20 • 54
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13 • 48
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code Paper • 2403.07974 • Published Mar 12 • 1
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14 • 54
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14 • 119
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation Paper • 2312.13010 • Published Dec 20, 2023 • 4
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7 • 38
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement Paper • 2402.14658 • Published Feb 22 • 77
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs Paper • 2312.05934 • Published Dec 10, 2023 • 1
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap Paper • 2402.19450 • Published Feb 29 • 3
Ring Attention with Blockwise Transformers for Near-Infinite Context Paper • 2310.01889 • Published Oct 3, 2023 • 8
Striped Attention: Faster Ring Attention for Causal Transformers Paper • 2311.09431 • Published Nov 15, 2023 • 4
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 38
Jatmo: Prompt Injection Defense by Task-Specific Finetuning Paper • 2312.17673 • Published Dec 29, 2023 • 1
An Early Categorization of Prompt Injection Attacks on Large Language Models Paper • 2402.00898 • Published Jan 31 • 2
Divide-or-Conquer? Which Part Should You Distill Your LLM? Paper • 2402.15000 • Published Feb 22 • 22