Robert Shaw's picture

17 6 4

Robert Shaw

robertgshaw2

·

rsnm2

AI & ML interests

None yet

Recent Activity

new activity 28 days ago

neuralmagic/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic:Update tokenizer_config.json

new activity 3 months ago

nm-testing/pixtral-12b-w4a16-actorder-group:What is an actorder group and what are the advantages of running this in vLLM?

new activity 3 months ago

neuralmagic/Sparse-Llama-3.1-8B-2of4:Can I apply a LoRA?

View all activity

Organizations

robertgshaw2's activity

upvoted a paper 5 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 50

upvoted a collection 5 months ago

Llama-3.1 Quantization

Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 43

upvoted a collection 9 months ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 68

upvoted a paper over 1 year ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 55

upvoted a collection over 1 year ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated 2 days ago • 560

upvoted a paper over 1 year ago

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 14