Andreas Kirsch's picture

5 4 3

Andreas Kirsch

blackhc

·

https://blackhc.net

AI & ML interests

None yet

Recent Activity

New activity 1 day ago

HuggingFaceH4/blogpost-scaling-test-time-compute:Link to "canonical form" does not work

commented a paper 7 days ago

Rho-1: Not All Tokens Are What You Need

View all activity

Organizations

None yet

blackhc's activity

New activity in HuggingFaceH4/blogpost-scaling-test-time-compute 1 day ago

Link to "canonical form" does not work

#4 opened 1 day ago by

commented a paper 7 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 85 •

New activity in ILSVRC/imagenet-1k 4 months ago

Respect data_files when downloading and returning splits.

#20 opened 4 months ago by

New activity in open-llm-leaderboard-old/requests 8 months ago

CohereForAI Command R+ 4 bit quantized model failed to evaluate

#101 opened 8 months ago by

commented a paper 8 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 85 •

commented a paper 10 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 183 •