Edward Neuhaus's picture

Edward Neuhaus

Pretergeek

·

https://ko-fi.com/pretergeek

pretergeek

AI & ML interests

NLP, ML, LLMs, AI Ethics, Privacy in AI

Recent Activity

liked a dataset 2 days ago

bespokelabs/Bespoke-Stratos-17k

liked a model 2 days ago

winglian/reasoning-llama-3.1-8b-stratos-cold-start-v2

liked a dataset 4 days ago

SmallDoge/SmallThoughts

View all activity

Organizations

None yet

Pretergeek's activity

upvoted a collection about 1 month ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 208

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted 2 collections 3 months ago

Useful Spaces

13 items • Updated Feb 6 • 1

OpenChat-3.5-0106 with Extended Context

1 item • Updated Feb 6 • 1

upvoted a collection 4 months ago

Vision Language Models Papers 🖼️💬📝

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 36

upvoted a paper 4 months ago

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

Paper • 2409.20537 • Published Sep 30, 2024 • 14

upvoted a collection 5 months ago

RL/Alignment

197 items • Updated Jun 18, 2024 • 25

upvoted a paper 5 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 53

upvoted an article 5 months ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21, 2024

• 19

upvoted a paper 5 months ago

RoFormer: Enhanced Transformer with Rotary Position Embedding

Paper • 2104.09864 • Published Apr 20, 2021 • 12

upvoted 3 papers 6 months ago

Large Language Models Must Be Taught to Know What They Don't Know

Paper • 2406.08391 • Published Jun 12, 2024 • 1

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29, 2024 • 53

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 88

upvoted 2 papers 7 months ago

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 52

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Paper • 2408.11048 • Published Aug 20, 2024 • 4

upvoted 2 collections 7 months ago

Emotional Intelligence Datasets

9 items • Updated Nov 2, 2024 • 4

OpenChat-3.5-0106 with Additional Layers

Upscaled models using the Block Expansion method. Unlike the more common DUP Scaling, BE doesn't require fine-tuning to recover lost performance. • 7 items • Updated Feb 6 • 2

upvoted 3 papers 7 months ago

A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA

Paper • 2312.03732 • Published Nov 28, 2023 • 8

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26

Rotary Position Embedding for Vision Transformer

Paper • 2403.13298 • Published Mar 20, 2024 • 4