2 98 66

robbie

robb-0

AI & ML interests

None yet

Recent Activity

reacted to clem's post with 🤗 about 10 hours ago

We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!

reacted to akhaliq's post with 🚀 about 10 hours ago

Google drops Gemini 2.0 Flash Thinking a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat

upvoted a paper about 10 hours ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

View all activity

Organizations

robb-0's activity

reacted to clem's post with 🤗 about 10 hours ago

Post

829

We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!

1 reply

reacted to akhaliq's post with 🚀 about 10 hours ago

Post

13301

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: akhaliq/anychat

3 replies

upvoted a paper about 10 hours ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 60

liked a Space 1 day ago

MiniSearch

👌

Minimalist web-searching app with browser-based AI assistant

upvoted a paper 1 day ago

Improving Language Plasticity via Pretraining with Active Forgetting

Paper • 2307.01163 • Published Jul 3, 2023 • 7

liked a Space 2 days ago

📚ArxivPaperSearch🔍

🔍

Search and summarize academic papers

reacted to awacke1's post with 😎 2 days ago

Post

1983

I introduce MIT license

ML Model Specialize Fine Tuner app "SFT Tiny Titans" 🚀

Demo video with source.

Download, train, SFT, and test your models, easy as 1-2-3!
URL: awacke1/TorchTransformers-NLP-CV-SFT

2 replies

upvoted 2 papers 3 days ago

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Paper • 2407.16958 • Published Jul 24, 2024 • 4

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8

upvoted a collection 3 days ago

🐕Small-Doges

Collection

Doge family of small language models! • 11 items • Updated 2 days ago • 3

liked a model 3 days ago

BlinkDL/rwkv7-g1

Text Generation • Updated 5 days ago • 44

reacted to Jaward's post with 👀 3 days ago

Post

1947

Super Interesting Paper!
Proposes neural networks (CRNNs) that can learn to produce traveling waves in their hidden state in response to visual stimuli, thus enabling the transfer and integration of spatial information across neural connections. In other words they showed that neural networks have wave-like properties that blends and processes visual information over time, cool seeing a union of AI and physics in this way.
Paper: https://arxiv.org/pdf/2502.06034
Code: https://github.com/KempnerInstitute/traveling-waves-integrate

New activity in huggingchat/chat-ui 3 days ago

[UI] - Title of conversations with gibberish or...

#685 opened 3 days ago by

robb-0

liked a model 3 days ago

OuteAI/Lite-Oute-1-300M-Instruct

Text Generation • Updated Aug 25, 2024 • 508 • 10

reacted to ZennyKenny's post with 🤗 3 days ago

Post

2198

Really excited to start contributing to the SWE Arena project: https://swe-arena.com/

Led by IBM PhD fellow @terryyz , our goal is to advance research in code generation and app development by frontier LLMs.

upvoted a collection 3 days ago

story writing favourites

Collection

Models I personally liked for generating stories in the past. Not a recommendation, many of these are outdated. • 20 items • Updated 7 days ago • 50

liked 4 models 3 days ago