Simon Brandeis's picture

Simon Brandeis

sbrandeis

·

SBrandeis

AI & ML interests

None yet

Recent Activity

updated a dataset 5 days ago

huggingface/documentation-images

new activity 10 days ago

huggingface/HuggingDiscussions:manycore-research/SpatialLM-Llama-1B

reacted to julien-c's post with 🚀 26 days ago

Important notice 🚨 For Inference Providers who have built support for our Billing API (currently: Fal, Novita, HF-Inference – with more coming soon), we've started enabling Pay as you go (=PAYG) What this means is that you can use those Inference Providers beyond the free included credits, and they're charged to your HF account. You can see it on this view: any provider that does not have a "Billing disabled" badge, is PAYG-compatible.

View all activity

Organizations

sbrandeis's activity

upvoted an article about 2 months ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

Feb 18

• 95

upvoted an article 2 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 134

upvoted an article 10 months ago

Article

Benchmarking Text Generation Inference

May 29, 2024

• 30

upvoted 2 collections 12 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 737

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91

upvoted 4 papers about 1 year ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 127

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 142

Locally Typical Sampling

Paper • 2202.00666 • Published Feb 1, 2022 • 2

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 43

upvoted a collection about 1 year ago

MAGNeT

Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4, 2024 • 40

upvoted a paper about 1 year ago

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

Paper • 2307.13304 • Published Jul 25, 2023 • 2

upvoted 3 papers over 1 year ago

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Paper • 2312.09767 • Published Dec 15, 2023 • 27

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 80

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Paper • 2312.02145 • Published Dec 4, 2023 • 6

upvoted 2 collections over 1 year ago

Notus 7B v1

Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Dec 11, 2024 • 18

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 236

upvoted 3 papers over 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143

Positional Description Matters for Transformers Arithmetic

Paper • 2311.14737 • Published Nov 22, 2023 • 2

Thinking Fast and Slow in Large Language Models

Paper • 2212.05206 • Published Dec 10, 2022 • 1