PeepDaSlan9 (Ohenenoo)

upvoted a paper 8 days ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published 12 days ago • 15

upvoted a collection 15 days ago

Terminus XL

Collection

v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24 • 6

upvoted a collection 26 days ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 41 items • Updated 7 days ago • 52

upvoted an article 26 days ago

Article

Vision Language Models Explained

Apr 11

• 98

upvoted 2 articles 28 days ago

Article

Hugging Face x LangChain : A new partner package in LangChain

29 days ago

• 75

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

29 days ago

• 142

upvoted a collection 29 days ago

Yi-1.5 (2024/05)

Collection

10 items • Updated 23 days ago • 79

upvoted a paper 29 days ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55

upvoted a collection about 1 month ago

Berkeley Function-Calling Leaderboard

Collection

2 items • Updated Apr 5 • 3

upvoted 8 papers about 1 month ago

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning

Paper • 2305.18424 • Published May 28, 2023 • 1

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Paper • 2310.01334 • Published Oct 2, 2023 • 3

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Paper • 2310.02998 • Published Oct 4, 2023 • 1

Unraveling the Key Components of OOD Generalization via Diversification

Paper • 2312.16313 • Published Dec 26, 2023 • 1

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Paper • 2401.15914 • Published Jan 29 • 7

Ferret: Refer and Ground Anything Anywhere at Any Granularity

Paper • 2310.07704 • Published Oct 11, 2023 • 10

ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection

Paper • 2402.17888 • Published Feb 27 • 1

Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes

Paper • 2310.01840 • Published Oct 3, 2023 • 1

upvoted 4 collections about 1 month ago

upvoted 3 collections about 2 months ago

Edit Your Image!

Collection

Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23

Experimental Projects

Collection

Spaces that are too early or cutting edge for mainstream usage 🙂 • 4 items • Updated Nov 16, 2023 • 5

My Best Spaces

Collection

My most polished spaces • 7 items • Updated about 1 month ago • 34

upvoted a paper about 2 months ago

MaGGIe: Masked Guided Gradual Human Instance Matting

Paper • 2404.16035 • Published Apr 24 • 8

upvoted a collection about 2 months ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 101

upvoted an article about 2 months ago

Article

Deploy LLMs with Hugging Face Inference Endpoints

Jul 4, 2023

• 8

upvoted 4 collections about 2 months ago

〽️MistralAI

Collection

A collection of MistralAI models that you can trust in production! • 10 items • Updated 8 days ago • 7

image-anime-cartoon

Collection

1 item • Updated Mar 1 • 1

DeepFakeAI

Collection

Collection of awesome DeepFakeAI spaces • 3 items • Updated Apr 5 • 6

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 84

upvoted an article about 2 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 248

upvoted a collection about 2 months ago

WizardLM

Collection

0 items • Updated May 8 • 100

upvoted 6 papers about 2 months ago

GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo

Paper • 2404.07992 • Published Apr 11 • 2

InfMLLM: A Unified Framework for Visual-Language Tasks

Paper • 2311.06791 • Published Nov 12, 2023 • 2

MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices

Paper • 2312.16886 • Published Dec 28, 2023 • 18

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

Paper • 2312.02949 • Published Dec 5, 2023 • 8

Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models

Paper • 2312.06109 • Published Dec 11, 2023 • 19

Small Language Model Meets with Reinforced Vision Vocabulary

Paper • 2401.12503 • Published Jan 23 • 30

upvoted an article 2 months ago

Article

Mixture of Depth is Vibe

By

•

Apr 22

• 37

upvoted a paper 2 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 41

upvoted a collection 2 months ago

Transformers.js demos

Collection

A collection of my favorite WebML demos, built with Transformers.js! • 23 items • Updated May 8 • 41

upvoted a paper 2 months ago

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

Paper • 2404.05717 • Published Apr 8 • 23

upvoted 2 collections 2 months ago

Open-source speech datasets annotated using Data-Speech

Collection

Open-source annotated speech datasets ranging from 1,000 hours to soon 50,000 hours. • 7 items • Updated 27 days ago • 3

best

Collection

14 items • Updated Feb 19 • 2

upvoted 3 papers 2 months ago

RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS

Paper • 2403.13806 • Published Mar 20 • 18

Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering

Paper • 2403.14554 • Published Mar 21 • 12

EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting

Paper • 2403.15124 • Published Mar 22 • 1

upvoted a collection 2 months ago

My NLP Spaces

Collection

Hugging Face transformers fine-tuned for various NLP tasks using TensorFlow. • 13 items • Updated Apr 9 • 1

upvoted 2 papers 2 months ago

CosmicMan: A Text-to-Image Foundation Model for Humans

Paper • 2404.01294 • Published Apr 1 • 15

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Paper • 2404.01197 • Published Apr 1 • 29

upvoted 2 papers 3 months ago

StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

Paper • 2403.09055 • Published Mar 14 • 24

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 58

upvoted a collection 3 months ago

Trending 3D and Depth Demos

Collection

One place to keep track of all 3D and Depth demos • 14 items • Updated Apr 17 • 16

upvoted 2 papers 3 months ago

Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Paper • 2403.12032 • Published Mar 18 • 14

Video Editing via Factorized Diffusion Distillation

Paper • 2403.09334 • Published Mar 14 • 21

upvoted a collection 3 months ago

WebLINX Models

Collection

https://mcgill-nlp.github.io/weblinx • 17 items • Updated 8 days ago • 6

upvoted a paper 3 months ago

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

Paper • 2403.08268 • Published Mar 13 • 15

upvoted 2 collections 3 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated 28 days ago • 312

BERT release

Collection

Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated 28 days ago • 15

Ohenenoo

AI & ML interests

Organizations

PeepDaSlan9's activity

Vision Language Models Explained

Hugging Face x LangChain : A new partner package in LangChain

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Deploy LLMs with Hugging Face Inference Endpoints

Welcome Llama 3 - Meta's new open LLM

Mixture of Depth is Vibe