Pawel

Pwlot

AI & ML interests

AGI

Pwlot's activity

liked a model about 1 month ago

lerobot/pi0

Robotics • Updated 30 days ago • 14.2k • 202

liked a dataset 3 months ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6 • 48.3M • 8.6k • 300

liked a model 10 months ago

jasperai/flash-sdxl

Text-to-Image • Updated Jul 3, 2024 • 167 • 33

liked a dataset 10 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jan 31 • 3.3B • 317k • 658

liked a model 11 months ago

stabilityai/stable-zero123

Text-to-3D • Updated Jul 10, 2024 • 703

reacted to Sentdex's post with 👍 11 months ago

Post

Okay, first pass over KAN (Kolmogorov–Arnold Networks): it looks very interesting!

Interpretability of the KAN model:
Interpretability may be treated mostly as a safety issue these days, but it can also serve as a form of interaction between the user and the model, as the paper argues, and I think the authors make a valid point. With an MLP, we only interact with the outputs, but KAN is an entirely different paradigm, and I find it compelling.
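
To make the contrast with an MLP concrete, here is a minimal sketch of the idea, not the paper's implementation. The paper parameterizes each edge function with B-splines; this sketch swaps in plain piecewise-linear interpolation over a fixed grid to stay short. The names `EdgeFunction`, `KANLayer`, and `mlp_layer` are hypothetical and exist only for illustration.

```python
import math
import random


class EdgeFunction:
    """Learnable 1-D function on one edge (assumption: piecewise-linear,
    standing in for the B-splines used in the paper)."""

    def __init__(self, grid, values):
        self.grid = list(grid)      # strictly increasing knot positions
        self.values = list(values)  # learnable value at each knot

    def __call__(self, x):
        g, v = self.grid, self.values
        # Clamp inputs that fall outside the grid.
        if x <= g[0]:
            return v[0]
        if x >= g[-1]:
            return v[-1]
        # Linear interpolation inside the interval that contains x.
        for k in range(len(g) - 1):
            if g[k] <= x <= g[k + 1]:
                t = (x - g[k]) / (g[k + 1] - g[k])
                return (1 - t) * v[k] + t * v[k + 1]


class KANLayer:
    """Each output is a plain sum of per-edge univariate functions."""

    def __init__(self, n_in, n_out, grid, rng):
        self.edges = [
            [EdgeFunction(grid, [rng.uniform(-1, 1) for _ in grid])
             for _ in range(n_in)]
            for _ in range(n_out)
        ]

    def __call__(self, xs):
        return [sum(phi(x) for phi, x in zip(row, xs)) for row in self.edges]


def mlp_layer(weights, xs):
    """For contrast: scalar weights on edges, one fixed nonlinearity on nodes."""
    return [math.tanh(sum(w * x for w, x in zip(row, xs))) for row in weights]


rng = random.Random(0)
layer = KANLayer(n_in=2, n_out=3, grid=[-1.0, 0.0, 1.0], rng=rng)
outputs = layer([0.2, -0.4])

# Every edge is a standalone 1-D function that can be printed or plotted.
print(layer.edges[0][0].values)
```

In this toy version, the "interaction" the post describes amounts to reading or editing the `values` of a single edge, something that has no direct counterpart for a scalar weight in `mlp_layer`.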

Scalability:
KAN shows better parameter efficiency than MLP. This likely also translates to needing less data. With frontier LLMs, we're already at the point where all the data available on the internet is used and more is generated synthetically, so we kind of need something better.

Continual learning:
KAN can handle new input information without catastrophic forgetting, which helps keep a model up to date without relying on an external database or retraining.

Sequential data:
This is probably what most people are curious about right now. KANs have not yet been shown to work with sequential data, and it's unclear what the best approach would be to make them work well, both in training and with respect to interpretability. That said, there's a long, rich history of handling sequential data in a variety of ways, so I don't think getting the ball rolling here would be too challenging.

Mostly, I just love a new paradigm and I want to see more!

KAN: Kolmogorov-Arnold Networks (2404.19756)

liked a Space 11 months ago

610

StoryDiffusion

👁

Generate images from text prompts and reference images

liked a dataset 12 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 190k • 2.08k

liked a Space 12 months ago

1.41k

InstantMesh

📚

Create a 3D model from an image in 10 seconds!

liked 3 Spaces about 1 year ago

269

Repo duplicator

😻

Copy a Hugging Face repository

1.03k

DragGan - Drag Your GAN

👆

Manipulate images using drag points

689

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

reacted to clem's post with 👍 about 1 year ago

Post

Is synthetic data the future of AI? 🔥🔥🔥

@HugoLaurencon, @Leyo & @VictorSanh are introducing HuggingFaceM4/WebSight, a multimodal dataset of 823,000 pairs of synthetically generated HTML/CSS code and screenshots of the corresponding rendered websites, meant for training GPT4-V-like models 🌐💻

While building their upcoming foundation vision language model, they faced the challenge of converting website screenshots into usable HTML/CSS code. Most VLMs suck at this, and there was no public dataset for this specific task, so they decided to create their own.

They prompted existing LLMs to generate HTML/CSS code for 823k very simple websites. After supervised fine-tuning on WebSight, a vision language model was able to generate the code to reproduce a website component from a screenshot.

You can explore the dataset here: HuggingFaceM4/WebSight

What do you think?