Guillerm PRO

doxiy

AI & ML interests

AI student

Recent Activity

updated a Space 2 months ago

doxiy/First_agent_template

updated a model 12 months ago

doxiy/bert-finetuned-ner

updated a model 12 months ago

doxiy/bert-finetuned-ner-accelerate

doxiy's activity

updated a Space 2 months ago

First Agent Template

⚡

Find the current local time in any timezone

updated 2 models 12 months ago

doxiy/bert-finetuned-ner

Token Classification • Updated Apr 30, 2024

doxiy/bert-finetuned-ner-accelerate

Updated Apr 30, 2024

updated a model about 1 year ago

doxiy/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 23, 2024

reacted to abhishek's post with 👍 over 1 year ago

Post

Happy to announce the brand-new, open-source Hugging Face Competitions platform 🚀 Now you can create a machine learning competition for your friends, colleagues, or the world for FREE* and host it on Hugging Face: the AI community building the future. Creating a competition requires only two steps: run pip install competitions, then run competitions create and answer a few questions 💥 Check out the GitHub repo: https://github.com/huggingface/competitions and the docs: https://hf.co/docs/competitions

6 replies

reacted to clem's post with 🤯 over 1 year ago

Post

Is synthetic data the future of AI? 🔥🔥🔥

@HugoLaurencon @Leyo & @VictorSanh are introducing HuggingFaceM4/WebSight, a multimodal dataset featuring 823,000 pairs of synthetically generated HTML/CSS codes along with screenshots of the corresponding rendered websites to train GPT4-V-like models 🌐💻

While crafting their upcoming foundation vision language model, they faced the challenge of converting website screenshots into usable HTML/CSS codes. Most VLMs suck at this and there was no public dataset available for this specific task, so they decided to create their own.

They prompted existing LLMs to generate 823k HTML/CSS codes of very simple websites. Through supervised fine-tuning of a vision language model on WebSight, they were able to generate the code to reproduce a website component, given a screenshot.

You can explore the dataset here: HuggingFaceM4/WebSight

What do you think?

12 replies

upvoted 2 papers over 1 year ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 197

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244