fdaudens posted an update 25 days ago
Excited to share a new project to make journalists’ lives easier when gathering information!

Collecting data like lists, URLs, etc., from websites is not always easy (and sometimes painful). Web scraping requires technical skills that only a handful of people in each newsroom have.
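
To give a sense of the technical skill involved, here is a minimal sketch of the conventional, hand-written approach to one small task: pulling every link out of a page. It uses only Python's standard library. The names `LinkCollector`, `extract_links`, and `SAMPLE_HTML` are made up for illustration, and the HTML fragment is a hypothetical stand-in for a real page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # Skip anchors that have no href or an empty one.
            if name == "href" and value:
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all absolute link URLs found in an HTML string."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical page fragment, used only for illustration.
SAMPLE_HTML = """
<ul>
  <li><a href="/reports/2024.pdf">Annual report</a></li>
  <li><a href="https://example.org/press">Press room</a></li>
  <li><a>No link here</a></li>
</ul>
"""

links = extract_links(SAMPLE_HTML, "https://example.org/newsroom/")
```

Even this small case needs a parser class, attribute handling, and relative-URL resolution, and real pages usually add pagination, JavaScript rendering, and inconsistent markup on top.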

I recently stumbled upon @scrapegraphai , a scraper that uses AI to do the heavy lifting: you describe what you want in a simple natural-language prompt. I asked the team if they could integrate the Hugging Face Hub so it can use open-source models, and I built a no-code, easy-to-use interface for it with Gradio.

You can then save time and focus on storytelling!

🔧 How It Works
1. Input Your Prompt and Source URL
2. Click ‘Scrape and Summarize’
3. Receive Summarized Results
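
For developers curious about the shape of that flow, the three steps above can be sketched roughly as follows. This is not the app's actual code: `scrape_and_summarize`, `fake_fetch`, and `fake_summarize` are hypothetical names, and the fetch and model steps are stand-ins so the example runs offline. In a real version, those two callables would be replaced by a page fetcher and a call to a language model.

```python
from typing import Callable


def scrape_and_summarize(
    prompt: str,
    url: str,
    fetch: Callable[[str], str],
    summarize: Callable[[str], str],
    max_chars: int = 4000,
) -> str:
    """Fetch a page, combine it with the user's prompt, and return the model's answer."""
    # Step 1: validate the two inputs the user types in.
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not url.startswith(("http://", "https://")):
        raise ValueError("url must start with http:// or https://")

    # Step 2: fetch the page and truncate it so the model input stays small.
    page_text = fetch(url)[:max_chars]
    instruction = f"{prompt.strip()}\n\nSource: {url}\n\nPage content:\n{page_text}"

    # Step 3: hand everything to the model and return its summary.
    return summarize(instruction)


# Offline stand-ins (assumptions for illustration only).
def fake_fetch(url: str) -> str:
    return "Council approves budget. Mayor announces new park."


def fake_summarize(instruction: str) -> str:
    # Echo the first line (the prompt) to show what the model would receive.
    return "Model input began with: " + instruction.splitlines()[0]


result = scrape_and_summarize(
    "List the headlines", "https://example.org/news", fake_fetch, fake_summarize
)
```

Passing the fetcher and the model in as arguments keeps the sketch testable without network access and makes it easy to swap in a different model later.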

👩‍💻 Get Involved!
This is just the first version of the tool, and it’s pretty basic. I’ve uploaded it to the Journalists on Hugging Face community so we can work together on it. Whether you’re a developer, a data scientist, or a journalist with ideas, you can contribute to this project.

You can also copy this app to your own account or organization to customize it to your needs.

👉 Test the scraper here: JournalistsonHF/ai-scraper

🤝 Join the Journalists on 🤗 community: https://huggingface.co/JournalistsonHF