Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
singhsidhukuldeepΒ 
posted an update May 10
Post
1955
Are you tired of writing scripts to scrape data from the web? πŸ˜“

ScrapeGraphAI is here for you! πŸŽ‰

ScrapeGraphAI is an OPEN-SOURCE web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). πŸŒπŸ“Š

Just say which information you want to extract (in human language) and the library will do it for you! πŸ—£οΈπŸš€

It supports GPT, Gemini, and open-source models like Mistral. πŸ”

A few things that I could not find in the docs but would be amazing to see 🀞:
- Captcha handling πŸ”
- Persistent data output formatting πŸ“
- Streaming output πŸ“‘
- ExplanationπŸ˜‚ of the tag line: "ScrapeGraphAI: You Only Scrape Once" What does that even mean? 🀣 Is this YOLO? πŸ€”

Link: https://github.com/VinciGit00/Scrapegraph-ai
Demo code: https://github.com/amrrs/scrapegraph-code/blob/main/sourcegraph.ipynb

ε°ε¨ζ‹‰ε±θ‚‘οΌŒε€šζ­€δΈ€δΈΎοΌŒ

Β·

Hi @scapking

I am sorry what does this mean?
Google Translate is translating this to very offensive stuff!

Adam langley

I tried it from your website. I showed it a couple of Wikipedia pages and asked it to summarise what explained on that page. Unfortunately, it was not successful.