Post
1955
Are you tired of writing scripts to scrape data from the web? π
ScrapeGraphAI is here for you! π
ScrapeGraphAI is an OPEN-SOURCE web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). ππ
Just say which information you want to extract (in human language) and the library will do it for you! π£οΈπ
It supports GPT, Gemini, and open-source models like Mistral. π
A few things that I could not find in the docs but would be amazing to see π€:
- Captcha handling π
- Persistent data output formatting π
- Streaming output π‘
- Explanationπ of the tag line: "ScrapeGraphAI: You Only Scrape Once" What does that even mean? π€£ Is this YOLO? π€
Link: https://github.com/VinciGit00/Scrapegraph-ai
Demo code: https://github.com/amrrs/scrapegraph-code/blob/main/sourcegraph.ipynb
ScrapeGraphAI is here for you! π
ScrapeGraphAI is an OPEN-SOURCE web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). ππ
Just say which information you want to extract (in human language) and the library will do it for you! π£οΈπ
It supports GPT, Gemini, and open-source models like Mistral. π
A few things that I could not find in the docs but would be amazing to see π€:
- Captcha handling π
- Persistent data output formatting π
- Streaming output π‘
- Explanationπ of the tag line: "ScrapeGraphAI: You Only Scrape Once" What does that even mean? π€£ Is this YOLO? π€
Link: https://github.com/VinciGit00/Scrapegraph-ai
Demo code: https://github.com/amrrs/scrapegraph-code/blob/main/sourcegraph.ipynb