crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

scraping crawling python automation crawler

View on GitHub Website

8.8k Stars

710 Forks

8.8k Watchers

Python Language

apache-2.0 License

100 SrcLog Score

Cost to Build

$1.77M

Market Value

$10.78M

How is this calculated?

Growth over time

3 data points · 2025-04-01 → 2026-04-01

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about crawlee-python

Question copied to clipboard

What is the apify/crawlee-python GitHub project? Description: "Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone crawlee-python

Clone via HTTPS

git clone https://github.com/apify/crawlee-python.git

Clone via SSH

[email protected]:apify/crawlee-python.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the crawlee-python issue tracker:

Open GitHub Issues

Similar to crawlee-python

tensorflow awesome-python system-design-primer flask thefuck free-programming-books-zh_CN cli django requests keras ansible scikit-learn scrapy TensorFlow-Examples certbot pytorch python-patterns tornado face_recognition core pandas CNTK python-guide reddit wechat_jump_game interactive-coding-challenges compose data-science-ipython-notebooks ipython pipenv