wiki-scraper

wiki-scraper

marinakiseleva

This web crawler uses Scrapy py to crawl Wikipedia. It prints the page title, total word count, and page category (using openpyxl) to an Excel workbook, in order to analyze the verbosity of articles by category.

2 Stars
1 Forks
2 Watchers
Python Language
Cost to Build
$500
Market Value
$500

Growth over time

8 data points  ·  2021-08-01 → 2025-07-01
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about wiki-scraper

Question copied to clipboard

What is the marinakiseleva/wiki-scraper GitHub project? Description: "This web crawler uses Scrapy py to crawl Wikipedia. It prints the page title, total word count, and page category (using openpyxl) to an Excel workbook, in order to analyze the verbosity of articles by category. ". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone wiki-scraper

Clone via HTTPS

git clone https://github.com/marinakiseleva/wiki-scraper.git

Clone via SSH

[email protected]:marinakiseleva/wiki-scraper.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the wiki-scraper issue tracker:

Open GitHub Issues