Domain-specific-data-collection-from-structured-and-unstructured-sources

Domain-specific-data-collection-from-structured-and-unstructured-sources

anmolagarwal999

Data collection (scraping+dynamic crawling) for domain "Computer Scientists" from 13 websites include Wikipedia, Google Scholar, DBLP etc and merging them to create a high quality tabular dataset.

1 Stars
1 Forks
1 Watchers
Jupyter Notebook Language
Cost to Build
$1.42M
Market Value
$470.4K

Growth over time

5 data points  ·  2022-02-01 → 2025-07-01
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about Domain-specific-data-collection-from-structured-and-unstructured-sources

Question copied to clipboard

What is the anmolagarwal999/Domain-specific-data-collection-from-structured-and-unstructured-sources GitHub project? Description: "Data collection (scraping+dynamic crawling) for domain "Computer Scientists" from 13 websites include Wikipedia, Google Scholar, DBLP etc and merging them to create a high quality tabular dataset.". Written in Jupyter Notebook. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone Domain-specific-data-collection-from-structured-and-unstructured-sources

Clone via HTTPS

git clone https://github.com/anmolagarwal999/Domain-specific-data-collection-from-structured-and-unstructured-sources.git

Clone via SSH

[email protected]:anmolagarwal999/Domain-specific-data-collection-from-structured-and-unstructured-sources.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the Domain-specific-data-collection-from-structured-and-unstructured-sources issue tracker:

Open GitHub Issues