18 Forks
153 Stars
153 Watchers

webpalm

WebPalm is a powerful command-line tool for website mapping and web scraping. With its recursive approach, it can generate a complete tree of all webpages and their links on a website. It can also extract data from the body of each page using regular expressions, making it an ideal tool for web scraping and data extraction.

How to download and setup webpalm

Open terminal and run command
git clone https://github.com/Malwarize/webpalm.git
git clone is used to create a copy or clone of webpalm repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with webpalm https://github.com/Malwarize/webpalm/archive/master.zip

Or simply clone webpalm with SSH
[email protected]:Malwarize/webpalm.git

If you have some problems with webpalm

You may open issue on webpalm support forum (system) here: https://github.com/Malwarize/webpalm/issues

Similar to webpalm repositories

Here you may see webpalm alternatives and analogs

 scrapy    Sasila    Price-monitor    webmagic    colly    headless-chrome-crawler    Lulu    newcrawler    scrapple    goose-parser    arachnid    gopa    scrapy-zyte-smartproxy    node-crawler    arachni    newspaper    webster    spidy    N2H4    easy-scraping-tutorial    antch    pomp    talospider    podcastcrawler    FileMasta    lux    scrapy-redis    haipproxy    DotnetSpider    TumblThree