search-engine
Simple search engine application that is capable of crawling articles from a website, store them in predefined format and later index them. These documents are available to be searched for by full-text querries from user interface.
How to download and setup search-engine
Open terminal and run command
git clone https://github.com/markovd18/search-engine.git
git clone is used to create a copy or clone of search-engine repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with search-engine https://github.com/markovd18/search-engine/archive/master.zip
Or simply clone search-engine with SSH
[email protected]:markovd18/search-engine.git
If you have some problems with search-engine
You may open issue on search-engine support forum (system) here: https://github.com/markovd18/search-engine/issuesSimilar to search-engine repositories
Here you may see search-engine alternatives and analogs
scrapy Sasila colly headless-chrome-crawler Lulu crawler newspaper isp-data-pollution webster cdp4j spidy stopstalk-deployment N2H4 memorious easy-scraping-tutorial antch pomp Harvester diffbot-php-client talospider corpuscrawler Python-Crawling-Tutorial learn.scrapinghub.com crawling-projects dig-etl-engine crawlkit scrapy-selenium spidyquotes zcrawl podcastcrawler