A tool to crawl a site and log any resources that return a 404. Results are presented with a searchable todo-style checklist.
- Install Node
- Clone repo
git clone [email protected]:hudakdidit/site_crawler.git
- Install dependencies
npm install
- Setup config file: run
mv config-example.json config.json
. Update thesite
andport
properties as necessary.
Start webpack and the express web server
npm start
Start webpack the express web server, and the web crawler
npm run dev-crawl
Start the express web server
npm run server
Start the crawler script.
npm run crawl