
Site Index 


Site Index is a Node.js utility that crawls a domain, indexes every reachable HTML URL, and writes the results to a urls.json file.

./site-index --help

Site Index

  Will crawl a site and generate the json file for all the urls found. Also
  converts a sitemap to a json file.


  --domain                      (Required) Domain to crawl.
  --output file                 (Required) Folder to output the information to.
  --uri /path/to/file.html      You might want to add just one more path to index.
  --html                        Save the raw html to file.
  --type sitemap|crawl|single   Use the sitemap or crawl to index a site for links.
  --verbose                     Output progress information on the index.
  --help                        Print this usage guide.
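
The internals of Site Index are not shown in this README. As a rough illustration of what crawling a domain for reachable HTML URLs generally involves, here is a minimal Node.js sketch that pulls links out of one page and keeps only those on the same host. The function name `extractSameHostLinks`, the sample HTML, and the `example.com` URLs are all hypothetical and are not part of the tool; the actual crawler and the layout of urls.json may differ.

```javascript
// Hypothetical helper, NOT part of Site Index: collect same-host links
// from an HTML string, resolved against the URL of the page they came from.
function extractSameHostLinks(html, pageUrl) {
  const base = new URL(pageUrl);
  const found = new Set();
  // Naive href matcher for illustration; a real crawler would use an HTML parser.
  const hrefPattern = /<a\s[^>]*?href\s*=\s*["']([^"']+)["']/gi;
  let match;
  while ((match = hrefPattern.exec(html)) !== null) {
    let target;
    try {
      // Resolve relative links such as "contact.html" against the page URL.
      target = new URL(match[1], base);
    } catch {
      continue; // Skip hrefs that cannot be parsed as URLs.
    }
    if (!/^https?:$/.test(target.protocol)) continue; // Skip mailto:, javascript:, etc.
    if (target.host !== base.host) continue; // Stay on the crawled domain.
    target.hash = ''; // Fragments point at the same document.
    found.add(target.href);
  }
  return [...found];
}

// Assumed sample input, for demonstration only.
const sampleHtml = `
  <a href="/about.html">About</a>
  <a class="nav" href="contact.html#form">Contact</a>
  <a href="https://other.example.org/">Elsewhere</a>
  <a href="mailto:hi@example.com">Mail</a>
`;
const urls = extractSameHostLinks(sampleHtml, 'https://example.com/index.html');
console.log(JSON.stringify(urls, null, 2));
```

A full crawl would repeat this step for every newly discovered URL until no unseen links remain, which is why the sketch deduplicates with a `Set`.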