Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
litespeed_wiki:cache:lscwp:crawler [2018/03/29 20:13]
Lisa Clarke Adjusted title
litespeed_wiki:cache:lscwp:crawler [2018/07/05 12:14]
qtwrk
Line 18: Line 18:
 ===== Running the Crawler ===== ===== Running the Crawler =====
 {{:​litespeed_wiki:​cache:​lscwp:​lscwp-crawler-watch.png?​direct&​800|}} {{:​litespeed_wiki:​cache:​lscwp:​lscwp-crawler-watch.png?​direct&​800|}}
 +
 +
  
 If you've opted to watch the crawler status, your screen will look something like this. The messages in the status window will vary from these, as this screenshot was grabbed from a small installation with few pages to crawl. If you've opted to watch the crawler status, your screen will look something like this. The messages in the status window will vary from these, as this screenshot was grabbed from a small installation with few pages to crawl.
 +
 +{{:​litespeed_wiki:​cache:​lscwp:​troubleshooting:​lscwp-crawler2.jpg|}}
 +
 +This is an explanation:​
 +
 +''​Size:​ 181'':​ This means the sitemap files has 181 URLs, one URL per line.
 +
 +''​Crawler:​ #​1'':​ This means you are watching action of Crawler number 1 , there could be multiple crawlers working depends on your setting.
 +
 +''​Position:​ 1'':​ This means this crawler is currently fetching the 1st URL from sitemap file.
 +
 +''​Threads:​ 2'':​ This means this is thread number 2, there could be multiple threads fetching, it is smart and will adjust based on your load [[litespeed_wiki:​cache:​lscwp:​configuration:​crawler|settings]].
 +
 +''​Status:​ Stopped due to reset meta position'':​ This means while it's crawling, site has purged or sitemap changed, so it will restart from top.
 +
 +
 +
  
 If you wish to keep a particular path from being crawled, you may enter it in the **Sitemap Generation Blacklist** box and press **Save**. After the crawler has run for the first time, if it encounters any pages marked ''​do-not-cache''​ they will be added to this Blacklist automatically. If you wish to keep a particular path from being crawled, you may enter it in the **Sitemap Generation Blacklist** box and press **Save**. After the crawler has run for the first time, if it encounters any pages marked ''​do-not-cache''​ they will be added to this Blacklist automatically.
  
  • Admin
  • Last modified: 2020/11/14 15:32
  • by Lisa Clarke