Differences
This shows you the differences between two versions of the page.
litespeed_wiki:cache:lscwp:crawler [2018/03/29 20:13] Lisa Clarke Adjusted title |
litespeed_wiki:cache:lscwp:crawler [2018/07/05 12:14] qtwrk |
||
---|---|---|---|
Line 18: | Line 18: | ||
===== Running the Crawler ===== | ===== Running the Crawler ===== | ||
{{:litespeed_wiki:cache:lscwp:lscwp-crawler-watch.png?direct&800|}} | {{:litespeed_wiki:cache:lscwp:lscwp-crawler-watch.png?direct&800|}} | ||
+ | |||
+ | |||
If you've opted to watch the crawler status, your screen will look something like this. The messages in the status window will vary from these, as this screenshot was grabbed from a small installation with few pages to crawl. | If you've opted to watch the crawler status, your screen will look something like this. The messages in the status window will vary from these, as this screenshot was grabbed from a small installation with few pages to crawl. | ||
+ | |||
+ | {{:litespeed_wiki:cache:lscwp:troubleshooting:lscwp-crawler2.jpg|}} | ||
+ | |||
+ | The fields in the status window are explained below: | ||
+ | |||
+ | ''Size: 181'': The sitemap file contains 181 URLs, one URL per line. | ||
+ | |||
+ | ''Crawler: #1'': You are watching the activity of crawler number 1. Multiple crawlers may be running, depending on your settings. | ||
+ | |||
+ | ''Position: 1'': This crawler is currently fetching the 1st URL in the sitemap file. | ||
+ | |||
+ | ''Threads: 2'': This is thread number 2. Multiple threads may be fetching at the same time; the crawler adjusts automatically based on your load [[litespeed_wiki:cache:lscwp:configuration:crawler|settings]]. | ||
+ | |||
+ | ''Status: Stopped due to reset meta position'': While the crawler was running, the site was purged or the sitemap changed, so the crawler will restart from the top of the sitemap. | ||
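To make the relationship between ''Size'', ''Position'', and the reset status more concrete, here is a minimal Python sketch. It is a toy model only, not the plugin's actual code: the function names ''parse_sitemap'' and ''crawl'', the ''reset_before'' parameter, and the ''example.com'' URLs are all hypothetical and exist purely for illustration. It assumes the sitemap is plain text with one URL per line, as described above.

```python
def parse_sitemap(text):
    """Return the URLs of a one-URL-per-line sitemap, skipping blank lines."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def crawl(urls, reset_before=None):
    """Walk the URL list in order and return the status messages produced.

    reset_before is a hypothetical knob: when the crawler is about to fetch
    that position, it simulates a purge or sitemap change exactly once,
    which sends the crawler back to the top of the list.
    """
    log = []
    position = 1  # positions are 1-based, like "Position: 1" in the status window
    pending_reset = reset_before
    while position <= len(urls):
        if pending_reset == position:
            pending_reset = None  # only reset once, so the loop terminates
            log.append("Status: Stopped due to reset meta position")
            position = 1  # restart from the top of the sitemap
            continue
        # A real crawler would fetch urls[position - 1] here.
        log.append(f"Size: {len(urls)} Position: {position}")
        position += 1
    return log


sitemap_text = "https://example.com/\n\nhttps://example.com/a\nhttps://example.com/b\n"
urls = parse_sitemap(sitemap_text)
for message in crawl(urls, reset_before=3):
    print(message)
```

In this sketch, ''Size'' is simply the number of non-empty lines in the sitemap, and a reset sends ''Position'' back to 1 so that every URL is fetched again from the start.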
+ | |||
+ | |||
+ | |||
If you wish to keep a particular path from being crawled, you may enter it in the **Sitemap Generation Blacklist** box and press **Save**. After the crawler has run for the first time, if it encounters any pages marked ''do-not-cache'' they will be added to this Blacklist automatically. | If you wish to keep a particular path from being crawled, you may enter it in the **Sitemap Generation Blacklist** box and press **Save**. After the crawler has run for the first time, if it encounters any pages marked ''do-not-cache'' they will be added to this Blacklist automatically. | ||