Posted by Oksana Budnikova on 03 February 2016 08:40 PM
Sitemap is a list of website pages accessible to search engine crawlers, helping to improve search engine optimization of a website by making sure that all the pages can be found.
The pages are listed in hierarchical structure in the XML format. You can see the sitemap on page: http://yoursitename.com/sitemap.xml . In addition, it has the information of how often the website pages are updated so search engines crawlers can come and refresh the information.
You can learn more about sitemap and its protocol at sitemaps.org.How To Add Pages to Sitemap
Every Site Page has the property "In Sitemap" which is seen on the Site Page section in the Admin Panel.
You can change this property either in the settings of each Site page or making a mass action at the Site Page section: Mass Actions -> Check All -> Include to the sitemap / Exclude from the sitemap.Sitemap Generation
Sitemap is generated by the function "generate_sitemap" of the module "sitemap".
In order to rebuild the sitemap after making changes in the set of Site Pages included to the sitemap please go to the Admin Panel -> System Settings -> Sitemap -> Sitemap cache -> Rebuild (please refer to the screenshot below). Alternatively, you can wait for the task_scheduler launch.
Built sitemap is stored in cache to speed up the sitemap rebuilding process.Sitemap Settings
Sitemap settings are located in the Admin Panel -> System Settings -> Sitemap.
The button "Rebuild" for Sitemap cache allows to manually rebuild sitemap content after adding or removing Site Pages to sitemap list.
"Crawler visit frequency" indicates the preferred frequency for search crawlers visit, which, in fact, may not affect the crawlers.
This value is considered as a hint and not a command. Even though search engine crawlers may consider this information when making decisions, they may crawl pages marked "hourly" less frequently than that, and they may crawl pages marked "yearly" more frequently than that. Crawlers may periodically crawl pages marked "never" so that they can handle unexpected changes to those pages.