Crawl Rate
Crawl rate is the number of requests a search engine crawler makes to a website in a day and was introduced to reduce server overload. Due to sophisticated algorithms, Google is able to determine and set an optimal crawl budget for individual sites, this is covered within our SEO Office Hours Notes along with further best practice advice.
Googlebot Can Recognise Faceted Navigation & Slow Down Crawling
Googlebot understands URL strucures well and can recognise faceted navigation and will slow down when it realises where the primary content is and where it has strayed from that. This is aided by GSC parameter handling.
Googlebot Crawls Known URLs Faster Following Significant Site Changes
Googlebot will crawl known URLs faster if it detects that significant changes have been made across the site to things like structured data, rel canonicals or redirects.
Images Crawled Less Frequently Than Pages
Images are crawled less frequently than content, so if these are migrated, to a CDN for example, it will take Googlebot longer to recrawl these.
Crawl Rate Issues Can be Reported For Manual Review
If a site is being crawled too much or too little you can report a problem with Googlebot in the Help Center and this can be adjusted by a Google engineer.
Google Temporarily Reduces Crawl Rate After A Platform Change
When a site changes its hosting platform Google will be more conservative about how much they crawl as they don’t know how much load the new server set up will support.
Crawl Frequency Attribute in XML Sitemaps Doesn’t Impact Crawl Rate
Google takes no notice of the crawl frequency attribute in XML sitemaps or any priority set. Only the last modification timestamp will impact crawl rate.
Google Uses Scheduler to Determine Recrawl Date
Google uses a scheduler before crawling to work out when they need to recrawl URLs. Google will increase crawl rate if it gets signals that it needs to do so e.g. updated modification date in sitemaps and internal linking (especially from the homepage).
Setting A Higher Crawl Rate Doesn’t Guarantee Google Will Crawl More
Setting your crawl rate to high means Google can crawl more but that doesn’t mean they will.
Views of Cached AMP Pages Won’t Appear in Log Files
AMP pages may make log files look unusual as Google is requesting the page to update the cache but the page will appear to receive little traffic because users see the cached version of the page.
A Noindex Reduces Crawl Rate
A page with a noindex tag will be crawled less frequently.