If the need arises to check how many of a site’s pages are indexed (ie. those URLs that are returned in search results) many SEOs’ first reaction is to fall back on the site: query, but in fact this query returns all URLs that Google is aware of, including canonicalized pages, so is probably inaccurate.
With this in mind, let’s go through some more accurate ways to find indexed pages, and then a method for combining them to find a more accurate final figure.
Index Status report (Google Index > Index Status)
This high-level report will show you the number of pages in the index, which can be compared against your own counts, although it can include duplicates that might be filtered out of ‘real-world’ search results.
Sitemap indexation count (Crawl > Sitemaps)
Will show you an indexation count for your Sitemaps. If the number of submitted pages matches the number of indexed pages, you know that 100% of the URLs are indexed. If you have Sitemaps with fewer files in each, you are more likely to be able to make use of this information.
Unique URL Sitemap indexation count
Google’s John Mueller recommends creating a Sitemap file containing all of your unique URLs, then submitting this in Search Console. This will give you more accurate information on the number of unique indexed URLs.
Pages with SERP impressions (Search Traffic > Search Analytics)
This is the more accurate way to check whether a page is indexed, however it can only be used to check one URL at a time.
1. Download and combine all the URLs from these sources in Excel, then de-dupe:
2. Crawl the URLs as a List Crawl to get the Unique Pages count:
You can either paste up to 2,000 URLs as a list, or upload a file.