How Google Indexing Works
A page must be indexed to appear in search results. Google's indexing pipeline has four stages: discovery, crawling, rendering, and indexing. Understanding each stage helps you diagnose why a page is missing from search results and how to fix it.
Discovery and Crawling
Google discovers URLs through three primary channels: sitemaps (XML files listing your URLs), links (following hyperlinks from already-known pages), and direct submission (URL Inspection tool in Search Console or the Indexing API).
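A sitemap is just an XML file listing your URLs. As a rough sketch, one can be generated with Python's standard library (the URLs and dates below are hypothetical placeholders):

```python
import xml.etree.ElementTree as ET

def build_sitemap(urls):
    """Build a minimal XML sitemap string from (loc, lastmod) pairs."""
    ns = "http://www.sitemaps.org/schemas/sitemap/0.9"
    urlset = ET.Element("urlset", xmlns=ns)
    for loc, lastmod in urls:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "lastmod").text = lastmod
    return ET.tostring(urlset, encoding="unicode", xml_declaration=True)

# Hypothetical site with a single page
sitemap = build_sitemap([("https://example.com/", "2024-01-15")])
```

Serve the result at a stable path (conventionally /sitemap.xml) and reference it from robots.txt or submit it in Search Console.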
Once discovered, Googlebot requests the page's HTML. The crawl rate depends on two factors: crawl rate limit (how fast your server can handle requests without degrading user experience) and crawl demand (how important and fresh Google considers the URL). High-authority, frequently-updated pages get crawled more often.
Robots.txt controls what Googlebot can crawl. If a URL is disallowed in robots.txt, Googlebot will not fetch it, but the page can still appear in search results if other pages link to it (showing a 'No information available' snippet). To prevent indexing, use a noindex meta tag instead, and make sure that page is not blocked by robots.txt: Googlebot can only obey a noindex tag on a page it is allowed to fetch.
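The crawl-control side of this is easy to check locally. A minimal sketch using Python's stdlib urllib.robotparser, with a hypothetical robots.txt that disallows a /drafts/ path:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block every crawler from /drafts/
robots_txt = """\
User-agent: *
Disallow: /drafts/
"""

rp = RobotFileParser()
rp.parse(robots_txt.splitlines())

# Googlebot will not fetch /drafts/ pages, though they can still be
# indexed from links alone; a noindex tag requires crawlability.
blocked = rp.can_fetch("Googlebot", "https://example.com/drafts/post")
allowed = rp.can_fetch("Googlebot", "https://example.com/blog/post")
print(blocked, allowed)  # False True
```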
Rendering
After fetching the HTML, Google renders the page to execute JavaScript. This is necessary for JavaScript-heavy frameworks like React, Angular, and Vue. Google uses a headless Chromium browser for rendering — the same engine as Chrome.
Rendering is resource-intensive, so Google queues pages for rendering separately from crawling. There can be a delay between when Google crawls the HTML and when it renders the JavaScript. During this gap, Google has only the server-rendered HTML to work with.
This is why server-side rendering (SSR) and static generation (SSG) are critical for SEO. Content in the initial HTML response is processed as soon as the page is crawled; content that requires JavaScript execution may wait hours or days for the render queue. Next.js App Router defaults to Server Components, which render content into the initial HTML response.
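To see the difference, consider what Google has before the render queue runs: only the raw HTML string. A small sketch, using Python's stdlib html.parser and hypothetical markup, that extracts the pre-render visible text of a client-rendered page versus a server-rendered one:

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect visible text, skipping script bodies (roughly what a
    crawler sees before JavaScript executes)."""
    def __init__(self):
        super().__init__()
        self.in_script = False
        self.chunks = []
    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.in_script = True
    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False
    def handle_data(self, data):
        if not self.in_script and data.strip():
            self.chunks.append(data.strip())

def visible_text(html):
    p = TextExtractor()
    p.feed(html)
    return " ".join(p.chunks)

# Client-rendered page: the article only exists after JS runs.
csr_html = '<html><body><div id="root"></div><script>render()</script></body></html>'
# Server-rendered page: the same content ships in the initial HTML.
ssr_html = '<html><body><div id="root"><h1>My Article</h1></div></body></html>'

print(repr(visible_text(csr_html)))  # '' — nothing to index until rendering
print(repr(visible_text(ssr_html)))  # 'My Article'
```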
Indexing and Canonicalization
After rendering, Google processes the page content: extracting text, identifying the topic, evaluating quality, and determining the canonical URL. Not every crawled page gets indexed — Google may choose to skip pages it considers low-quality, duplicate, or not useful.
Canonicalization is Google's process of choosing which URL to keep when multiple URLs serve similar content. Google considers several signals: canonical tags, internal links, redirects, sitemap entries, and HTTPS vs HTTP. The canonical tag is a strong hint, but Google may override it if other signals disagree.
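The canonical tag itself is easy to inspect programmatically, which is useful when auditing for canonicals that point at the wrong URL. A minimal sketch with Python's stdlib parser (the markup and URL are hypothetical):

```python
from html.parser import HTMLParser

class CanonicalFinder(HTMLParser):
    """Find the href of <link rel="canonical"> in a page's HTML."""
    def __init__(self):
        super().__init__()
        self.canonical = None
    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "link" and a.get("rel") == "canonical":
            self.canonical = a.get("href")

head = '''<head>
  <link rel="canonical" href="https://example.com/guide/indexing" />
</head>'''

finder = CanonicalFinder()
finder.feed(head)
print(finder.canonical)  # https://example.com/guide/indexing
```

If the printed URL is not the page you fetched, Google is being told to index a different URL instead.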
Common indexing problems include: noindex directives (intentional or accidental), canonical pointing to a different URL, low-quality or thin content, and server errors during crawling. The URL Inspection tool in Search Console shows the exact indexing status and any issues Google encountered.
The Indexing API and IndexNow
For time-sensitive content, waiting for Google to discover and crawl your page is too slow. The Google Indexing API lets you notify Google immediately when a page is published or updated. Officially it is limited to pages with JobPosting or BroadcastEvent structured data, though in practice it is widely used beyond those types.
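The publish call takes a small JSON body naming the URL and whether it was updated or deleted. A sketch of just the request body, with OAuth and the actual HTTP call omitted (the page URL is hypothetical):

```python
import json

# Documented publish endpoint for the Google Indexing API
ENDPOINT = "https://indexing.googleapis.com/v3/urlNotifications:publish"

def build_notification(url, deleted=False):
    """JSON body for an Indexing API notification (auth not shown)."""
    return {
        "url": url,
        "type": "URL_DELETED" if deleted else "URL_UPDATED",
    }

body = json.dumps(build_notification("https://example.com/jobs/123"))
# POST `body` to ENDPOINT with an OAuth 2.0 bearer token from a
# service account that is verified as an owner in Search Console.
```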
IndexNow is an open protocol supported by Bing, Yandex, and other search engines. When you publish or update a page, you ping the IndexNow endpoint with the URL, and the search engine prioritizes crawling it. Unlike the Indexing API, IndexNow does not require OAuth setup; ownership is proven by a key file hosted on your domain.
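A batch IndexNow submission is a single JSON POST. A sketch of the payload, assuming a hypothetical host and key (the key must also be served as a plain-text file on your domain so the search engine can verify ownership):

```python
import json

def build_indexnow_payload(host, key, urls):
    """JSON body for a batch IndexNow submission. The same key string
    must be served at the keyLocation URL as a plain-text file."""
    return {
        "host": host,
        "key": key,
        "keyLocation": f"https://{host}/{key}.txt",
        "urlList": urls,
    }

payload = build_indexnow_payload(
    "example.com",
    "a1b2c3d4e5f6",  # hypothetical key
    ["https://example.com/new-post"],
)
body = json.dumps(payload)
# POST `body` to https://api.indexnow.org/indexnow with
# Content-Type: application/json; charset=utf-8
```

Participating engines share submissions with each other, so one ping is enough.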
Indxel automates both protocols. When you deploy, it detects new and changed URLs from your sitemap, submits them via the Indexing API and IndexNow, and tracks whether each submission results in successful indexing. No manual URL Inspection tool clicks required.
Diagnosing Indexing Issues
When a page is not indexed, use the URL Inspection tool in Search Console to check its status. Common statuses include: 'Crawled — currently not indexed' (Google crawled it but chose not to index it), 'Discovered — currently not indexed' (Google knows about it but has not crawled it yet), and 'Excluded by noindex tag'.
'Crawled — currently not indexed' usually means Google considers the page low-quality or a near-duplicate of another page. Improve the content, add unique value, and strengthen internal links to the page. Resubmit via URL Inspection after making changes.
'Discovered — currently not indexed' means Google has not prioritized crawling the URL yet. This is common for new pages on sites with lower crawl demand. Submit the URL via URL Inspection, add it to your sitemap, and ensure other pages link to it. Indxel monitors these statuses and auto-retries submission for stuck pages.
Serving and Ranking
Once indexed, pages enter Google's serving infrastructure. When a user searches, Google retrieves relevant pages from the index, scores them against hundreds of ranking signals, and returns the results page.
Ranking signals include relevance (does the page match the query intent), quality (E-E-A-T, backlinks, content depth), and user experience (Core Web Vitals, mobile-friendliness, HTTPS). The exact weights are not public, but Google has confirmed that content relevance and backlinks are among the strongest signals, with page experience factors like Core Web Vitals playing a smaller, tie-breaking role.
Pages can be removed from the index over time if they become inaccessible (server errors), get noindexed, lose all internal and external links, or if Google re-evaluates their quality. Ongoing monitoring ensures your pages stay indexed and ranked.
Frequently asked questions
How long does Google take to index a new page?
Anywhere from minutes to weeks. High-authority sites with frequent crawls can get pages indexed within hours. New sites with few backlinks may wait days or weeks. Using the Indexing API or URL Inspection tool can speed up the process.
Why is my page not indexed by Google?
Common reasons: noindex tag (intentional or accidental), canonical pointing elsewhere, blocked by robots.txt, low-quality or thin content, or server errors during crawling. Check the URL Inspection tool in Search Console for the specific reason.
Does JavaScript affect indexing?
Yes. Google renders JavaScript but queues it separately from crawling. Content that requires JavaScript may take longer to index than server-rendered HTML. Use SSR or SSG (like Next.js Server Components) to ensure content is in the initial HTML response.
Can I force Google to index a page?
You can request indexing via the URL Inspection tool or the Indexing API, but Google decides whether to index the page. Submitting a request prioritizes crawling but does not guarantee indexing — the page must still meet quality standards.
Related guides
SEO Monitoring Tools
Monitoring is how you catch SEO regressions after they ship. A deploy removes a canonical tag, a CMS update breaks structured data, a new page launches without a description. Monitoring tools alert you before rankings drop.
SEO Automation Tools for Developers
Most SEO tools are built for marketers. Developers need automation that fits into their workflow: CLI commands, CI/CD pipelines, and programmatic APIs. Here is the landscape of SEO automation tools built for the dev workflow.
The Complete Next.js SEO Guide
Next.js App Router changed how metadata works. Static exports, generateMetadata, and file-based conventions replaced the old Head component. This guide covers everything you need to ship SEO-correct Next.js apps.