Question 1

Does robots.txt stop Google from indexing my pages?

Accepted Answer

Not entirely. Disallowing Googlebot from a URL prevents crawling, but Google can still index a page it has never crawled if other sites link to it — it just won't have read the content. To prevent indexing reliably, use a noindex meta tag or X-Robots-Tag header on the pages themselves. Robots.txt is for crawl budget management, not indexing prevention.

Question 2

Should I block AI crawlers in robots.txt?

Accepted Answer

This is a personal choice. Blocking AI crawlers prevents your content from being used to train language models. It has no effect on Google or Bing. Some publishers block them to protect their content commercially. Others allow them as they feel it increases their content's indirect reach. Neither choice affects your SEO with traditional search engines.

Question 3

What should I always disallow in robots.txt?

Accepted Answer

At minimum: /admin/, /login/, /wp-admin/ (if WordPress), /private/, any staging or development directories, /cart/, /checkout/ (e-commerce). These pages have no SEO value and allowing bots to crawl them wastes your crawl budget on non-indexable or sensitive pages.

Question 4

What is the Crawl-delay directive?

Accepted Answer

Crawl-delay tells a bot to wait a specified number of seconds between requests. Crawl-delay: 1 means wait 1 second between each page fetch. This reduces server load from aggressive crawlers. Note: Google does not support Crawl-delay in robots.txt — to control Googlebot's crawl rate, use Google Search Console settings instead.

Question 5

Where exactly should I place the robots.txt file?

Accepted Answer

Always at the root domain: example.com/robots.txt. Subdomains have separate robots.txt files: blog.example.com/robots.txt. A robots.txt for the root domain does not apply to subdomains. Most web servers are configured to serve files from the root directory automatically.

Robots.txt Generator

What Is robots.txt

AI Crawler Considerations

Frequently Asked Questions

Robots.txt Generator

What Is robots.txt

AI Crawler Considerations

Frequently Asked Questions

Related Tools