Components of Robot.txt File UserAgent, Disallow, Allow & Sitemap (original) (raw)

Last Updated : 26 Jul, 2025

The robots.txt file in SEO acts as a gatekeeper, before any good bots entering to your website they first visit the robots.txt file and read which pages are allowed to crawl and which are not.

A robots.txt file tells the Google crawler bot which URLs the crawler can access on your website.

Example of Robot.txt File

You can also visit our robots.txt file by this URL: https://www.geeksforgeeks.org/robots.txt

User-agent: *
Disallow: /wp-admin/
Disallow: /community/
Disallow: /wp-content/plugins/
Disallow: /content-override.php
User-agent: ChatGPT-User
Disallow: /

Components of Robot.txt File

Now lets explain above code

**User-agent means bots.
- *** means all.
**Disallow means if the URL contains this keyword don’t crawl.

For example:

If we put **Disallow on the URL https://www.geeksforgeeks.org/wp-admin/image.jpg and do not allow it to be crawled.

Now even if the URL is changed to **https://www.geeksforgeeks.org/news//wp-admin/image.jpg
then also **https://www.geeksforgeeks.org/new/wp-admin/image.jpg/ is not allowed to crawl (although **https://www.geeksforgeeks.org/news/ is allowed to crawl).

**User-agent: ChatGPT-User
- blocks the ChatGPT bot from crawling the whole website.

User-agent: *
Disallow: /

Above code block all web crawler to visit any page of website.

**Note: If you want any URL to deindex from Google Search quickly you can use Google Search Console removal request from your GSC account.