Viewing 3 replies - 1 through 3 (of 3 total)
  • I’m having the exact same problem, but I’m getting over 2,000 warnings. It says “Sitemap contains URL’s which are blocked by robots.txt” and it refers to this: wp-content/uploads, which is not blocked by the robots.txt file.

    Here’s what I have on my robots.txt file:

    User-agent: *

    Disallow: /feed/
    Disallow: /cgi-bin/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /trackback/
    Disallow: /xmlrpc.php
    Disallow: ?wptheme=
    Disallow: /unpublished/

    Allow: /tag/mint/
    Allow: /tag/feed/
    Allow: /wp-content/online/
    Allow: /wp-content/uploads/

    Sitemap: http://www.singleinstilettos.com/sitemap_index.xml
    Sitemap: http://www.singleinstilettos.com/post-sitemap.xml
    Sitemap: http://www.singleinstilettos.com/page-sitemap.xml
    Sitemap: http://www.singleinstilettos.com/attachment-sitemap.xml

    User-agent: ia_archiver
    Disallow: /

    Can someone give both of us some advice on how to fix this? Thanks!!

    joycegrace

    (@joycegrace)

    You are probably noindexing some pages or category archives in the WordPress SEO settings while still including them in the sitemap, where they should not be. If that doesn’t solve it, I would go to the sitemap settings, find the content that robots.txt excludes, and force it to “never include.” Or you can set the index option to “always index” and keep it in the sitemap. The two should agree: if something is in the sitemap, it should not be blocked by the robots.txt file. Also note that you can have a manual robots.txt file and still have other pages noindexed by the WordPress SEO plugin; those won’t show up in the robots.txt file on your server.
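To see which sitemap entries a robots.txt file actually blocks, a rough check along these lines may help. This is only a sketch: the rules and the `example.com` URLs are made-up placeholders, `blocked_urls` is a hypothetical helper, and Python's `urllib.robotparser` applies the first matching rule, which is not necessarily how Google resolves Allow/Disallow conflicts, so treat the result as a hint rather than a verdict.

```python
from urllib.robotparser import RobotFileParser

# Placeholder rules for illustration; paste your own robots.txt here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /feed/
Disallow: /wp-admin/
Disallow: /category/
Allow: /wp-content/uploads/
"""


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


# Hypothetical sitemap entries; in practice, take these from your sitemap XML.
sitemap_urls = [
    "http://www.example.com/wp-content/uploads/2014/01/photo.jpg",
    "http://www.example.com/category/news/",
    "http://www.example.com/feed/",
    "http://www.example.com/about/",
]

for url in blocked_urls(ROBOTS_TXT, sitemap_urls):
    print("in sitemap but blocked:", url)
```

Any URL this prints should either be dropped from the sitemap or unblocked in robots.txt, so that the two agree.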

    I have a problem with URLs blocked by robots.txt: the count is up to 60. I have lost track of which setting is blocking these URLs. My robots.txt settings are below. Any suggestions, please?

    Sitemap: http://www.indonesiabeautyful.com/sitemap.xml
    Sitemap: http://www.indonesiabeautyful.com/sitemap.xml.gz

    User-agent: Googlebot-Image
    Disallow:

    User-agent: Mediapartners-Google
    Disallow:

    User-agent: duggmirror
    Disallow: /

    User-agent: *
    Disallow: /cgi-bin/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /wp-content/plugins/
    Disallow: /wp-content/cache/
    Disallow: /wp-content/themes/
    Disallow: /trackback/
    Disallow: /feed/
    Disallow: /comments/
    Disallow: /category/
    Disallow: /trackback/
    Disallow: /feed/
    Disallow: /comments/
    Disallow: /?

  • The topic ‘Sitemap URLs blocked by Robots.txt’ is closed to new replies.