Viewing 10 replies - 1 through 10 (of 10 total)
  • Hi @kvm1!
    If you browse to one of those URLs what are you seeing? You should be seeing a blank page. Occasionally we see themes that implement the WordPress template redirect hook incorrectly which results in all requests that have a query string redirecting to the sites frontpage. In those cases, the wordfence_lh URL will return the full website instead of a blank page. This will cause Google to think it’s an actual page it should index. There is also a known issue with WPX hosting. They block all wordfence_lh requests. If you are on WPX hosting you can either disable Wordfence Live Traffic or you can reach out to WPX hosting and ask that they remove the block.

    Thread Starter mIPs

    (@kvm1)

    Hi @wfasa,
    if I browse to one of the URLs, I see a blank page. We are not hosting on WPX hosting. So, if I understand you right, I should speak with the developers of the theme?

    Hi @kvm1,
    If you are seeing a blank page when you visit those URLs then everything is as it should be. You do not have to reach out to your theme devs.

    It has to be something else going wrong then. What exactly is the error you are getting from Google?

    Thread Starter mIPs

    (@kvm1)

    Hi @wfasa,

    this is the error description I see (translated from german with google translator):

    “Googlebot could access your website, but certain URLs are blocked for the Googlebot mobile for smartphones in your robots.txt file. These URLs do not appear in the Blocked URLs report and may have been unintentionally blocked.”

    Hi again,
    So it’s saying that it’s being blocked via your robots.txt file when browsing via mobile. Have you checked the robots.txt file to see if there is anything in there that shouldn’t be there?

    Thread Starter mIPs

    (@kvm1)

    Hey @wfasa,

    this is the content of the robots.txt:

    # global
    User-agent: *
    Disallow: /cgi-bin
    Disallow: /wp-admin
    Disallow: /wp-includes
    Disallow: /wp-content/plugins
    Disallow: /wp-content/cache
    Disallow: /wp-content/themes
    Disallow: /trackback
    Disallow: /feed
    Disallow: /comments
    Disallow: /category/*/*
    Disallow: */trackback
    Disallow: */feed
    Disallow: */comments
    Disallow: /*?*
    Disallow: /*?
    Disallow: /readme.html

    User-agent: Cliqzbot
    Disallow: /

    User-agent: SemrushBot
    Disallow: /

    User-agent: MJ12bot
    Disallow: /

    Disallow: /widerrufsbelehrung
    Disallow: /support/supportformular

    I do not see anything suspicious there!?

    You are disallowing all requests with query strings from all user agents including Google.

    This means “rules for everyone including Google”
    User-agent: *

    And these rules say “anything with ? (query string) is disallowed

    Disallow: /*?*
    Disallow: /*?

    The /?wordfence_lh follows that pattern so that would be why it’s blocked.

    I’ll be honest I’m not an SEO expert but I don’t think it should be necessary to block anything for Google bot. It should behave and not crawl things it’s not supposed to.

    • This reply was modified 7 years, 8 months ago by wfasa.
    Thread Starter mIPs

    (@kvm1)

    That makes a lot of sense. I’ve removed the “Disallow: /*?*” and kept the “Disallow: /*?”, and will monitor if this brings any change to the Search Console error log. I’ll write a reply with the results on monday.

    Thread Starter mIPs

    (@kvm1)

    Hi @wfasa,

    I removed the lines
    Disallow: /*?
    Disallow: /*?*

    It took some days, but now there are no more errors reported.

    Thanks for your assistance.

    That’s good news. Thanks for the update @kvm1!

Viewing 10 replies - 1 through 10 (of 10 total)

The topic ‘Loads of “https://url.com/?wordfence_lh=1&hid=***” errors in search console’ is closed to new replies.