[resolved] Strange links and crawling errors (22 posts)

  1. Nils
    Member
    Posted 1 year ago #

    Heya people,

    I've been getting very weird 301 crawl errors in Google Webmaster Tools since the 21st of March. I have no idea what causes them; the affected URLs have a very unusual, nested structure.

    The website I'm talking about is http://mafia-daily.net/, and one of these strange links looks like this:
    m2story/%7C%7Chttp://mafia-daily.net/m1screenshots/%7C%7Chttp://mafia-daily.net%7C%7Chttp://mafia-daily.net/2010/08/innige-bande-ueber-die-runden-kommen-die-letzten-beiden-videoclips-sind-online/%7C%7Cnone%7C%7Chttp://mafia-daily.net/?attachment_id=8551%7C%7Cnone%7C%7Cnone%7C%7Chttp://mafia-daily.net/category/mafia1/%7C%7Chttp://mafia-daily.net/category/mafia2/%7C%7Chttp://mafia-daily.net/category/mafia3/%7C%7Chttp://mafia-daily.net/category/downloads/

    It seems to be assembled at random from different pages, posts, categories and attachments. The individual parts are separated by two pipes, || (URL-encoded as %7C%7C).
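
A minimal Python sketch of how such a chained path can be taken apart to see which URLs it was built from. The helper name `split_link_chain` is hypothetical, and the sample string is a shortened version of the path quoted above:

```python
from urllib.parse import unquote


def split_link_chain(path):
    """Percent-decode a crawled path and split it on the "||" separator."""
    decoded = unquote(path)  # "%7C" decodes to "|"
    return [part for part in decoded.split("||") if part]


# Shortened version of the path from the crawl error report.
chain = (
    "m2story/%7C%7Chttp://mafia-daily.net/m1screenshots/"
    "%7C%7Chttp://mafia-daily.net%7C%7Cnone"
    "%7C%7Chttp://mafia-daily.net/category/mafia1/"
)

parts = split_link_chain(chain)
for part in parts:
    print(part)
```

Each printed part is either an ordinary URL of the site or the literal word "none", which suggests the chain is a serialized list of navigation targets rather than a link someone typed.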

    The only thing I have changed since the 21st of March was a little tweak in the theme's functions.php, but I have reverted to the old version and I'm still getting the errors. I have also been using the UberMenu plugin for a few days, but it's not creating links like the one above.
    I've got more than 200 of these errors at the time of writing, and more every day. I have absolutely no clue where Google finds these links.

    Has any of you seen something like this before?
    Any help would be greatly appreciated.

    Thanks and happy Easter,
    Nils

  2. bottleneck
    Member
    Posted 1 year ago #

    "The only thing I have changed since the 21st of March was a little tweak in the functions.php of the theme,"

    Just mark all those errors as fixed and wait for the next crawling.

    If the errors come back, try running the Broken Link Checker plugin.

  3. Nils
    Member
    Posted 1 year ago #

    I marked them as fixed a few days ago; they all came back, and there are more every day.

    Broken Link Checker didn't find any such links either.
    But thanks anyway.

    I mean, what kind of link is this? Never seen such "link chains".

  4. bottleneck
    Member
    Posted 1 year ago #

    Run this database query

    SELECT * FROM wp_posts WHERE post_content LIKE '%\%7C\%7Chttp%';

    See when it started.

  5. Nils
    Member
    Posted 1 year ago #

    I've got a complete database backup here on my hard drive. Opened it with Notepad++ and looked even for "%7C", but the search returned no result. Such links definitely do not exist on my site. That's what's so weird.

    I also thought about my cache plugin (Hyper Cache), but the cached files do not contain any | or %7C either.

    Something else must "create" these link chains.

  6. G Yohannes
    Member
    Posted 1 year ago #

    I am repeatedly getting 'new user registration' messages with email addresses I am not familiar with. Can anyone explain why? Thank you.

  7. Nils
    Member
    Posted 1 year ago #

    Maybe you should create your own topic.

  8. bottleneck
    Member
    Posted 1 year ago #

    I can't force you but please run that query to see the difference.

  9. Nils
    Member
    Posted 1 year ago #

    OK, I'll do so right now.

  10. Nils
    Member
    Posted 1 year ago #

    Just ran the command in wp_posts table. No results found.

  11. bottleneck
    Member
    Posted 1 year ago #

    There must be something which triggered Google.

    Got a raw access log? You could filter the 'googlebot' requests or the 'HTTP/1.1" 301' responses into a single file and look through it.
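
One possible way to do that filtering, sketched in Python. It assumes the server writes the common "combined" log format; the function name `googlebot_redirects` and the two sample lines are invented for illustration and are not taken from the real log:

```python
import re

# Combined log format:
# ip ident user [time] "METHOD path PROTO" status size "referer" "agent"
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<proto>[^"]*)" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def googlebot_redirects(lines):
    """Yield (time, path, referer) for Googlebot requests answered with a 301."""
    for line in lines:
        m = LOG_LINE.match(line)
        if not m:
            continue  # skip lines that are not in the assumed format
        if m["status"] == "301" and "googlebot" in m["agent"].lower():
            yield m["time"], m["path"], m["referer"]


# Invented sample lines; a real run would iterate over open("access.log").
sample = [
    '192.0.2.1 - - [22/Mar/2013:10:15:32 +0100] '
    '"GET /m2story/%7C%7Chttp://mafia-daily.net/ HTTP/1.1" 301 0 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '203.0.113.7 - - [22/Mar/2013:10:16:01 +0100] '
    '"GET / HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
]

hits = list(googlebot_redirects(sample))
for time, path, referer in hits:
    print(time, path, referer)
```

Keeping the referer in the output is deliberate: if a browser or bot followed a link from a real page, the referer may point at the page that contained it.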

  12. bottleneck
    Member
    Posted 1 year ago #

    My last piece of advice would be to check your posts via the HTML tab.

    I personally have seen strings of %20%20%20%20 inserted before http, making the weirdest links.

    I blame one of the plugins, won't tell which one, I can't prove it.

  13. Nils
    Member
    Posted 1 year ago #

    I can only find two logs: error.log and access.log.
    Looking through the access.log I could find an entry of these links way before 21st of March. Seems as if Google never had a problem with these links until now.

    Most probably a plugin. I'll try to find out when it first occurred and which plugin was activated at that time.

    Thank you so far for your help. :)

  14. bottleneck
    Member
    Posted 1 year ago #

    Sure. You are welcome. Let us know. :)

  15. Nils
    Member
    Posted 1 year ago #

    I found the date on which the strange links were first created. It was the 8th of March. On that day I added the plugin "WP Keyboard Navigation aka QuicKeys". This plugin enables navigation through the pages and categories via keyboard shortcuts.

    That seems to be the problem. I will deactivate it now and wait until tomorrow to see whether new crawl errors appear.

    Fingers crossed!

    Edit: Just saw that it was even removed from the repository....
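
A rough Python sketch of how the first occurrence can be located in an access log: count the lines containing "%7C%7C" per day and take the earliest date. It assumes timestamps in the usual `[08/Mar/2013:09:00:00 +0100]` form and English month abbreviations; the function name `chain_hits_per_day` and the sample lines (including the year) are made up for illustration:

```python
import re
from collections import Counter
from datetime import datetime

TIMESTAMP = re.compile(r"\[([^\]]+)\]")


def chain_hits_per_day(lines, marker="%7C%7C"):
    """Count log lines containing the marker, grouped by calendar date."""
    counts = Counter()
    for line in lines:
        # Percent-encoding is case-insensitive, so compare in lower case.
        if marker.lower() not in line.lower():
            continue
        m = TIMESTAMP.search(line)
        if not m:
            continue
        when = datetime.strptime(m.group(1), "%d/%b/%Y:%H:%M:%S %z")
        counts[when.date()] += 1
    return counts


# Invented sample lines; a real run would iterate over open("access.log").
sample = [
    '192.0.2.1 - - [07/Mar/2013:12:00:00 +0100] "GET / HTTP/1.1" 200 512 "-" "x"',
    '192.0.2.1 - - [08/Mar/2013:09:00:00 +0100] "GET /a/%7C%7Chttp://b HTTP/1.1" 301 0 "-" "x"',
    '192.0.2.1 - - [08/Mar/2013:09:05:00 +0100] "GET /c/%7c%7chttp://d HTTP/1.1" 301 0 "-" "x"',
    '192.0.2.1 - - [09/Mar/2013:10:00:00 +0100] "GET /e/%7C%7Chttp://f HTTP/1.1" 301 0 "-" "x"',
]

per_day = chain_hits_per_day(sample)
first_seen = min(per_day) if per_day else None
print(first_seen, dict(per_day))
```

The same per-day counts can be checked again after deactivating a plugin to confirm that no new chained requests are being generated by real pages.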

  16. G Yohannes
    Member
    Posted 1 year ago #

    Can anybody tell me why I am repeatedly getting the message below? Thank you.

    New user registration on your site KSI AFRICANA:
    Username: alicja-panek
    E-mail: alicja-panek@o2.pl

  17. Nils
    Member
    Posted 1 year ago #

    Why don't you create your own thread? Your issue has nothing to do with this topic.

  18. Nils
    Member
    Posted 1 year ago #

    Just a quick update on this. Since I removed the offending plugin, there have been no new entries in the access.log containing "%7C%7C". Google Webmaster Tools currently shows yesterday's results, so I have to wait until tomorrow to see today's. Hopefully the errors are gone.

    I'll report again tomorrow!

  19. bottleneck
    Member
    Posted 1 year ago #

    I already picture your happy smile tomorrow!

    Disclaimer: Not associated with April Fools Day.

    :)

  20. Nils
    Member
    Posted 1 year ago #

    Here's the update, as promised.

    Crawl errors went down from 200+ to 56. Even though I completely removed the plugin and cleared the cache, Google is still trying to crawl such links. It happened only 5 times today, but it still happens for whatever reason. I expect the number of errors to drop below 10 by tomorrow. At least I hope so.

    Seems to be solved.

    By the way, I was in contact with the plugin author and he already updated the plugin. Such link chains should not happen again. Anyway, I don't want to use it again.

    Thanks again bottleneck, you saved me! :)

  21. bottleneck
    Member
    Posted 1 year ago #

    Hey. I didn't know that I saved you because I don't recall that I took a bullet for you. :) On a serious note, you are very welcome!

  22. Nils
    Member
    Posted 1 year ago #

    Crawl errors are going up again, to 89...

    Why is Google still trying to crawl these links even though they do not exist anymore? Very weird!

Topic Closed