[resolved] Yo mercime! XML Export Q! (4 posts)

  1. WebDev WaxLotus LLC
    Member
    Posted 5 years ago #

    Mercime (or anyone else, nosy!)

    I saw that you had something confident to say about XML exports...

    I am trying to help someone on this forum move from wp.com to her own blog. Her blog is mature (a 40M+ export file). The new host has agreed to suspend the server limits to allow her to move in.
    The problem is that she can't export an intact XML file from wp.com. Yes, a support ticket has been filed with wp.com...

    Stray tags, categories, and spam have all been deleted.

    It doesn't look like wp.com is going to respond. I looked into RSS scrapers and tools like FeedWP/wp-o-matic, which were promising (especially for the image caching, automatic category generation, etc.), but ultimately that route didn't work out.

    Do you have any suggestions?

    The ideal would be for wp.com (and org, haha) to allow export parameters: a specific category, author, tag, date range, etc., instead of one big glob. But that's dreaming. What works?

    BTW: I am aware of the option of splitting the large file into smaller ones, but first, I need (?) an uncorrupted file. What can I do to help manifest the uncorrupted file?
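
    For anyone checking whether an export really is corrupted, here is a minimal sketch that reports where an XML parser first gives up. It is Python and uses only the standard library; the function name and the file name in the usage note are made up for illustration, and nothing in this thread actually used it.

```python
import xml.etree.ElementTree as ET


def find_first_xml_error(path):
    """Return None if the file is well-formed XML,
    otherwise a (line, column, message) tuple for the first parse error."""
    try:
        # Stream through the file so a 40M+ export does not have to be
        # held in memory as a full tree.
        for _event, elem in ET.iterparse(path, events=("end",)):
            elem.clear()  # drop text and children we no longer need
    except ET.ParseError as err:
        line, column = err.position
        return line, column, str(err)
    return None
```

    Calling find_first_xml_error("export.xml") on a hypothetical export file gives a line and column to open in a text editor. It only checks well-formedness, so a file that passes can still be missing posts.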

    Thanks.

  2. WebDev WaxLotus LLC
    Member
    Posted 5 years ago #

    OK, so her blog runs to 32M+ per month! Hello! Luckily enough, wp.com's export coughs up the last 32M of post content into its file...

    So begins the laborious process: split each 32M file into smaller files, import them all (resolving character whammies along the way), then return to the source wp.com blog and delete only the posts that transferred successfully, including related media from the media manager and pages published in the same range (took me a while to figure that one out!).
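
    The splitting step can be sketched roughly like this. It is not the tool used here, just a minimal Python illustration with a made-up function name. It assumes the export is the usual RSS-style layout (a channel header followed by a run of <item> elements) and that no post body contains a literal </item> string.

```python
import re

# Each post, page, or attachment in the export is assumed to be one <item>.
ITEM_RE = re.compile(r"<item>.*?</item>", re.DOTALL)


def split_wxr(text, max_bytes):
    """Split export XML text into smaller documents.

    Every chunk repeats the original header (everything before the first
    <item>) and footer (everything after the last </item>), so each chunk
    is a complete file on its own. max_bytes limits the items only; the
    repeated header is not counted.
    """
    first = text.find("<item>")
    if first == -1:
        return [text]  # nothing to split
    last_end = text.rfind("</item>") + len("</item>")
    header, footer = text[:first], text[last_end:]

    chunks, current, size = [], [], 0
    for match in ITEM_RE.finditer(text, first, last_end):
        item = match.group(0)
        item_size = len(item.encode("utf-8"))
        # Start a new chunk when adding this item would exceed the limit.
        if current and size + item_size > max_bytes:
            chunks.append(header + "\n".join(current) + footer)
            current, size = [], 0
        current.append(item)
        size += item_size
    if current:
        chunks.append(header + "\n".join(current) + footer)
    return chunks
```

    Read the export as UTF-8 text, call split_wxr(text, 2 * 1024 * 1024) for roughly 2M pieces (the size is an arbitrary example), and write each returned string to its own file before importing them one by one.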

    Thankfully, the attachments (post thumbnails etc.) are pulled as part of the import process too! Rerunning an import does not corrupt things. Published posts are skipped, as are attachments. Unpublished stuff gets duplicated/created every time you import...

    Things are progressing.

  3. WebDev WaxLotus LLC
    Member
    Posted 5 years ago #

    Completed.

    A few regex ops to perform, a few plugins to replace, the sitemap to regenerate, flip the DNS switch and we're done.

    What a PITA.

  4. @mercime
    Volunteer Moderator
    Posted 5 years ago #

    yo musnake, just saw your post. glad you solved your challenge. 3 cheers :-)

This topic has been closed to new replies.
