• Resolved tgryphon

    (@tgryphon)


    Trying to import a podcast hosted on a PodHawk install, via RSS importer.
    The RSS feed in question contains just over 3000 episodes.
    When attempting to import using the RSS Importer on the live podcast, it stops at around episode 100 (i.e. it starts from the most recent episode and works backwards, stopping at roughly the 100th most recent).
    I attempted to download the feed XML and use that on a local install of WordPress with PowerPress to speed it up. It now stops at around episode 350 chronologically.

    Any suggestions to work around this?

Viewing 6 replies - 1 through 6 (of 6 total)
  • Plugin Support Shawn

    (@shawnogordo)

    Are you sure the source RSS feed you’re importing contains all of the episodes you’re trying to import? Is the feed still online? If so, please post a link to the feed URL so we can take a look.

    Thread Starter tgryphon

    (@tgryphon)

    Feed is still live: feed://djgoofywhitekid.com/podhawk/podcast.php

    The source feed does include all episodes; I had to set PodHawk to display all episodes in the feed.

    I’ve also noticed that PodHawk uses a redirect for the enclosure tags, so I’ll eventually have to figure out a way to change those to the actual episode MP3 files. I’d obviously rather get the metadata imported first before dealing with that.

    Plugin Support Shawn

    (@shawnogordo)

    Thanks for posting the feed URL. I took a quick look and you’re right, all of the episodes are in there.

    Once you have everything imported to PowerPress, you may be able to use the Find & Replace tool to fix the URLs within the enclosure tags.

    I will ask the lead developer of PowerPress to look at this thread, as he may have some insight as to why you’re having difficulty importing all of your episodes.

    Plugin Author Angelo Mandato

    (@amandato)

    Hello @tgryphon,

    The import uses the guid value to determine what has and has not been imported. Web servers commonly impose a timeout on PHP scripts, so if the script times out you have to repeat the import process to continue importing. When you re-import the same feed, it identifies the episodes (items) that were already imported by their unique guid value, skips those, and proceeds to import the next items in the feed that have not yet been imported. We’ve tested feeds with just over 1,000 items in our sandbox; it usually takes 3-4 tries to get all of the episodes imported.
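    The re-import behaviour described above can be illustrated with a minimal Python sketch. This is not PowerPress code (PowerPress is a PHP plugin); the function names `pending_items` and `import_pass`, and the `budget` parameter standing in for a script timeout, are hypothetical and exist only for illustration.

```python
import xml.etree.ElementTree as ET


def pending_items(feed_xml, imported_guids):
    """Return the <item> elements whose <guid> is not in imported_guids."""
    root = ET.fromstring(feed_xml)
    pending = []
    for item in root.iter("item"):
        guid = (item.findtext("guid") or "").strip()
        if guid and guid not in imported_guids:
            pending.append(item)
    return pending


def import_pass(feed_xml, imported_guids, budget):
    """Import at most `budget` new items, then stop.

    `budget` is a stand-in for a timeout: the pass ends early, but the
    guids recorded so far let the next pass resume where this one stopped.
    Returns the total number of items imported across all passes.
    """
    for item in pending_items(feed_xml, imported_guids)[:budget]:
        # A real importer would create the post here; this only records the guid.
        imported_guids.add(item.findtext("guid").strip())
    return len(imported_guids)
```

    Calling `import_pass` repeatedly with the same feed and the same `imported_guids` set makes progress on every pass until nothing is pending, which is why repeating the import eventually picks up every episode.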

    If repeating the process does not import any additional episodes, please let us know so we can investigate the issue. The good news is that your feed is formatted nicely, so it would be easy to slice the items into batches of 300, resulting in about 10 XML files to import. I do not think that is necessary, but it could be done as a last resort.
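    As a rough sketch of the batching idea, the following Python splits one RSS document into several smaller ones that each keep the channel metadata. The function name `split_feed` is hypothetical, and the sketch assumes a standard RSS 2.0 layout (`<rss><channel><item>…`) that uses the iTunes namespace.

```python
import copy
import xml.etree.ElementTree as ET

# Assumption: the feed uses the iTunes podcast namespace. Registering it keeps
# the "itunes:" prefix in the output instead of an auto-generated "ns0:".
ET.register_namespace("itunes", "http://www.itunes.com/dtds/podcast-1.0.dtd")


def split_feed(feed_xml, batch_size=300):
    """Split an RSS feed into XML strings of at most batch_size items each."""
    root = ET.fromstring(feed_xml)
    items = root.find("channel").findall("item")

    # Template: the full document with every <item> removed.
    template = copy.deepcopy(root)
    template_channel = template.find("channel")
    for item in template_channel.findall("item"):
        template_channel.remove(item)

    batches = []
    for start in range(0, len(items), batch_size):
        batch_root = copy.deepcopy(template)
        batch_channel = batch_root.find("channel")
        batch_channel.extend(
            [copy.deepcopy(item) for item in items[start:start + batch_size]]
        )
        batches.append(ET.tostring(batch_root, encoding="unicode"))
    return batches
```

    Each returned string can be written to its own file and imported separately. The original item order is preserved within and across batches.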

    Thread Starter tgryphon

    (@tgryphon)

    The import does identify what has already been imported from the live XML, but it still gets to item 100 or so and (I’m guessing) times out. Moving this to a local host got the number up to about 350 before the timeout. The stopping point wasn’t the same from run to run, so I’m guessing it was a timeout rather than an issue with the import function or the XML itself, though it could be a server memory setting, as I did not adjust those on either server.

    I was able to get everything imported by essentially importing the local version of the XML, waiting for a timeout, then chopping the XML at the episode last imported and saving as a new file, and restarting the import from the new file. This took seven iterations in my case.
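    The chop-and-restart workaround can be sketched in Python as follows. This is an illustration rather than the exact procedure used here: the function name `chop_after` is hypothetical, and it assumes the last imported episode is identified by its guid and that items appear in the feed in the order the importer processes them.

```python
import xml.etree.ElementTree as ET


def chop_after(feed_xml, last_guid):
    """Return the feed with every item up to and including last_guid removed."""
    root = ET.fromstring(feed_xml)
    channel = root.find("channel")
    items = channel.findall("item")
    guids = [(item.findtext("guid") or "").strip() for item in items]
    if last_guid not in guids:
        raise ValueError("guid not found in feed: %r" % last_guid)

    # Drop everything the importer has already handled.
    cut = guids.index(last_guid)
    for item in items[:cut + 1]:
        channel.remove(item)
    return ET.tostring(root, encoding="unicode")
```

    Saving the result as a new file and importing it means the next run starts at the first episode that has not been imported, instead of spending time re-scanning the ones that have.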

    I’m considering this resolved for my purposes, but I’m glad to keep the live XML in question up, or forward the full local XML, for troubleshooting.

    Plugin Author Angelo Mandato

    (@amandato)

    I am glad you were able to power through the import. 3,000 is an amazing number of episodes. I saved a copy of your feed to test with. Thanks!

  • The topic ‘Issues importing feed with many episodes’ is closed to new replies.