PDF to HTML plugin? (accessibility and responsive need)

  • Is there a plugin to convert PDF to HTML on the fly, to display the content in HTML inside a WordPress website page?

    I work on a website that has several hundred PDF documents. I am trying to get these created as accessible PDFs, so that exporting to HTML will be easier to handle. However, rendering the HTML one document at a time and uploading each as a page would be a big task, and could present problems when a PDF document is updated. It would be better to render on the fly (and cache, I suppose).

    I would like to do this both to improve accessibility and to make sure the content is responsive (which PDFs are not, really).

    I see many plugins that go from HTML to PDF but none that go the other way around. Seems like an opportunity for a developer 🙂

Viewing 2 replies - 1 through 2 (of 2 total)
  • Moderator bcworkz

    (@bcworkz)

    IMHO, dynamically converting a PDF document on every request would be very inefficient. AFAIK the process is computationally very “expensive”. Of course caching would be a huge benefit. Even better would be to batch process the files one time and save the result in a non-volatile manner. There are a number of command line utilities that will do such a conversion. These typically work on one file at a time, but a batch script could be created to process all files in a given folder.
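
    To make the batch idea concrete, here is a rough Python sketch, not a tested production script. It assumes Poppler's pdftohtml is the command line utility and that the -s and -noframes flags give a single HTML file per PDF; the function names and the folder layout are made up for illustration. It only reconverts a PDF when the PDF is newer than its saved HTML, which also takes care of the "what if a PDF is updated" worry.

```python
import subprocess
from pathlib import Path


def needs_conversion(pdf: Path, html: Path) -> bool:
    """True when no HTML exists yet, or the PDF changed after the HTML was built."""
    return not html.exists() or pdf.stat().st_mtime > html.stat().st_mtime


def build_command(pdf: Path, html: Path) -> list[str]:
    # Assumption: Poppler's pdftohtml is on PATH. The flags are meant to produce
    # one frameless HTML document; output naming can vary, so check your version.
    return ["pdftohtml", "-s", "-noframes", str(pdf), str(html)]


def convert_folder(pdf_dir: Path, out_dir: Path, dry_run: bool = False) -> list[Path]:
    """Convert every stale PDF in pdf_dir and return the HTML paths (re)built."""
    out_dir.mkdir(parents=True, exist_ok=True)
    rebuilt = []
    for pdf in sorted(pdf_dir.glob("*.pdf")):
        html = out_dir / (pdf.stem + ".html")
        if needs_conversion(pdf, html):
            if not dry_run:
                # check=True raises if the converter exits with an error.
                subprocess.run(build_command(pdf, html), check=True)
            rebuilt.append(html)
    return rebuilt
```

    Calling convert_folder(Path("pdfs"), Path("html"), dry_run=True) lists what would be rebuilt without running the converter. Run it from cron or after each upload and the saved HTML stays current.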

    The resulting HTML document is likely saved as a static .html file. It’s possible to use such files in a WP site. When such a file is requested, WP is not even involved; the behavior would be like any old skool web 1.0 site. This means WP theming, header, footer, etc. are not part of the content. You could embed a static .html file within a themed WP page by using <iframe>. It would also be possible to create a custom template that dynamically embeds an .html file in a WP page.

    There may even be a plugin that will import static .html files into individual WP pages. If you know PHP coding, it wouldn’t be too difficult to create such a tool. It’d basically read file content and save it with wp_insert_post().
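
    If PHP isn't your thing, the same import could be scripted from outside WP. The sketch below is in Python and swaps in WP-CLI's `wp post create` command instead of calling wp_insert_post() directly. It assumes WP-CLI is installed and run from the WordPress root, and the helper names are hypothetical. The regex that pulls out the body content is deliberately simple and may not cope with unusual markup.

```python
import re
from pathlib import Path

# Greedy match so everything between the first <body ...> and the last </body> is kept.
BODY_RE = re.compile(r"<body[^>]*>(.*)</body>", re.IGNORECASE | re.DOTALL)


def extract_body(html: str) -> str:
    """Return only the content inside <body>, or the whole input if there is none."""
    match = BODY_RE.search(html)
    return (match.group(1) if match else html).strip()


def wp_cli_command(content_file: Path, title: str) -> list[str]:
    # Assumption: WP-CLI is available as "wp"; the page is created as a draft
    # so it can be reviewed before publishing.
    return [
        "wp", "post", "create", str(content_file),
        "--post_type=page",
        f"--post_title={title}",
        "--post_status=draft",
    ]
```

    The idea would be to write extract_body(...) of each converted file to a temporary file, then pass wp_cli_command(...) to subprocess.run(). Since the content lands in a normal page, the theme's header and footer apply as usual.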

    Thread Starter mediaboxca

    (@mediaboxca)

    Thank you bcworkz!

    All good points. I’d rather keep the look and feel of the site on all pages, so automating may not be the best solution. If I can get the design team to create well-tagged PDFs, then at least I will have won the first battle. I might be able to get away with copying and pasting from the exported HTML into the “code” section in the body of the page, and get the results I wanted in the first place. A little more tedious, but probably worth it.
