• Resolved yjajkiew

    (@yjajkiew)


    Hello,

    I have an indexation problem with WPSolr plugin : “docx” word format document are not indexed.

    For instance I have a media “test-1.docx” with the title “Test 1” and I can’t find it nor in the search page nor in the solr index !

    In the administration page, tab 2.2 Data, I have checked all the application/* format and resynced but it didn’t change anything.

    Thanks

    https://wordpress.org/plugins/wpsolr-search-engine/

Viewing 3 replies - 1 through 3 (of 3 total)
  • Working fine.

    Here is what I’ve done:
    – Upload my .docx in media library
    – Read the media type of the document: application/vnd.openxmlformats-officedocument.wordprocessingml.document
    – Check the media type in tab “2.2 Data”
    – One more document appeared in the remaining docs to be indexed in tab “4. Send your data”
    – Index
    – Search in the .docx content, and found it in results.

    Thread Starter yjajkiew

    (@yjajkiew)

    Thanks for looking into it.

    The problem occurs when the document was uploaded before checking the “application/vnd.openxmlformats-officedocument.wordprocessingml.document” checkbox in tab 2.2 Data, then I don’t see a document remaining to be indexed.

    I suppose I do have to re-index everything but it’s not an optimal solution.

    Exact.

    When you update 2.2 settings, there is a probability that you need to reindex everything. Because you then potentially changed the data structure in Apache Solr.

Viewing 3 replies - 1 through 3 (of 3 total)
  • The topic ‘Word "docx" document not indexed’ is closed to new replies.