I came across your WPSearch, which is using Lucene, and this post:
http://www.kapustabrothers.com/2008/01/20/indexing-pdf-documents-with-zend_search_lucene/
So I thought: is this an option which can be build in your indexing engine?
I sure hope so, that would make a killer combo I think.