I have thousands of e-mail announcement posts in mbox format (actually they are in listserv archive format and need to be converted first with an existing script). I'd like it import them into WordPress as cleaned up posts. My mbox files are simply posts from one author with no comments.
A key feature would involve designing a slightly intelligent script that would allow you to define multiple text elements that are removed (footers in particular), text elements that are replaced (like my e-mail address in the body of messages with a web form link instead), and something that would remove most hard returns but not on certain lines with defined characteristics (like those starting with a dash or fewer than X characters long.
While I could pay someone to simply do this behind the scenes, I thought I'd first put it to the community and see if anyone wants to build and release a plug-in that would do this for others.
Send me a bid on what you'd charge to write a working plug-in: clift @ publicus .net
If we do this, we well test it with my "e-democracy" posts going back to 1998 before releasing. See bottom: http://www.dowire.org/wiki/Newswire
My WP blog: http://dowire.org/notes