I'm developing a plugin and I'm having some problems with character sets.
When the user (who writes in Thai) publishes a post on his blog, it looks fine. But if I take the raw post from the database (using $post->post_content), it comes out as a mix of accented European characters and symbols (see http://img255.imageshack.us/img255/84/newscreenm.jpg ). Oddly enough, the title displays fine, and I obtain it using get_the_title($postid), which implies that it's something specific to the way the content of the post is stored.
So all I can assume is that WordPress is putting the raw post content through something that does character conversion, but for the life of me I can't find it.
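To show the kind of garbling I mean, here is a minimal standalone sketch (plain PHP, not WordPress code). It assumes the symptom comes from UTF-8 bytes being read as ISO-8859-1 somewhere along the way, which is only my guess and not something I have confirmed. The sample word and the variable names are made up for the illustration, and it needs the mbstring extension.

```php
<?php
// Hypothetical sample text: a short Thai word, written with code point
// escapes so the encoding of this file does not matter (PHP 7+).
$thai = "\u{0E2A}\u{0E27}\u{0E31}\u{0E2A}\u{0E14}\u{0E35}";

// Assumption: something treats the UTF-8 bytes as ISO-8859-1 and
// re-encodes them as UTF-8. Each Thai character is 3 bytes in UTF-8,
// so it becomes 3 separate Latin-1 characters.
$garbled = mb_convert_encoding($thai, 'UTF-8', 'ISO-8859-1');

// Applying the conversion in the opposite direction recovers the bytes.
$restored = mb_convert_encoding($garbled, 'ISO-8859-1', 'UTF-8');

echo bin2hex($thai), "\n";    // raw UTF-8 bytes of the original text
echo $garbled, "\n";          // accented letters and symbols
echo mb_strlen($thai, 'UTF-8'), ' -> ', mb_strlen($garbled, 'UTF-8'), "\n";
var_dump($restored === $thai);
```

If the text I get from $post->post_content looks like $garbled here, that would at least suggest the bytes in the database are intact and are just being interpreted with the wrong character set at some point.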