I have an interesting problem, and wondered if you had any insight on how to attack it.
I have a bunch of tagged images, and a whole bunch of posts coming in. As each post comes in, I would like to pair the post with a related image based on the content of the post and the tags on the images.
I started using and loved YARPP many years ago, and immediately thought of it when this problem came up. I was wondering if you had any insight on how to make sure I get a good match.
My thinking so far is to toss out low value words (the, a, an, and, of, etc.) then go through the rest of the words and find which ones match tags I have used, but that does not seem like it would give the best match. I would need some way of measuring relevance.
I also have the possibility of the image being tagged "trees" and the post mentioning forest, but I can just tag better, unless there is a dictionary I can work against to make these associations.
I will be grateful for any related code to learn from, other suggested reading, or general thoughts you can provide.