Thoth's Suggested Tags

Description

Thoth is a plugin that creates a widget in your “New Post” authoring page and recommends tags for your post based on the content.

Thoth scans the text for candidate tags and assigns each one a “tag strength”, an integer that represents how appropriate the tag is to recommend for the post. This value is determined by the number of words in the tag, its frequency in the post, and its count in the WordPress database (the number of times it has been used as a tag on other posts).

How It Works

Every time the user saves a draft or updates a post, Thoth:

  • Splits the post content into chunks delimited by stop-characters.

  • Uses a simple k-mer counting algorithm to record every possible phrase in the content and its frequency.

  • Does post-processing on the phrase list and displays the final list as a tag cloud. For more information consult the “Features” section.
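
The first two steps above can be sketched roughly as follows. This is an illustrative Python sketch, not the plugin's own PHP code. The stop-character set, the function names `split_chunks` and `count_phrases`, and the `max_words` limit of 3 are all assumptions made for the example.

```python
import re
from collections import Counter

# Assumed stop-characters; the plugin's actual list is not given here.
STOP_CHARS = r'[.,;:!?()\[\]"\n]+'

def split_chunks(text):
    """Split text into chunks at stop-characters, dropping empty chunks."""
    return [c.strip() for c in re.split(STOP_CHARS, text) if c.strip()]

def count_phrases(text, max_words=3):
    """Count every run of 1..max_words consecutive words inside each chunk."""
    counts = Counter()
    for chunk in split_chunks(text):
        words = chunk.split()
        for k in range(1, max_words + 1):
            # Slide a window of k words across the chunk (k-mer counting on words).
            for i in range(len(words) - k + 1):
                counts[" ".join(words[i:i + k])] += 1
    return counts

counts = count_phrases(
    "Windows Vista is slow. I upgraded Windows Vista, then left Windows."
)
```

Because phrases are counted per chunk, a phrase never spans a stop-character: in the sample text "Windows Vista" is counted twice, while "Vista then" is never recorded.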

Features

  • Tag strength ($strength in the code) is an integer representing the likelihood of the tag being appropriate to the post. A tag’s strength is initially determined by its frequency in the post.

  • Stop Words – Filters out phrases beginning or ending with any words in the stop-word list.

  • Tag length – Tag strength is multiplied by the number of words in the tag, with a maximum of 3. This means that longer recurring phrases are ranked higher than shorter ones.

  • Pluralization – For every potential single-word tag, Thoth adds a plural suffix ‘s’ and searches for matches in the potential tag list. If a match is found, the tag strength of the singular is transferred to the plural version (e.g. “download” becomes “downloads”). If a match is not found, the singular is used.

  • Capitalization – Capitalization is preserved only for words or phrases that are capitalized in every instance in which they appear in the post (i.e. they are treated as proper nouns or proper noun phrases). Otherwise, capitalization is removed and the tag strengths of the capitalized and lowercase forms are combined.

  • Existing tags – Thoth also retrieves all the tags used in your blog and searches for instances of them in the content of the post. In the case of a match, the tag strength is multiplied by 2 and incremented by the number of times that tag has been used in your blog. This means that if your blog has a unifying theme, certain tags are likely to be reused and will enable Thoth to make better suggestions.
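
A rough illustration of how the stop-word, tag-length, pluralization and existing-tag rules might combine into a single strength value is given below. It is a Python sketch under stated assumptions rather than the plugin's PHP implementation. The sample stop-word list, the function names, the order in which the rules are applied, and the choice to add the singular's strength to the plural's are all hypothetical. Capitalization handling is left out to keep the example short.

```python
# Assumed sample stop-word list; the plugin ships its own, longer list.
STOP_WORDS = {"the", "a", "of", "and", "is"}

def has_stop_word_edge(phrase):
    """True if the phrase begins or ends with a stop word (or is empty)."""
    words = phrase.lower().split()
    if not words:
        return True
    return words[0] in STOP_WORDS or words[-1] in STOP_WORDS

def merge_plurals(strengths):
    """Move a single-word tag's strength onto its 's' plural when both exist."""
    merged = dict(strengths)
    for tag in strengths:
        plural = tag + "s"
        if " " not in tag and tag in merged and plural in merged:
            merged[plural] += merged.pop(tag)
    return merged

def score(counts, blog_tag_usage):
    """counts: phrase -> frequency in the post.
    blog_tag_usage: existing blog tag -> number of times it has been used."""
    strengths = {}
    for phrase, freq in counts.items():
        if has_stop_word_edge(phrase):
            continue
        # Frequency times word count, with the multiplier capped at 3.
        strength = freq * min(len(phrase.split()), 3)
        if phrase in blog_tag_usage:
            # Existing tag: double the strength, then add its usage count.
            strength = strength * 2 + blog_tag_usage[phrase]
        strengths[phrase] = strength
    return merge_plurals(strengths)

result = score(
    {"Windows Vista": 2, "download": 3, "downloads": 1, "the download": 2},
    {"downloads": 5},
)
```

With these inputs, "the download" is filtered out by the stop-word rule, "Windows Vista" scores 2 × 2 = 4, and "downloads" scores 1 × 2 + 5 = 7 before absorbing the 3 points of the singular "download", for a final strength of 10.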

Screenshots

  • screenshot-1.png

Installation

  1. Upload the thoth-tag-suggestions folder to your wp-content/plugins folder.
  2. Go to the “Plugins” administration panel.
  3. Activate Thoth’s Suggested Tags.

Thoth is now with you.

Contributors & Developers

“Thoth's Suggested Tags” is open source software.

Changelog

1.3 (July 26, 2010)

  • Reinforced stop-words list to better filter irrelevant tags

  • Fixed a bug where the database checking for tags was done on an empty database (this only occurred in WordPress MU)

1.2 (July 21, 2010)

  • Implemented capitalization check so that any capitalized word that also appears in the post in lowercase is automatically converted to lowercase (since it cannot be a proper noun).

  • Added an asterisk indicator at the end of tags that were matched to the database.

1.1 (July 13, 2010)

  • Added support for proper nouns and proper phrases using capital letters (e.g. “Grapes of Wrath” is a proper phrase and gets a higher recommendation).

  • Added a “video” and “audio” tag recommendation if any embedded video or audio files are detected in the post.

  • Implemented a check for duplicate tags. Previously, (for example) the tag “Windows Vista” with a strength of 6 would also generate tags “Windows” and “Vista” both with strengths of 6. This redundancy is now removed; tags will only be counted when they are not contained in another already recommended tag.

  • Reworked the entire post-processing code structure for better efficiency and readability.

  • Set default meta-box position to the sidebar
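
The duplicate-tag check described in this release can be illustrated with a short Python sketch. It is not the plugin's PHP code: the function names are hypothetical, and it assumes that a shorter tag is dropped whenever its words appear as a consecutive run inside a longer tag that has already been kept, regardless of strength.

```python
def is_subphrase(words, other):
    """True if the list `words` occurs as a consecutive run inside `other`."""
    n = len(words)
    return any(other[i:i + n] == words for i in range(len(other) - n + 1))

def drop_contained_tags(strengths):
    """Keep a tag only if it is not contained in an already kept tag."""
    kept = {}
    # Visit longer tags first so shorter ones are compared against them.
    for tag in sorted(strengths, key=lambda t: len(t.split()), reverse=True):
        words = tag.split()
        if not any(is_subphrase(words, other.split()) for other in kept):
            kept[tag] = strengths[tag]
    return kept

tags = drop_contained_tags(
    {"Windows Vista": 6, "Windows": 6, "Vista": 6, "Linux": 2}
)
```

Matching is done on whole words rather than raw substrings, so a tag such as "Win" would not be treated as part of "Windows Vista".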

1.0 (July 1, 2010)

  • Implemented tag cloud output from array of recommended strings.

  • Cleaned up code

  • Improved comments

0.8 (June 30, 2010)

  • Suggested tags are now hyperlinks that call an external JavaScript function to add the tag to the input and click the “add tag” button.

0.7 (June 28, 2010)

  • Implemented pluralizing, i.e. adding an ‘s’ suffix to each word and checking for matches in existing tag list

0.5 (June 24, 2010)

  • Implemented removal of all phrases beginning with stop words

  • Removal of phrases beginning or ending with stop words

  • Removal of phrases that appear only once.

  • Now splits text on stop-chars instead of stop-words (much better).

0.4 (June 23, 2010)

  • Added list of stop words

  • Splits the text based on stop words

  • Implemented phrase frequency counter based on k-mer counting algorithm

0.3 (June 21, 2010)

  • Implemented word frequency counter for single words

  • Suggest tags if they exist in both the database and the post content

0.2 (June 20, 2010)

  • Used the WordPress add_meta_box() function to display the widget on the ‘new_post’ page.

0.1 (June 19, 2010)

  • Downloaded WordPress and installed dummy plugin file