Support/Kitsune/Features/Forum-Data-Clustering

From MozillaWiki

Title

Support forum data clustering

Problem statement/description

We have a great deal of unstructured data on the forums, which makes it harder to identify areas of focus. This is not only a metrics issue; it will also make faceted search difficult. If our forum data is not structured (or is structured differently from our KB articles), our search facets will not be useful.

Measurable outcome

An easy way to identify top issues trending in the forums. Active monitoring and alerts for breaking issues. Ability to serve accurate results from the forums in our search implementation.
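As a rough illustration of the "alerts for breaking issues" outcome, the sketch below flags topics whose question count today is well above their recent daily average. It assumes questions have already been grouped into topics somehow; the function names, the `min_count` floor, and the three-sigma threshold are all hypothetical choices for illustration, not part of Kitsune.

```python
from statistics import mean, pstdev


def is_breaking(history, today, min_count=5, sigmas=3.0):
    """Return True if today's count is a spike relative to past daily counts.

    history: list of daily question counts for one topic (oldest first).
    today:   today's question count for the same topic.
    Both thresholds are assumptions chosen for illustration only.
    """
    # Ignore tiny volumes and topics with no baseline to compare against.
    if today < min_count or not history:
        return False
    baseline = mean(history)
    # Floor the spread at 1.0 so a perfectly flat history does not
    # turn every small increase into an alert.
    spread = max(pstdev(history), 1.0)
    return today > baseline + sigmas * spread


def breaking_topics(daily_counts, today_counts, **thresholds):
    """Return the sorted names of topics that are spiking today."""
    return sorted(
        topic
        for topic, count in today_counts.items()
        if is_breaking(daily_counts.get(topic, []), count, **thresholds)
    )


if __name__ == "__main__":
    # Made-up example data: daily counts per topic, then today's counts.
    past = {"crash": [4, 5, 6, 5], "sync": [10, 12, 11, 9]}
    now = {"crash": 30, "sync": 11}
    print(breaking_topics(past, now))
```

Note that under these assumptions a brand-new topic with no history never triggers an alert; a real monitor would need a separate rule for that case.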

Possible solutions

  • Manual tagging of questions in the AAQ (Ask a Question) flow, based on a predefined set of tags
  • Manual tagging of questions by contributors in the forums, based on a predefined set of tags
  • Clustering through Mechanical Turk
  • Automated clustering based on machine learning
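To make the last option more concrete, here is a minimal sketch of one way automated clustering could work: questions are turned into TF-IDF vectors and grouped with a greedy single-pass ("leader") clustering on cosine similarity. This is only an illustration under stated assumptions, not a method this page commits to; the stop-word list, the `0.3` threshold, and all function names are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for illustration.
STOP_WORDS = {"the", "a", "an", "is", "my", "i", "to", "in", "on",
              "and", "it", "when", "after", "not", "of"}


def tokenize(text):
    """Lowercase the text and return its tokens, minus stop words."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [w for w in words if w not in STOP_WORDS]


def tf_idf_vectors(docs):
    """Return one sparse TF-IDF vector (term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(t for tokens in tokenized for t in set(tokens))
    n = len(docs)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            # Smoothed inverse document frequency, so no weight is zero.
            t: c * (math.log((1 + n) / (1 + doc_freq[t])) + 1)
            for t, c in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster_questions(questions, threshold=0.3):
    """Greedy single-pass clustering.

    Each question joins the first cluster whose leader (its first
    member) is at least `threshold` similar; otherwise it starts a
    new cluster. Returns clusters as lists of question indices.
    """
    vectors = tf_idf_vectors(questions)
    clusters = []
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


if __name__ == "__main__":
    # Made-up example questions.
    sample = [
        "Firefox crashes when I open a new tab",
        "Bookmarks disappeared after update",
        "Firefox crashes on startup",
        "All my bookmarks are gone after the update",
    ]
    for members in cluster_questions(sample):
        print([sample[i] for i in members])
```

The single-pass approach is chosen here only because it is short and needs no fixed number of clusters. Its results depend on input order and on the threshold, so a production system would likely evaluate more robust algorithms against hand-tagged forum data.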