Layer2 Default Content Classification Engine for SharePoint (V1)
The way that the default
classification engine works is that it creates one joint text body from the
document content (retrieved by IFilters), as well as additional SharePoint
columns, the list or library name, and URL. By default (if no rule is created),
a term will be assigned to a SharePoint item or document if the term label or
synonyms are found in the content.
To increase the precision of the metadata assignment, content classification
rules can be added to a term using the Layer2 Taxonomy Manager. You can define
rules as logical expressions with the following keywords: OR, AND, NOT. In this
way, documents and items can be found that contain some specific required
tokens, but not others. You can also use regular expressions (REGEX) to include
or exclude specific patterns.
Fig.: Adding a content
classification rule to the SharePoint term "jdoe" using the Layer2 Taxonomy
To assign the term “jdoe” to an item you can make use of the
“jdoe” OR “John Doe” OR “J.Doe”
“jdoe” is assigned if the expression returns “True”.