Saturday, September 26, 2020

Adjectives as Terms in Taxonomies

Taxonomies need not follow strict standards, but rather best practices. There are standards for thesauri (ANSI/NISO Z39.19 and ISO 25964), and as taxonomies are similar to thesauri, it’s a good idea to follow thesaurus standards for taxonomy design to the extent applicable. According to thesaurus standards, terms should be nouns or noun phrases, not verbs or adjectives. Similarly, taxonomies usually comprise terms of only nouns or noun phrases. An exception would be in a faceted taxonomy, where there is a facet for a kind of attribute or characteristic, such as color, and there the terms could be adjectives. In this sense, taxonomies are more flexible and have more applications than thesauri do.

Product taxonomies tend to have adjective terms for some of their facets/attributes, including color, size, style, type, status, etc. These kinds of adjectives are reasonably straight-forward, although there may be nuances among colors and styles that are not generally known among the users of the taxonomy. It is rather other, descriptive adjectives that can be more challenging to include in a taxonomy because their meaning tends to be much more subjective than noun-based terms, and thus it’s difficult to tag/index consistently with them. 


I recently did some work on a taxonomy where descriptive adjectives were included in an “attribute descriptor” term set or facet. This was a taxonomy for images, including photographs, illustrations and graphical design components. Adjective terms included Elegant, Formal, Funny, Ornate, Simple, Modern, Vintage, among others. I also filled in the role of tagging for a short period of time and found how subjective it was to tag with such adjective terms. I was not confident that I was tagging with such adjectives in a consistent manner.  While the adjectives might have seemed like a good idea originally, they were not that practical compared to other components of the taxonomy. Fortunately, the attribute descriptors were not displayed to the user as a dynamic facet but rather supported search, so insufficiencies in adjective tagging were not so obvious.

A recent article in Vogue Business described how adjectives in fashion product ecommerce taxonomies are used, such as by Nordstrom, Rebag, and The Yes. These include terms such as Bright, Chic, Whimsical, Flowy, Billowy, Comfortable, etc. I wouldn’t want to try to tag with those. However, in these cases, the tagging was not manual but automated, using algorithms, hundreds of examples, and machine learning. While auto-categorization is not necessarily more correct than manual tagging, it is more consistent, and when it comes to the subjectivity of adjectives, the challenges are more around consistency than correctness. So, I can see that auto-categorization can be a solution to dealing with the challenges of adjective terms.

Now that it’s established that taxonomies can, in certain circumstances, contain adjective terms, the inclusion of adjectives in a taxonomy should be done with care. If you will have adjectives as terms, my recommendations are:

  • Keep them separate from other taxonomy terms, by having them in their own term list, vocabulary, or facet.
  •  Ideally, keep the number of adjective terms limited to a few clearly distinguishable terms.
  • Expect to spend more time and possibly expertise in developing, editing, and maintaining adjective terms than noun-based terms.
  • Consider implementing auto-categorization (auto-tagging), if resources permit it.
  • Whether tagging is manual or automated, prepare multiple examples of assets/content items for each adjective term to demonstrate what is the appropriate content for tagging with each adjective.

A thesaurus is more specific than a taxonomy, as a thesaurus has terms for what content is about. A taxonomy has terms for what content is about but other aspects and attributes of content as well. Thus, a taxonomy may include adjectives, whereas a thesaurus does not. Adjective terms, however, should be created with care and special attention to how they will be used in tagging.

2 comments:

  1. I think that it is possible to provide for concepts such as colour and style while still conforming to thesaurus standards. Colour names can be considered as nouns, as in the Art and Architecture Thesaurus "physical attributes" facet. If necessary styles can be expressed as nouns by using terms such as "whimsical style", "chic style", "rustic style" or "traditional style". Examples of these are included in the AAT "styles and periods" facet (albeit with parentheses which in my opinion are unnecessary).

    I remain unconvinced that it is necessary to invent something called a "taxonomy" for "a thesaurus that doesn't follow the rules"! It is rather restrictive to say:

    "A thesaurus is more specific than a taxonomy, as a thesaurus has terms for what content is about. A taxonomy has terms for what content is about but other aspects and attributes of content as well."

    A thesaurus can certainly be used to describe what something is as well as what it is about, for example in the case of thesauri used to name museum objects and their attributes.

    ReplyDelete
    Replies
    1. Thanks, Leonard, for your comments. It's good to hear from you. Yes, you bring up a good point that attributes can be described in terms that are noun phrases (such as by adding the word "style") rather than adjectives. The Getty AAT Thesaurus is a good example.
      As for my statement comparing a taxonomy and thesaurus, that was just an idea that occurred to me at the time of writing this post. Perhaps I should think about it some more.

      Delete