Monday, May 11, 2015

Taxonomist Survey

I had created a survey of taxonomists to gather some information for writing my book, The Accidental Taxonomist. It was mainly for Chapter 2: Who Are Taxonomists?  With the word “taxonomist” in the title, I had to write something about taxonomists, and not just about taxonomies, and this was the best way I could get more information than some anecdotes from colleagues.

But that was in late 2008, 6½ years ago. Has there been change in the industry since? In most fields, 6-7 years is not long at all, but in field of taxonomies, there could be changes. First of all, there have been significant changes in the economy over that particular period (recession and partial recovery), and, at least for internal, enterprise taxonomies, the role of the taxonomist could be considered something expendable in tight economic times. (I know, as I was laid off in 2008 and again in 2010.) More significantly, the field of information science is evolving very rapidly. So, I released a new survey this month.

My previous survey had 9 multiple choice questions and one open response. I chose to keep those questions with no changes or only minor wording changes, in order to compare the changes over time. I also decided to add a few more questions. To help me come up with the questions, I asked for input from an audience of presentation I have last month ("Taxonomy Displays: Bridging UX & Taxonomy Design" at the Content Strategy Seattle Meetup. Suggestions from that group included questions on the size of taxonomies, job titles, and taxonomy work pain points. The current survey now has 14 multiple-choice questions, one very short answer (job title), and three open responses, although all questions are optional, and it is permitted to skip questions.

Where to find taxonomists to survey


In 2008, I could think of only one logical channel to find taxonomists, the Yahoo group called Taxonomy Community of Practice. But it is no longer the only group and no longer the most active. The Taxonomy Community of Practice Yahoo group averaged only 5 messages per month in the last 6 months. In contrast, the 6 months around the time of my last survey, this group average 39 message per month. This is most likely because the LinkedIn group of the same name, Taxonomy Community of Practice, which was created in September 2007, has taken over the most of the taxonomy discussions.  Furthermore, there are additional LinkedIn groups, such as “Controlled Vocabularies”  and “Thesaurus Professionals.” The American Society for Indexing started a Taxonomies & Controlled Vocabularies Special Interest Group in late 2007, and SLA (Special Libraries Association) started a TaxonomyDivision in 2009, both of which have member discussion lists.

I have announced the current survey in all of these groups and more. However, I do not expect to reach significantly more taxonomists than before. That’s because, whereas the single Yahoo group back in 2008 tended to be subscribed to by email (individual or digest), the proliferation of groups and lists of similar or overlapping subjects has led to subscribers/members to opt out of direct emails. Additionally, email software, such as Gmail, can filter messages from lists to a category/tab that users may choose to overlook. So, my email announcements of the survey to groups may go unnoticed by many group members. It would be tempting to individually contact everyone I know personally who is involved in taxonomy work, but that could be a personal bias that would skew the pool of respondents.

Taxonomist tendencies


There have already been enough respondents to the current survey, that I can safely say that the largest number do taxonomy work as their primary responsibility, as with the previous survey, and that, like before, the majority are employees, rather than contractors, freelancers, or independent consultants. The most common educational or professional background (although not the majority) is library/information science. What is striking, though, is that despite the fact that 48% of respondents in 2008 had an MLS/MLIS degree (and from the early survey returns, the percentage is even slightly higher), only a small percentage of taxonomists learned taxonomy skills through formal educational institution coursework. Self-taught through reading, on-the-job experience, and on-the-job training, and conference workshops or seminars are each methods of learning taxonomies that are more prevalent than college courses. Additional, more specific comparisons will be the subject of a future blog post.






Saturday, April 25, 2015

Trends in Hierarchical Taxonomy Displays


Taxonomies connect users to content. So, how a taxonomy is displayed to users is very important in its effectiveness. This is a topic about which I gave a conference presentation back in 2011 and will present again next week. As I update my previous presentation, looking at some of the same public websites with taxonomies, I have observed some changes that might be considered as trends.

While faceted taxonomies (used to filter/refine/limit results by certain criteria with choices of taxonomy terms) have become more common on ecommerce or other database websites, they are not suitable in all circumstances, and when a taxonomy has a large number of topical terms, a hierarchical arrangement of those topics might be better.

Displayed full hierarchical taxonomies, however are more difficult to find. They are not as often the default.  Some have disappeared entirely such as the Yahoo directory, which was discontinued in December 2014 after 20 years. (Admittedly, trying to classify as many websites as possible into a hierarchy, as the web keeps growing, is a never ending task.) In other cases, the search box is more prominent on the page, and the link browse categories needs to be hunted for.

In the past, I had observed two main different kinds of hierarchical displays: one-level-per-page and expandable hierarchies with plus signs. The first has evolved, the second is has become rare, and a third method has emerged.

One level of taxonomy hierarchy per page was the design of the former Yahoo directory and had been early on the style followed on other sites. An example that closely follows the Yahoo Directory, is the dmoz/Open Directory Project. A list of category labels or topics at each level takes up the entire screen/page display, without the display of other content. Displaying additional content on every page has become important, so hierarchical taxonomy categories now tend to be confined to more compact lists to free up space on the web page for content. This works for some taxonomies, not all. Meanwhile, a list of terms at the same level that take up the entire page is a style that is rarely followed anymore.

Expandable hierarchy “trees,” typically with plus signs next to topics to expand a topic’s subcategories has become quite rare, at least in public web sites. An example are the USA Today topics. This hierarchical taxonomy design had been developed based on the recognizable desktop file folder structure, such as in Windows. In the meantime, users have become familiar with different representations of topic hierarchies on the web, so mimicking expandable file menus is no longer the only way to engage users. Expandable topic hierarchies are not as easy to update and change on websites and, it can take a long time to load the web page. Expandable hierarchies allow the users to have more than one hierarchical level expanded at once, which facilitates exploring the taxonomy. As much as we taxonomists might enjoy browsing a taxonomy, the goal is to get users to content rather than have them spend time exploring the taxonomy.

A third method of displaying multiple levels of a hierarchical taxonomy is through “fly-out” subcategory lists. Examples include Lynda.com (under "Browse the Library") and Books & Authors. I had not noticed this method before, so it seems to be a new trend. They are similar to submenus in website navigation, but rather than for website navigation, the topics are linked to indexed content items, which are listed in a result set for each subtopic. Fly-out subcategories allow the users to still see the parent category list, if the user wanted to back out to it, like in an expandable tree hierarchy. But unlike an expandable tree hierarchy, you cannot have multiple parent categories expanded at the same time, which is not that important anyway. The fly-out subcategory style is thus a positive trend in hierarchical taxonomy displays.

Tuesday, March 31, 2015

Varied Taxonomy Uses and Taxonomist Functions

Someone asked me recently if taxonomies were applicable to some marketing analytics he was pondering. I was not sure without further discussion. The interesting thing about taxonomies is that they have such varied uses.  Perhaps because there is no single dominant use of taxonomies, taxonomists have to go into long explanations of how taxonomies are beneficial. There is no neat list of taxonomy uses. Following are some broad categories of taxonomy usage, all but the last of which, I have worked on.
  • A key component of a product of published information for retrieval (such as in a news, periodical article, or reference database)
  • A (partial) solution to an information management problem of an organization
  • A method to connect customers to products or services, typically on a website
  • A method to connect users to information on a public information-sharing or networking website (monetized by advertising or other means)
  • As descriptive metadata in a document management, content management, records management, or digital asset management system, to support tagging and subsequently support retrieval of internal content.
  • A method to model data, information, or knowledge to serve an organization’s knowledge management strategy

Sometimes more than one of the goals may be pursued simultaneously by the same owner of the taxonomy. This is when it gets complicated, and it needs to be carefully considered whether a single taxonomy or separate taxonomies would be best.

Building up a clear list of the applications of taxonomies, not something in marketing-speak, and more specific than the areas listed above, would be a worthwhile service of the websites of taxonomy consultants and taxonomist-related professional organizations.

Taxonomy consultants need to ask from the start whether the taxonomy project they are hired to work on will be primarily for internal or external access, and not make assumptions.  It could be for both, but usually one purpose is seen as primary. Once, in my earlier days of consulting I made an assumption, and my proposal for an “enterprise taxonomy” was even accepted by the client, before I realized that their taxonomy would be primarily for public web content.

Varied taxonomist job functional areas


Just as taxonomies may have varied uses, so the functions of a taxonomist are varied. One interesting aspect about the taxonomy field, and taxonomy consulting in particular, is that transcends both internal (employee facing) and external (customer or public facing) functions of an organization. I have personally found this a very interesting aspect of the profession.

Taxonomists who are employed may work in various different departments of an organization. As such, taxonomists could find themselves either part of internal functioning groups (knowledge management, content management, information technology) or external-oriented groups (marketing and related web services).  I have worked in the organizational departments of editorial, software product development, information technology (as it was overseeing the SharePoint implementation), and consulting services, all of which while in the role of a taxonomist.  Additionally, I have seen taxonomist job postings in departments of marketing, ecommerce, communications, libraries, data governance, financial service operations, information management and technology, and the Information Management and Tech Writing department.

In any organization where one or more taxonomists are employed within a specific department, there are likely taxonomy-related needs in other departments. It would be beneficial to the organization if the taxonomists’ skills could be applied to special taxonomy-related projects outside their home department, such as across both marketing and information management.

Thursday, February 26, 2015

How to Find a Taxonomist

The need for well designed taxonomies is growing, and the number of people becoming skilled in designing and creating taxonomies is growing. But finding the right taxonomist at the right time and place is still difficult, whether contract/temporary or permanent. If you are looking to hire a taxonomist, these are some places where you might look.

Recruiting/staffing firms


The taxonomy profession is not well defined, as it is for engineers, accountants, librarians, etc. It’s still too small and narrower to support recruiting/staffing firms that are specialized in the taxonomy field, in contrast to other professions. Yet it is also an interdisciplinary, cross-functional field, which does not fit nicely within another broader professional. Sometimes taxonomists are considered a type of librarian/information scientist and their recruitment falls to staffing firms in that field. Yet other times taxonomists are considered adjuncts to the profession of content management and their recruitment falls to information technology staffing firms. They may also be considered a part of knowledge management, so they could come under the interest of more general management recruiters. These splits makes it difficult for the hiring firm to select a recruiting partner. If possible, an employer should contract with multiple recruitment firms in different areas of specialty, such as library science, information technology, and knowledge/business management. Be aware that some qualified taxonomists are looking for only temporary contract work, and some are looking for primarily full-time permanent work.

Taxonomy is so specialized that even recruiters who deal with the field might do so rarely, and thus they don’t really know what taxonomists do. As a hiring manager, you will need to educate the recruiter about the details of the job responsibilities and qualifications.

Social media


There is no doubt that LinkedIn is an excellent way to search for professionals, but its effectiveness depends on the size of your own network. Join the Taxonomy Community of Practice LinkedIn group  and other similar groups to expend your network to include taxonomists. It’s very easy to search on “taxonomist” or “taxonomy” in people’s titles. Keep in mind that only a fraction of the taxonomists out there are available for new work.

You can also find professional taxonomists who have given taxonomy-related presentations and posted them on SlideShare. SlideShare covers all topics, so when searching on “taxonomies” make sure to skip over Bloom’s taxonomy (educational outcomes) and anything biological. Of taxonomists who post slides, a high percentage of them tend to be consultants available for projects, as they know to use SlideShare to promote themselves.

Finally, there is a Yahoo group, Taxonomy jobs, which is a simple distribution list. If you have a taxonomy job, permanent or contract, that you are trying to fill, send the job description to the listowner: taxonomy-jobs-owner@yahoogroups.com

Professional associations


There is no professional association dedicated to taxonomists, but there are two associations with relevant sections or divisions. SLA (Special Libraries Association), an association of corporate librarians, subject-specialized librarians, and information professionals, has a Taxonomy Division for networking and professional development of its taxonomist-members. Its website does not list taxonomists available for contracting, but the main SLA website does. At http://hq.sla.org/consultonline  you can search for consultants with the area of expertise in “Thesaurus Development.” (Hopefully this will be supplemented or replaced with “Taxonomy Development” in the future.) This consultant database is not well known, even among members, so there are many taxonomist-consultants who are not listed. For a fee, you can also post a job on the SLA website.

The American Society for Indexing (ASI) has a Taxonomies and Controlled Vocabularies Special Interest Group. ASI and this special interest group are much smaller than SLA and its Taxonomy Division, but the overwhelming majority of ASI members are independent contractors. The Taxonomies and Controlled Vocabularies SIG website lists its members who are available for contract taxonomy projects.

If you know someone personally who is a member of either of these organizations but is not available or precisely qualified, you can always ask that person for professional referrals.

Other tips


Because the right taxonomist can be difficult to find, be prepared to hire someone who works remotely with only occasional visits to your office. Similarly with a consultant or consultancy, you should not hesitate to contact a consultant in another part of the country. They are accustom to traveling, they may utilize subcontractors in your area, or they may refer the project to another consultant in your area.

As for myself, I have found taxonomy work through all of these means: recruiters, staffing firms, LinkedIn, SlideShare, and professional association member referrals. I am not looking for taxonomy work now, since I am happily employed (and hence dispensing this advice), but I can also provide referrals to other taxonomists.






Saturday, January 31, 2015

Taxonomy Software Trends



I reviewed various taxonomy/thesaurus management software offerings recently, in preparation for the last of my 3-part webinar series, Practical Taxonomy Creation, and I noticed some trends since I last looked into software in such detail for my book over 5 years ago: more cloud/web-based software, more SKOS/RDF/Semantic web framework software, and more plugins to SharePoint, content management systems, and search engines.

The number of commercial vendors selling taxonomy/thesaurus management software is not significantly different, as some have left the market, and others have entered, and the rest have continued with updates and improvements. There are fewer commercial low-end, inexpensive, single-user desktop offerings, however. Products I have reviewed in the past and that have gone away include Webchoir TCS-10 Personal and Term Tree 2000. The Mac OS program Cognatrix has been unavailable for the past year, although the vendor intends to release it again as an Apple App Store program following the release of the next major version of Mac OS.

Subscription, web-based software


Synaptica pioneered web-based thesaurus management software when it introduced its product in 1995, when the Web was still young, but now other vendors also offer web-based subscription software. Data Harmony Thesaurus Master from Access Innovations was originally only available in a java-based multi-platform client-server installation. For the past several years a web version has also been available, and Access Innovations president Marjorie Hlava said in an email: “Increasingly our customers use the cloud version of the software.” Newer thesaurus management software products to the market have also been solely cloud-based. These include PoolParty, introduced by the Semantic Web Company in 2009, and TopBraid Enterprise Vocabulary Net (EVN), released by TopQuadrant in 2010. Meanwhile Synaptica began offering Synpatica Express, a cloud-computing solution for individuals and smaller businesses. Finally, the long affordable mainstay MultiTes Pro, a Windows-based desktop program that that has been available since 1983 in a single user version and then also for multiple users, introduced a multi-user cloud version about six years ago, which in 2013 was updated and renamed as MultiTes Online.

The cloud-based software offerings are, of course, priced on annual (and in one case, monthly) subscription fees, instead one-time license costs with lesser priced updates. Hopefully this means that more organizations will try out developing a taxonomy in the appropriate tool with the reduced commitment of cost for a shorter time.

SKOS/RDF/OWL Semantic web framework software


Supporting linked data and interoperability with Semantic Web content has become more important. Therefore, World Wide Web consortium (W3C) recommendations, such as the SKOS (Simple Knowledge Organization System) framework, RDF (Resource Description Framework) specifications, and OWL (Web Ontology Language) are being adopted by newer thesaurus/taxonomy software. The newer products, PoolParty and TopBraid EVN are both built around SKOS models. Synaptica and Data Harmony Thesaurus Master have been able to export to a SKOS and OWL schema for a long time, but it was only in 2013 that Data Harmony added user-defined fields to the SKOS export to include all fields in a term record. Additionally, in 2011 Synaptica introduced an Ontology Publishing Suite to publish an ontology or part of an ontology to the Web.

My first criterion for thesaurus management software is that is that in enforces relationship rules in accordance with the ANSI/NISO Z39.19-2005 standards. SKOS is not an alternative standard, but rather a framework that can be followed in addition to ANSI/NISO Z39.19-2005. Ideally a software product complies with both, and some now do.

Plug-ins and connectors for search and content management


The most common software for internal content management (even though it is not really a content management system) is SharePoint. Prior to 2010, SharePoint handled controlled vocabulary metadata in such a simple way (not even in hierarchies) that there was no point in trying to use taxonomies.  Starting with SharePoint 2010, with its Managed Metadata Services, taxonomies can now be utilized in its Term Store. However, despite Term Store improvements from SharePoint 2010 to 2013, it is still far from having the features and capabilities of a dedicated thesaurus management software product. Thus, ideally you create the taxonomy in the dedicated tool and port it over to SharePoint, and now almost all enterprise-level thesaurus management software products have methods to connect to SharePoint, whether through APIs, plug-ins, or dedicate “connector” modules.

There are also increasing numbers of content management systems and search software products being supported by thesaurus management connections. For example, SmartLogic Semaphore Ontology Manager has integrations with a greater number of applications than in the past, including SharePoint, Google Search Appliance, Apache Solr, OpenText, MarkLogic, and IBM Watson. PoolParty has a WordPress plugin, in addition to integrations with SharePoint and Drupal. Surely more such connections will be added, as I have recently heard of requests for taxonomy imports into Drupal.