You are here

structured vocabularies

Facet analysis as the theoretical basis of vocabulary tool construction

Facet analysis remains virtually alone as a rational intellectual method for the construction of subject terminologies. Developed for the linear arrangement of items in libraries, the rigour of its analytical approach, and the clear delineation of relationships between concepts in a subject domain make it an excellent contender for the management of subject content in a digital environment. The robustness of the methodology is revealed in the way in which a compatible classification and thesaurus can be derived from the same source data where the structure of the terminology is marked up for machine manipulation. Experience with the Bliss Bibliographic Classification 2nd edition (BC2) shows how the underlying structure of these different formats is semantically similar, and that all the structural components of the terminology can be expressed in a machine readable manner. The systematic conceptual structure suitable for document categorization can be translated relatively easily to a language based metadata tool more appropriate for document description and tagging. To a large extent software can also affect the (human) readable output of the different types of tool, and the conversion between them. Vocabulary control in the narrower sense does, however, present some problems for a process developed for the conceptual representation of knowledge, and needs addressing. Insofar as the faceted terminology is a surrogate for the subject domain itself, a faceted system supports a very wide range of inter-concept relationships and provides an effective tool for browsing and navigation as well as query formulation and modification. However, not all of these potential relationships are currently expressible in the standards for subject representation, either those for bibliographic use, or for web representation. Current work on BC2 is examining the way in which mark-up languages can be used not only to create a version of the classification for dissemination on the web, but also to represent the potential complexity of its semantic structure. Candidate systems such as SKOS (Simple Knowledge Organization System) are not presently able to handle more than the simplest structures in a faceted terminology, and ways in which the range of relationships can be extended offers a substantial challenge. Co-operative work with scholars in the area of humanities computing suggests that existing techniques for the mark-up of texts, to support internal analysis and content representation, have much in common with facet analysis as an approach to the comparable structuring of metadata. In combination these methods may offer a solution to improving the usability of metadata tools and providing more subtle and sophisticated means of subject representation.

Presentation Type: 
Talk
Language: 
English
Presenter: 
Challenges and opportunities for KOS standards

BS 8723: Structured vocabularies for information retrieval – A guide to this new standard was offered and key questions were asked about it concerning challenges and opportunities. „Should we think of BS 8723-2 as needing rigorous implementation, or is it more of a springboard from which to launch new ideas? Facet analysis practice as developed by the CRG – will it come of age, or has the genie already left the bottle? „Is it useful/practical to apply a tight definition to “taxonomy”? Which offers greater benefit: one exchange format for all applications, or a choice of formats, supported by transforms between them? „ Format extension: should it reach towards ontologies, or towards classification schemes, or what? „And what about folksonomies – is there any benefit from considering them at all in BS 8723 or ISO 25964?

Presentation Type: 
Talk
Language: 
English
Presentation Audio: 
Presentation Visual: 
Subscribe to structured vocabularies