Last year, I wrote about Linked Data projects in the libraries area and summarized their main points. That list included:
- The Library of Congress Subject Headings (LCSH)
- The Swedish Union Catalogue (LIBRIS)
- RAMEAU Subject Headings
- Dewey Decimal Classification (DDC) Summaries
- Virtual International Authority File (VIAF)
Meanwhile further projects have started and since I will give another talk on Linked Data in the libraries area at ODOK2010, it is time for an update. I used the recent listing of Datasets in the next LOD Cloud as entry point of my search for relevant projects.
The Library of Congress Authorities and Vocabularies
It seems that the Subject Headings were only the beginning of the LOC Linked Data activities. Meanwhile the LoC has extended its authority and vocabulary service by the Thesaurus of Graphic Materials, the MARC Code List for Relators, and some other smaller vocabularies. The LoC Authority and Vocabularies service uses SKOS to express concepts and follows the Linked Data principles in order to “enable both humans and machines to programmatically access authority data at the Library of Congress via URIs“.
The About page gives some more details about the Linked Data aspect of this service. Interesting points are:
- the LoC makes available the vocabulary terminology system as as public domain data set
- the LoC also plans to extend it service with further vocabularies (e.g., MARC Geographic Area Codes, MARC Language Codes, etc)
- the motivation for LoC to implement the Linked Data approach: provenance for data values (through the use of loc.gov URIs); simpler download mechanism; the opportunity to convey best practices and lessons learned in the area of Linked Data and libraries.
Gemeinsame Normdatei (GND)
The Gemeinsame Normdatei (GND) unifies the following German authority files, which are currently used in descriptions of cultural and scientific resources for authority control and subject indexing in German, Austrian, and Swiss libraries and library networks:
- Personennormdatei (Personal Name Authority File) (PND)
- Gemeinsame Körperschaftsdatei (Corporate Body Authority File) (GKD)
- Subject Headings Authority File (SWD)
- Uniform Title File for Musical Works (EST)
The GND is hosted by the German National Library (DNB) and documentation about its Linked Data services is available at: https://wiki.d-nb.de/display/LDS.
Here is an example for the entry “Bertolt Brecht” in the GKD: http://d-nb.info/gnd/118514768. Adding an “about” suffix to the URI (e.g., http://d-nb.info/gnd/118514768/about) returns the corresponding RDF/XML representation.
The DNB Linked Data service also exposes approx. 50.000 classes and 110 groups of the German DDC.
The Open Library
The Open Library project has the ambitious goal to create one Web page for every book. At the moment they have gathered around 20 million records and provide a RESTful API for accessing and modifying library objects.
Because of the RESTful system architecture each book (e.g., http://openlibrary.org/works/OL13621971W) and each author (http://openlibrary.org/authors/OL6479608A) is identified by a unique URI. Dereferencing these URIs with the suffix .rdf (http://openlibrary.org/works/OL13621971W.rdf) returns the corresponding RDF representation.
The Open Library uses concepts from the DDC and terms from the Library of Congress for book indexing and classification. These terms are the moment plain literals and not dereferencable URIs. The Open Library also maintains its own subject heading system, which includes Subjects, Places, People, and Times. Each entry already has its own URI (e.g., http://openlibrary.org/subjects/gold_discoveries) but is, at the moment, not available in RDF or linked to other subject heading systems.
Hungarian National Library (NSZL) catalog
The OPAC catalogue and authority data of the Hungarian National Library is already exposed as Linked Data on the Web. It provides access to bibliographic data (e.g., http://nektar.oszk.hu/en/manifestation/2645471), name authority entries (e.g., http://nektar.oszk.hu/auth/33589), as well as subject terms (http://nektar.oszk.hu/auth/magyar_irodalom) and geographical names.
It seems that content negotiation is not supported at the moment. The RDF representations can be accessed by appending “data” to the URI paths, e.g., http://nektar.oszk.hu/data/manifestation/2645471. A SPARQL endpoint is available at: http://setaria.oszk.hu/joseki/index.html
Standard-Thesaurus Wirtschaft
The ZBW maintains the STW Thesaurus for Economics and exposes an RDF(a) representation of the 6,000 standardized subject headings and about 18,000 entry terms on the Web. The concepts are expressed in SKOS and have persistent URI identifiers for bookmarking and linking, e.g., http://zbw.eu/stw/descriptor/10031-4.
Short side-note: this is a really great service because it nicely combines the human- and machine-readable aspects of Linked Data. I would say it is a good reference for similar services in other institutions.
MARC Codes List
This source exposes the MARC codes for Geographic Areas, Languages, Countries, and Musical Genres as Linked Data on the Web.
Other Linked Data Services
- The Mannheim University Library has also started a first beta Linked Data Service.
- TheSoz Thesaurus for the Social Sciences exposes about 11,600 concepts of the social science discipline as Linked Data.