Language Technology Linked Open Data at Språkbanken - LTLOD@SB

The nonprofit organization ConceptCoding.Org has been appointed by DART to extend the multilingual and multi-modal lexical ontology resources of the CCF so that they can be made available in Linked Open Data (LOD) format as part of the LTLOD@SB project: http://spraakbanken.gu.se/swe/ltlodsb.

The goal of the project, coordinated by Språkbanken (the Swedish “Language Bank”) and run in cooperation with DART, is to make the lexical resources of Språkbanken and the CCF available as Linked Open Data (LOD) as well. This will make them more accessible and useful to more parties, so that the substantial work already invested becomes more widely applicable. The formats chosen, RDF (Resource Description Framework) and LEMON (LExicon Model for ONtologies), are intended to ensure the widest possible interoperability with the growing range of corresponding European and other international lexical resources.

From the perspective of ConceptCoding.Org this is a very welcome and appreciated initiative. It helps us continue to develop the quality and usefulness of the free CCF resources. In connection with this work we will strengthen the linkage between the CCF multilingual and multi-modal resources and the mainstream lexical resources of Språkbanken (including SALDO, LEXIN and others), as well as other international resources such as Princeton WordNet and FrameNet.

The data content of the CCF in RDF format is available here

A SPARQL endpoint will also be available soon.

The following SPARQL query returns entities that have to do with coffee. Certain characters in the query must be percent-encoded.

curl --digest --verbose --url "http://lod.conceptcoding.org:8001/sparql/?query=PREFIX ccf%3A %3Chttp%3A//www.conceptcoding.org/Ontologies/2014/07/ccf-owl-ns%23%3E PREFIX ccfen%3A %3Chttp%3A//www.conceptcoding.org/ontologies/2011/09/01/BaseReferenceOntology%23%3E SELECT * WHERE %7B ?s ccf%3ArepresentationData ?o . ?s ccf%3ArepresentationType ccfen%3Aen . FILTER regex(?o, %22coffee%22, %22i%22) %7D LIMIT 10"
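
Encoding the query by hand is error-prone, so the following is a minimal sketch of how the same request URL could be built programmatically. It uses only the Python standard library. The function name build_sparql_url is an illustrative assumption, and the sketch only constructs the URL; it does not contact the endpoint or handle authentication.

```python
from urllib.parse import urlencode

# Endpoint and query text are taken from the curl example above.
ENDPOINT = "http://lod.conceptcoding.org:8001/sparql/"

QUERY = """\
PREFIX ccf: <http://www.conceptcoding.org/Ontologies/2014/07/ccf-owl-ns#>
PREFIX ccfen: <http://www.conceptcoding.org/ontologies/2011/09/01/BaseReferenceOntology#>
SELECT * WHERE {
  ?s ccf:representationData ?o .
  ?s ccf:representationType ccfen:en .
  FILTER regex(?o, "coffee", "i")
} LIMIT 10"""


def build_sparql_url(endpoint: str, query: str) -> str:
    """Return a GET URL with the query percent-encoded in the 'query' parameter.

    Hypothetical helper for illustration; urlencode escapes every reserved
    character (spaces, '<', '>', '#', '{', '}', '?', quotes and so on).
    """
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    print(build_sparql_url(ENDPOINT, QUERY))
```

The printed URL can be passed to curl or any HTTP client. Note that urlencode encodes more characters than the hand-written example does (spaces become "+", for instance), which is equally valid in a query string.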

Ongoing versioning of CCF datasets

The versioning strategy has changed from releasing everything anew every other year to a more incremental update process. This approach was investigated earlier, and changes were made to follow the OWL mechanism for importing dependent ontologies. The change will take effect with the next release of the online resource library and is currently being regression tested. Each ontology statement contains a reference to an imported ontology, so that searches can traverse back over older datasets. Searches will not be affected from a client perspective. The only noticeable change is that a search may now refer to more ontologies than before. The naming strategy implies that a new ontology can be released whenever enough new symbols and codes are available for the community. The major impact of this change is that additions, deletions and changes are handled locally in a particular ontology version rather than across the whole dataset.
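
To illustrate how a search could traverse back over older datasets through the import references, here is a minimal sketch. It assumes that each ontology version imports at most one earlier version. The example.org IRIs, the IMPORTS table and the import_chain function are hypothetical and do not reflect the actual CCF naming scheme.

```python
from typing import Dict, List, Optional

# Hypothetical version IRIs: each ontology maps to the one it imports.
# None marks the oldest version, which imports nothing.
IMPORTS: Dict[str, Optional[str]] = {
    "http://example.org/ccf/v3": "http://example.org/ccf/v2",
    "http://example.org/ccf/v2": "http://example.org/ccf/v1",
    "http://example.org/ccf/v1": None,
}


def import_chain(start: str, imports: Dict[str, Optional[str]]) -> List[str]:
    """Follow import references from the newest ontology back to the oldest."""
    chain: List[str] = []
    seen = set()
    current: Optional[str] = start
    # Stop at the end of the chain, or if a reference loops back on itself.
    while current is not None and current not in seen:
        chain.append(current)
        seen.add(current)
        current = imports.get(current)
    return chain


if __name__ == "__main__":
    for iri in import_chain("http://example.org/ccf/v3", IMPORTS):
        print(iri)
```

A client that starts from the newest version would visit every dataset in the chain, which is why a search may touch more ontologies than before while its interface stays the same.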