Latin WordNet 2.0 API

Overview

The core of the Latin WordNet consists of four data types: lemmas, synsets, relations, and semfields. From one point of view, lemmas 'possess' synsets, in that words have different referential senses corresponding to discrete concepts. From another, synsets 'include' lemmas, in that a concept can be referred to by different words. This is similar to the distinction between semasiology and onomasiology in structural linguistics. Relations are of two basic kinds, semantic and lexical, and represent linkages of various sorts (see below) between synsets or lemmas. The kinds of relations that can exist between two items depend on the part of speech of the 'source' item. Semfields gather together many semantically related synsets under general conceptual domains, independent of their parts of speech. The WordNet API permits programmatic access to all four data types.

The API is accessed through URLs appended to the WordNet's host address, http://latinwordnet.exeter.ac.uk. Typically, the API will return a list of results, which consist of nested dictionary-like mapping objects.
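As a rough illustration of this access pattern, the following Python sketch joins path segments onto the host address and decodes a response. The helper names `api_url` and `fetch` are hypothetical, and the response is assumed to be JSON, which the text above suggests but does not state outright.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

HOST = "http://latinwordnet.exeter.ac.uk"  # host address given in the text


def api_url(*segments):
    """Join path segments onto the host, percent-encoding each one."""
    # '*' is left unescaped because the API uses it as a wildcard segment.
    path = "/".join(quote(str(segment), safe="*") for segment in segments)
    return f"{HOST}/{path}"


def fetch(url, timeout=10):
    """Request a URL and decode the body, assuming the body is JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

A call such as `fetch(api_url("lemmatize", "reginarum"))` would then be expected to yield a list of dictionaries, if the JSON assumption holds.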

Usage

Index

A complete list of all lemmas presently included in the WordNet, ordered alphabetically and by part of speech, is available through the index. /index returns a (long) list of items with morphological information and a unique resource identification number (URI) keyed to the Linking Latin project for disambiguation. It can be filtered by part of speech ('n', 'v', 'a', 'r') or by morphological class (e.g., 'v1spia--1-' for first conjugation active verbs only). /index/*/, without any morphological specification, is equivalent to /index.

http://latinwordnet.exeter.ac.uk/index/ # complete index
http://latinwordnet.exeter.ac.uk/v/ # only verbs
http://latinwordnet.exeter.ac.uk/*/n-s---mn2-/ # only masculine nouns of the second declension
http://latinwordnet.exeter.ac.uk/n/n-p---nn2-/ # neuter _pluralia tantum_ of the second declension
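The filtering rules can be sketched as a small URL builder. This is only an illustration: `index_url` is a hypothetical helper, and it assumes, following the prose description, that the filters come after the /index segment.

```python
HOST = "http://latinwordnet.exeter.ac.uk"
PARTS_OF_SPEECH = {"n", "v", "a", "r"}  # the four codes listed in the text


def index_url(pos=None, morph=None):
    """Build an index URL, optionally filtered by part of speech and morphology."""
    if pos is not None and pos not in PARTS_OF_SPEECH:
        raise ValueError(f"unknown part of speech: {pos!r}")
    segments = ["index"]
    if morph is not None:
        # A morphological tag needs a part-of-speech slot before it;
        # '*' fills that slot when no part of speech is given.
        segments += [pos or "*", morph]
    elif pos is not None:
        segments.append(pos)
    # Assumption: filters follow the /index segment, as the prose describes.
    return f"{HOST}/" + "/".join(segments) + "/"
```

For instance, `index_url(morph="n-s---mn2-")` fills the part-of-speech slot with `*`, while `index_url()` falls back to the complete index.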
            
        

Lemmas

Detailed information about individual lemmas is available by appending /lemmas to the base API address and then providing filtering arguments that specify the headword and, optionally, the part of speech and morphological tag. If a morphological tag is provided without a part of speech, * must be given in the part-of-speech position.

http://latinwordnet.exeter.ac.uk/virtus/n/ # /virtus would also be acceptable
http://latinwordnet.exeter.ac.uk/dico/v/ # returns two items
http://latinwordnet.exeter.ac.uk/dico/v/v1spia--3-/ # disambiguates from the first conjugation verb
http://latinwordnet.exeter.ac.uk/furor # returns __furor, -ari__ and __furor, -oris__
http://latinwordnet.exeter.ac.uk/furor/*/n-s---mn3-/ # only the noun of this form

For complete disambiguation, it is also possible to access a specific lemma using its Linking Latin URI: /lemmas?uri=.

To see the meanings (synsets) presently assigned to a word, /synsets should be appended to any lemma query.

http://latinwordnet.exeter.ac.uk/sicula/n/synsets

Similarly, a word's lexical relations can be obtained by appending /relations, while its semantic relations are obtainable via its synsets, using /synsets/relations.
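The lemma queries described in this section might be assembled as in the sketch below. `lemma_url` is a hypothetical helper, and it assumes that the headword and filters follow a /lemmas segment, as the prose states, with any sub-resource appended last.

```python
from urllib.parse import quote

HOST = "http://latinwordnet.exeter.ac.uk"
SUBRESOURCES = {"synsets", "relations", "synsets/relations"}


def lemma_url(headword, pos=None, morph=None, resource=None):
    """Build a lemma URL with optional filters and an optional sub-resource."""
    if resource is not None and resource not in SUBRESOURCES:
        raise ValueError(f"unknown sub-resource: {resource!r}")
    segments = ["lemmas", quote(headword, safe="")]
    if morph is not None:
        # '*' stands in for the part of speech when only a tag is given.
        segments += [pos or "*", morph]
    elif pos is not None:
        segments.append(pos)
    if resource is not None:
        segments.append(resource)
    # Trailing slashes vary in the examples; this sketch omits them.
    return f"{HOST}/" + "/".join(segments)
```

Under these assumptions, `lemma_url("furor", morph="n-s---mn3-")` selects only the noun, and `lemma_url("sicula", "n", resource="synsets/relations")` asks for the semantic relations reached through the word's synsets.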

Synsets

Detailed information about a particular sense (synset) in the WordNet is available using /synsets followed by the part of speech and relevant offset identification number. To obtain information about the lemmas belonging to a particular synset, append /lemmas. Alternatively, the semantic relations pertaining to a synset are available at /relations.

http://latinwordnet.exeter.ac.uk/n/03316977/ # a protective structure or device (usually metal)
http://latinwordnet.exeter.ac.uk/v/01207150/lemmas
http://latinwordnet.exeter.ac.uk/a/01918843/relations
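A synset address combines a part of speech with an offset, which the sketch below makes explicit. `synset_url` is a hypothetical helper; the /synsets segment comes from the prose, and the eight-digit zero-padded form of the offset is an assumption drawn from the examples.

```python
HOST = "http://latinwordnet.exeter.ac.uk"
PARTS_OF_SPEECH = {"n", "v", "a", "r"}


def synset_url(pos, offset, resource=None):
    """Build a synset URL from a part of speech and an offset number."""
    if pos not in PARTS_OF_SPEECH:
        raise ValueError(f"unknown part of speech: {pos!r}")
    if resource not in (None, "lemmas", "relations"):
        raise ValueError(f"unknown sub-resource: {resource!r}")
    # Assumption: offsets are eight digits with leading zeros, as in the examples.
    offset = str(offset).zfill(8)
    if len(offset) != 8 or not offset.isdigit():
        raise ValueError(f"offset must be at most eight digits: {offset!r}")
    url = f"{HOST}/synsets/{pos}/{offset}/"
    return url + resource if resource else url
```

Passing the offset as an integer, as in `synset_url("n", 3316977)`, restores the leading zero that a numeric type would otherwise drop.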
            
        

Semfields

Semfields represent very large conceptual domains encompassing many synsets. Presently the Latin WordNet takes advantage of the Dewey Decimal Classification System as a topic index, in order to provide an appropriate degree of conceptual granularity and hierarchy. To access a semfield record in the WordNet, you will need its DDCS code. E.g., '630' is 'Agriculture' in the hundreds division and 'Agriculture & Related Technologies' in the tens division.

http://latinwordnet.exeter.ac.uk/semfields/630

This listing describes the hierarchical (superordinate and subordinate) relations of the semfield in question. /synsets instead indicates the specific synsets within these domains, and /lemmas resolves each of these synsets to a list of lemmas.
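To make the hierarchy concrete, the sketch below derives the broader codes of a three-digit Dewey code by plain arithmetic on its digits and builds the corresponding semfield URLs. Both helpers are hypothetical, and the digit arithmetic reflects the general structure of the Dewey system (hundreds, tens, units), not necessarily how the WordNet records its own hierarchy.

```python
HOST = "http://latinwordnet.exeter.ac.uk"


def ddc_ancestors(code):
    """Return the broader three-digit codes above `code`, broadest first."""
    if len(code) != 3 or not code.isdigit():
        raise ValueError(f"expected a three-digit code: {code!r}")
    hundreds = code[0] + "00"   # e.g. '636' -> '600'
    tens = code[:2] + "0"       # e.g. '636' -> '630'
    ancestors = []
    for broader in (hundreds, tens):
        if broader != code and broader not in ancestors:
            ancestors.append(broader)
    return ancestors


def semfield_url(code, resource=None):
    """Build a semfield URL, optionally ending in /synsets or /lemmas."""
    if resource not in (None, "synsets", "lemmas"):
        raise ValueError(f"unknown sub-resource: {resource!r}")
    url = f"{HOST}/semfields/{code}"
    return f"{url}/{resource}" if resource else url
```

Here `ddc_ancestors("636")` gives the codes whose records would be expected to sit above a hypothetical '636' entry, and `semfield_url("630", "lemmas")` addresses the lemma listing for the agriculture semfield.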

Lemmatization

The Latin WordNet provides a lemmatization service at /lemmatize, using the morphological information in the database.

http://latinwordnet.exeter.ac.uk/lemmatize/reginarum

Results will consist of a list of possible lemmas for this form, along with relevant morphological analyses.
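For running text, one request per distinct word form is usually enough. The sketch below prepares those requests; `lemmatize_urls` is a hypothetical helper, and lowercasing the forms before the query is an assumption about what the service accepts.

```python
import re
from urllib.parse import quote

HOST = "http://latinwordnet.exeter.ac.uk"


def lemmatize_urls(text):
    """Return (form, URL) pairs for each distinct word form, in order of first use."""
    seen = set()
    pairs = []
    # Runs of letters only; digits, punctuation, and underscores are skipped.
    for form in re.findall(r"[^\W\d_]+", text.lower()):
        if form not in seen:
            seen.add(form)
            pairs.append((form, f"{HOST}/lemmatize/{quote(form)}"))
    return pairs
```

Each URL can then be requested in turn, and the returned analyses matched back to the form that produced them.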

Translation

Additionally, the API offers a service for translating some words from English, French, Italian, Spanish, and Hebrew into Latin. The source language must be given as an ISO 639 code, and a part of speech may optionally be provided.

http://latinwordnet.exeter.ac.uk/translate/en/war # English
http://latinwordnet.exeter.ac.uk/translate/es/guerra # Spanish
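A translation request might be built as follows. `translate_url` is a hypothetical helper; the two-letter codes for French, Italian, and Hebrew are the standard ISO 639-1 codes, and the position of the optional part of speech (a further path segment after the word) is an assumption, since the text does not say where it belongs.

```python
from urllib.parse import quote

HOST = "http://latinwordnet.exeter.ac.uk"
# Two-letter ISO 639-1 codes for the source languages named in the text.
SOURCE_LANGUAGES = {
    "en": "English",
    "fr": "French",
    "it": "Italian",
    "es": "Spanish",
    "he": "Hebrew",
}


def translate_url(language, word, pos=None):
    """Build a translation URL for a word in one of the supported languages."""
    if language not in SOURCE_LANGUAGES:
        raise ValueError(f"unsupported source language: {language!r}")
    url = f"{HOST}/translate/{language}/{quote(word, safe='')}"
    if pos is not None:
        # Assumption: the part of speech follows the word as a path segment.
        url += f"/{pos}"
    return url
```

Percent-encoding the word matters mainly for non-ASCII input such as Hebrew, where the raw characters cannot appear in a URL path as they stand.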
            
        

Searching

Finally, the API provides a mechanism for searching for partial lemmas and for synsets or semfields by their English glosses.

http://latinwordnet.exeter.ac.uk/lemmas?search=bula # All words containing the string 'bula'
http://latinwordnet.exeter.ac.uk/synsets?search=mythology # Any synset with the string 'mythology' in its gloss
http://latinwordnet.exeter.ac.uk/semfields?search=military # Any semfield with 'military' in its label
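Search terms containing spaces or other special characters need encoding before they go into the query string. The sketch below handles that with the standard library; `search_url` is a hypothetical helper, and only the three searchable collections shown above are accepted.

```python
from urllib.parse import urlencode

HOST = "http://latinwordnet.exeter.ac.uk"
SEARCHABLE = {"lemmas", "synsets", "semfields"}


def search_url(collection, term):
    """Build a search URL for lemmas, synsets, or semfields."""
    if collection not in SEARCHABLE:
        raise ValueError(f"cannot search {collection!r}")
    # urlencode escapes the term, turning spaces into '+'.
    return f"{HOST}/{collection}?{urlencode({'search': term})}"
```

Whether the service matches multi-word terms as a phrase or word by word is not stated in the text, so results for such queries should be checked by hand.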