GF WordNet is a large interlingual lexicon created by combining resources such as WordNet, PanLex and Wikipedia, plus several translation dictionaries and morphological lexicons. The lexicon consists of abstract lemmas such as apple_1_N which are mapped for each language to an inflection table. The table contains all possible forms for the most appropriate verbalization of the lemma.
Like in WordNet, the abstract lemmas are organized in synsets which group together lemmas with similar meaning. The synsets, on the other hand, are oganized into a semantic graph copied from WordNet.
The lexicon is compatible with the GF's Resource Grammars Library. Moreover, the grammar can be used to render the examples in the WordNet glosses to different languages. We use the examples to ensure that each abstract lemma is mapped into an appropriate translation for all languages. Since the translations of the examples are generated from a single interlingual abstract syntax tree, it is also possible to compute the alignment between the translations. We show the alignment when the user clicks on a word from the examples.
When there is a corresponding Wikipedia article for a lemma, then we also show the tumbnail image from the corresponding article. Clicking on the image opens the article itself in a new tab.
The lexicon is a work in progress. The status of each verbalization is shown as follows:
The lexicon contains about 100 000 lemmas, however, only
for English all lemmas have a verbalization. The relative lexicon
sizes for different languages as well as the current lemma statuses
are summarized in the diagram bellow:
here the green colour corresponds to verbalizations that have already been validated.
The following is a list of all resources in addition to Wikipedia and PanLex that have been used in the creation of the current lexicon:
|Bulgarian||BulTreeBank Wordnet||Open Multilingual WordNet||Translations/Senses|
|BG Office||BG Office||Morphology|
|Catalan||Multilingual Central Repository||Open Multilingual WordNet||Translations/Senses|
|Chinese||Chinese Open Wordnet||Open Multilingual WordNet||Translations/Senses|
|Dutch||Open Dutch WordNet||Computational Lexicology Lab||Translations+Morphology|
|English||Princeton WordNet||Princeton and Open Multilingual WordNet||Translations/Senses|
|Oxford Advanced Learner's Dictionary||Morphology|
|Estonian||Estonian WordNet||Estonian WordNet||Translations/Senses|
|Finnish||FinnWordNet||FinnWordNet and Open Multilingual WordNet||Translations/Senses|
|Italian||MultiWordNet||Open Multilingual WordNet||Translations/Senses|
|Portuguese||OpenWN-PT||Open Multilingual WordNet||Translations/Senses|
|Slovenian||SloWNet||Open Multilingual WordNet||Translations/Senses|
|Spanish||Multilingual Central Repository||Open Multilingual WordNet||Translations/Senses|
|Swedish||WordNet-SALDO||Open Multilingual WordNet||Translations/Senses|
|Svenskt OrdNät||Svenskt OrdNät||Translations/Senses|
|Folkets Lexikon||Folkets Lexikon||Translations|
|Thai||Thai Wordnet||Open Multilingual WordNet||Translations/Senses|