own-pt / Openwordnet Pt
Programming Languages
OpenWordnet-PT: An Open Access Wordnet for Portuguese
How to use it?
-
You can browse or search the data in our web interface.
-
You can download the RDF files and load it with any RDF library available for your preferable programming language.
-
You can query the data using our SPARQL Endpoint.
About the RDF
-
Based on http://www.w3.org/TR/wordnet-rdf/ and http://semanticweb.cs.vu.nl/lod/wn30/ with some modifications. More files from the Princeton distribution were considered and not only the database files. More properties and classes are included.
-
Lisp code used to create the RDF files is available.
-
Since we re-use the Princeton WordNet (PWN) synset identifiers, we do not need to repeat all the relationships already listed in wordnet-en.nt.gz. In the own-pt.nt.gz we list simply the new relations that we have added for Portuguese.
Team
Contributors
Related projects
-
http://github.com/own-pt/cl-wnbrowser/ a browser and search interface for our wordnet powered by Common Lisp and Apache Solr.
-
http://compling.hss.ntu.edu.sg/omw/ a browser and search interface for all open wordnets. Our OpenWordnet-PT is the Portuguese Wordnet available on this site.
-
http://nlp.lsi.upc.edu/freeling/ See the freeling directory for information and the OpenWordnet-PT version in the format used by FreeLing.
-
http://ontopt.dei.uc.pt Another wordnet-like ontology in Portuguese. It has incorporated OpenWordnet-PT.
How to cite?
See http://arademaker.github.io/bibliography/coling2012.html
How to contribute?
Please use the GitHub issue tracker (https://github.com/arademaker/wordnet-br/issues) or use the web interface to make suggestions about the data.
History of the Project
The initial version was generated by combining the following data:
-
Princeton WordNet 3.0 was used to obtain English glosses and English terms for synset IDs.
-
The unreleased 2010-12 version of [UWN] (http://www.mpi-inf.mpg.de/yago-naga/uwn/) and MENTA provided candidate terms in Portuguese, candidate glosses in Portuguese (from Wikipedia), and candidate terms in Spanish.
-
The EuroWordNet base concept list (
5000_bc.xml
) provides the base concept numbers. The original file was mapped from WordNet 2.0 to 3.0 using the mappings from WN-Map. When multiple mappings for a WordNet 2.0 synset existed, all possible WordNet 3.0 synsets were kept. Hence, there may be multiple entries with the same base concept number.
License
openWordnet-PT by Escola de Matemática Aplicada, Fundação Getulio Vargas is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at http://github.com/own-pt/openWordnet-PT.
Permissions beyond the scope of this license may be available at http://github.com/own-pt/openWordnet-PT.
Also please consult the LICENSE file.
Note that the wordnet-en.rdf file is based on Princeton WordNet 3.0, being simply its conversion to the RDF format. The Princeton WordNet 3.0 is distributed under the license http://wordnet.princeton.edu/wordnet/license/.