All Projects → Sciss → ws4j

Sciss / ws4j

Licence: other
WordNet Similarity for Java provides an API for several Semantic Relatedness/Similarity algorithms

Programming Languages

java
68154 projects - #9 most used programming language
scala
5932 projects

Projects that are alternatives of or similar to ws4j

wordnet
Stand-alone WordNet API
Stars: ✭ 39 (-4.88%)
Mutual labels:  wordnet
NLIDB
Natural Language Interface to DataBases
Stars: ✭ 100 (+143.9%)
Mutual labels:  wordnet
wordhoard
This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
Stars: ✭ 78 (+90.24%)
Mutual labels:  wordnet
Pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Stars: ✭ 8,112 (+19685.37%)
Mutual labels:  wordnet
TurkishWordNet
Turkish WordNet KeNet
Stars: ✭ 32 (-21.95%)
Mutual labels:  wordnet
gf-wordnet
A WordNet in GF
Stars: ✭ 15 (-63.41%)
Mutual labels:  wordnet
LMMS
Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings
Stars: ✭ 79 (+92.68%)
Mutual labels:  wordnet
Hierarchical-Word-Sense-Disambiguation-using-WordNet-Senses
Word Sense Disambiguation using Word Specific models, All word models and Hierarchical models in Tensorflow
Stars: ✭ 33 (-19.51%)
Mutual labels:  wordnet
m3gm
Max-Margin Markov Graph Models for WordNet (EMNLP 2018)
Stars: ✭ 40 (-2.44%)
Mutual labels:  wordnet
NatLang
NatLang is an English parser with an extensible grammar
Stars: ✭ 20 (-51.22%)
Mutual labels:  wordnet
textaugment
TextAugment: Text Augmentation Library
Stars: ✭ 280 (+582.93%)
Mutual labels:  wordnet
Wordbook
Wordbook is a dictionary application built for GNOME.
Stars: ✭ 56 (+36.59%)
Mutual labels:  wordnet
CogNet
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Stars: ✭ 26 (-36.59%)
Mutual labels:  wordnet
wn
A modern, interlingual wordnet interface for Python
Stars: ✭ 119 (+190.24%)
Mutual labels:  wordnet

WS4J

Maven Central

Statement

This project was exported from the original Google Code Location. The purpose is to publish an artifact to Maven Central. The repository has been changed to build with sbt instead of Maven.

To run the tests, download http://nlpwww.nict.go.jp/wn-ja/data/1.1/wnjpn.db.gz and unzip the .db file into the config directory.

The original author is Hideki Shima. Below is the original README:

contributing

Please see the file CONTRIBUTING.md


Thanks for downloading WS4J (http://code.google.com/p/ws4j).

Introduction

This software provides APIs for several semantic relatedness algorithms for, in theory, any WordNet instance. The codebase has been mostly ported from WordNet-Similarity-2.05 (http://wn-similarity.sourceforge.net/). We also use the data files from WordNet-Similarity-2.05 and WordNet-InfoContent-3.0, as seen in src/main/resources.

We tested WS4J with the JAWJAW on NICT Japanese WordNet (http://nlpwww.nict.go.jp/wn-ja/index.en.html) with which you can analyze English and Japanese (in Princeton WordNet 3.0 compatible synsets).

Preparation

By default, requirement for compilation are:

It's NORMAL that you see a build error in eclipse as JAWJAW is not contained in lib/ for the source code distribution. Before using WS4J, compile and package JAWJAW and put the jar file under the lib directory in this project: ./lib/jawjaw.jar Until you do this, WS4J does not compile.

When packaging JAWJAW, you may want to consider enabling on-memory DB mode which is disabled by default.

In case you want to use another WordNet API + instance, implement a WordNet wrapper following the real example for NICT wordnet + JAWJAW in edu.cmu.lti.lexical_db.NictWordNet

Testing

You can verify that the preparation is correctly done by running JUnit test cases.

Test cases: src/test/*

Maven command: mvn test

Launch file for Eclipse + m2e: launches/WS4J_Run_All_JUnitTests.launch

The expected results from the test cases are compatible with the original WordNet::Similarity in Perl (http://wn-similarity.sourceforge.net/).

Packaging

To customize WS4J, edit src/main/config/similarity.conf.

Here's a way to create a jar file including resource and config files.

Maven command: mvn install

Launch file for Eclipse + m2e: launches/WS4J_package_m2e.launch

Output jar file (may need a refresh on the directory): target/ws4j.jar

Using WS4J

See working examples in the following files.

Demos: src/main/java/edu/cmu/lti/ws4j/demo/SimilarityCalculationDemo.java

When using the WS4J jar package from other projects, make sure to also include depending libraries, i.e. junit, sqlite-jdbc, jawjaw. In maven's pom file, these dependencies can be written such as:

<dependencies>
  <dependency>
    <groupId>junit</groupId>
    <artifactId>junit</artifactId>
    <version>4.0</version>
    <scope>compile</scope>
  </dependency>
  <dependency>
    <groupId>org.xerial</groupId>
    <artifactId>sqlite-jdbc</artifactId>
    <version>3.7.2</version>
  </dependency>
  <dependency>
    <groupId>edu.cmu.lti</groupId>
    <artifactId>jawjaw</artifactId>
    <version>1.0.0</version>
    <scope>system</scope> 
    <systemPath>${basedir}/lib/jawjaw.jar</systemPath>
  </dependency>
  <dependency>
    <groupId>edu.cmu.lti</groupId>
    <artifactId>ws4j</artifactId>
    <version>1.0.0</version>
    <scope>system</scope> 
    <systemPath>${basedir}/lib/ws4j.jar</systemPath>
  </dependency>
</dependencies> 
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].