All Projects → NLPbox → stanford-corenlp-docker

NLPbox / stanford-corenlp-docker

Licence: other
build/run the most current Stanford CoreNLP server in a docker container

Programming Languages

Dockerfile
14818 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to stanford-corenlp-docker

datalinguist
Stanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+144.74%)
Mutual labels:  dependency-parser, corenlp
FAParser
A Fast(er) and Accurate Syntactic Parsing by Exacter Searching.
Stars: ✭ 17 (-55.26%)
Mutual labels:  dependency-parser, constituency-parser
udar
UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.
Stars: ✭ 15 (-60.53%)
Mutual labels:  dependency-parser
jstarcraft-nlp
专注于解决自然语言处理领域的几个核心问题:词法分析,句法分析,语义分析,语种检测,信息抽取,文本聚类和文本分类. 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了多种自然语言处理算法,适配了多个自然语言处理框架. 兼容Lucene/Solr/ElasticSearch插件.
Stars: ✭ 92 (+142.11%)
Mutual labels:  corenlp
gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Stars: ✭ 71 (+86.84%)
Mutual labels:  coreference
Hanlp
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Stars: ✭ 24,626 (+64705.26%)
Mutual labels:  dependency-parser
ling
Natural Language Processing Toolkit in Golang
Stars: ✭ 57 (+50%)
Mutual labels:  corenlp
Stanza
Official Stanford NLP Python Library for Many Human Languages
Stars: ✭ 5,887 (+15392.11%)
Mutual labels:  corenlp
dstlr
scalable knowledge graph construction from unstructured text
Stars: ✭ 82 (+115.79%)
Mutual labels:  corenlp
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+121.05%)
Mutual labels:  dependency-parser
syntaxnet
Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing
Stars: ✭ 77 (+102.63%)
Mutual labels:  dependency-parser
Pyhanlp
中文分词 词性标注 命名实体识别 依存句法分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁 自然语言处理
Stars: ✭ 2,564 (+6647.37%)
Mutual labels:  dependency-parser
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+84.21%)
Mutual labels:  dependency-parser
dependency parsing tf
Tensorflow implementation of "A Fast and Accurate Dependency Parser using Neural Networks"
Stars: ✭ 77 (+102.63%)
Mutual labels:  dependency-parser
turing
✨ 🧬 Turing AI - Semantic Navigation, Chatbot using Search Engine and Many NLP Vendors.
Stars: ✭ 30 (-21.05%)
Mutual labels:  corenlp
ipymarkup
NER, syntax markup visualizations
Stars: ✭ 108 (+184.21%)
Mutual labels:  dependency-parser
python-corenlp-protobuf
Python bindings for Stanford CoreNLP's protobufs.
Stars: ✭ 21 (-44.74%)
Mutual labels:  corenlp
yap
Yet Another (natural language) Parser
Stars: ✭ 40 (+5.26%)
Mutual labels:  dependency-parser
node-corenlp
CoreNLP @ NodeJS
Stars: ✭ 63 (+65.79%)
Mutual labels:  corenlp
dpar
Neural network transition-based dependency parser (in Rust)
Stars: ✭ 41 (+7.89%)
Mutual labels:  dependency-parser

stanford-corenlp-docker

This Dockerfile will build and run the most current release of the Stanford CoreNLP server in a docker container.

Usage

To download and run a prebuilt version of the CoreNLP server from Docker Hub locally at http://localhost:9000, just type:

docker run -p 9000:9000 nlpbox/corenlp

By default, CoreNLP will use up to 4GB of RAM. You can change this by setting the JAVA_XMX environment variable. Here, we're giving it 3GB:

docker run -e JAVA_XMX=3g -p 9000:9000 -ti nlpbox/corenlp

In order to build and run the container from scratch (e.g. if you want to use the most current release of Stanford CoreNLP, type:

docker build -t corenlp https://github.com/NLPbox/stanford-corenlp-docker.git
docker run -p 9000:9000 corenlp

In another console, you can now query the CoreNLP REST API like this:

wget -q --post-data "Although they didn't like it, they accepted the offer." \
  'localhost:9000/?properties={"annotators":"parse","outputFormat":"json"}' \
  -O - | jq ".sentences[0].parse"

which will return this parse tree:

"(ROOT\n  (S\n    (SBAR (IN Although)\n      (S\n        (NP (PRP they))\n        (VP (VBD did) (RB n't)\n          (PP (IN like)\n            (NP (PRP it))))))\n    (, ,)\n    (NP (PRP they))\n    (VP (VBD accepted)\n      (NP (DT the) (NN offer)))\n    (. .)))"

If you need the full xml output and want to configure more parameters, try:

wget -q --post-data "Although they didn't like it, they accepted the offer." \
  'localhost:9000/?properties={ \
    "annotators":"tokenize,ssplit,pos,lemma,ner,parse", \
    "ssplit.eolonly":"false", "tokenize.whitespace":"true", \
    "outputFormat":"xml"}' \
  -O results.xml
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].