PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+189.8%)
YoutokentomeUnsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+395.24%)
SentencepieceUnsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 5,540 (+3668.71%)
VncorenlpA Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+140.82%)
PythainlpThai Natural Language Processing in Python.
Stars: ✭ 582 (+295.92%)
ToiroA comparison tool of Japanese tokenizers
Stars: ✭ 95 (-35.37%)
TokenizerFast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-10.2%)
Paper Survey📚Survey of previous research and related works on machine learning (especially Deep Learning) in Japanese
Stars: ✭ 140 (-4.76%)
Cocoaai🤖 The Cocoa Artificial Intelligence Lab
Stars: ✭ 134 (-8.84%)
Tod BertPre-Trained Models for ToD-BERT
Stars: ✭ 143 (-2.72%)
Scattertext PydataNotebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-10.2%)
GooglelanguagerR client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-1.36%)
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1170.75%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (-11.56%)
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-11.56%)
MedquadMedical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
Stars: ✭ 129 (-12.24%)
FxdesktopsearchA JavaFX based desktop search application.
Stars: ✭ 147 (+0%)
Learn To Select DataCode for Learning to select data for transfer learning with Bayesian Optimization
Stars: ✭ 140 (-4.76%)
Deep LyricsLyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
Stars: ✭ 127 (-13.61%)
Neuraldialog LarlPyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 127 (-13.61%)
Mams For AbsaA Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-8.16%)
NeusumCode for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"
Stars: ✭ 143 (-2.72%)
Zamia AiFree and open source A.I. system based on Python, TensorFlow and Prolog.
Stars: ✭ 133 (-9.52%)
Nl2sql阿里天池首届中文NL2SQL挑战赛top6
Stars: ✭ 146 (-0.68%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (-3.4%)
UdaUnsupervised Data Augmentation (UDA)
Stars: ✭ 1,877 (+1176.87%)
Words countedA Ruby natural language processor.
Stars: ✭ 146 (-0.68%)
NlpaugData augmentation for NLP
Stars: ✭ 2,761 (+1778.23%)
TextacyNLP, before and after spaCy
Stars: ✭ 1,849 (+1157.82%)
Chars2vecCharacter-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-11.56%)
Tree TransformerImplementation of the paper Tree Transformer
Stars: ✭ 148 (+0.68%)
CorpuscrawlerCrawler for linguistic corpora
Stars: ✭ 127 (-13.61%)
Ipa DictMonolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (-5.44%)
Neuro🔮 Neuro.js is machine learning library for building AI assistants and chat-bots (WIP).
Stars: ✭ 126 (-14.29%)
Awesome Nlp ResourcesThis repository contains landmark research papers in Natural Language Processing that came out in this century.
Stars: ✭ 145 (-1.36%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-5.44%)
KeitaMy personal toolkit for PyTorch development.
Stars: ✭ 124 (-15.65%)
AbsapapersWorth-reading papers and related awesome resources on aspect-based sentiment analysis (ABSA). 值得一读的方面级情感分析论文与相关资源集合
Stars: ✭ 142 (-3.4%)
Kaggle Crowdflower1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
Stars: ✭ 1,708 (+1061.9%)
Fnc 1 BaselineA baseline implementation for FNC-1
Stars: ✭ 123 (-16.33%)
NcrfppNCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+1102.04%)
ClicrMachine reading comprehension on clinical case reports
Stars: ✭ 123 (-16.33%)
Spacy Js🎀 JavaScript API for spaCy with Python REST API
Stars: ✭ 123 (-16.33%)
Multihead Siamese NetsImplementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
Stars: ✭ 144 (-2.04%)
Sluice NetworksCode for Sluice networks: Learning what to share between loosely related tasks
Stars: ✭ 135 (-8.16%)