TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-93.16%)
NLP-toolsUseful python NLP tools (evaluation, GUI interface, tokenization)
Stars: ✭ 39 (-98.4%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (-82.55%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+800.37%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (-82.26%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+3.15%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (-82.06%)
Nlp chinese corpus大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+172.68%)
JcsegJcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for the latest lucene,solr,elasticsearch
Stars: ✭ 754 (-69.11%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-99.06%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (-86.03%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-85.74%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (-93.53%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (-85.33%)
Awesome Pytorch ListA comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Stars: ✭ 12,475 (+411.06%)
Nlp RecipesNatural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+136.91%)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (-53.63%)
Textblob ArArabic support for textblob
Stars: ✭ 60 (-97.54%)
Monkeylearn RubyOfficial Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Stars: ✭ 76 (-96.89%)
NeuronblocksNLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Stars: ✭ 1,356 (-44.45%)
Chinese nlu by using rasa nlu使用 RASA NLU 来构建中文自然语言理解系统(NLU)| Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Stars: ✭ 99 (-95.94%)
Transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+2183.57%)
LtpLanguage Technology Platform
Stars: ✭ 3,648 (+49.45%)
TextfoolerA Model for Natural Language Attack on Text Classification and Inference
Stars: ✭ 298 (-87.79%)
Chatbot nerchatbot_ner: Named Entity Recognition for chatbots.
Stars: ✭ 273 (-88.82%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-95%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-94.92%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (-94.67%)
Spacy Streamlit👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (-85.25%)
Monkeylearn PythonOfficial Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.
Stars: ✭ 143 (-94.14%)
PythainlpThai Natural Language Processing in Python.
Stars: ✭ 582 (-76.16%)
Wikipedia2vecA tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (-73.17%)
Hanlp中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Stars: ✭ 24,626 (+908.85%)
UndertheseaUnderthesea - Vietnamese NLP Toolkit
Stars: ✭ 823 (-66.28%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (-67.64%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (-94.18%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-97.79%)
Nlp TutorialA list of NLP(Natural Language Processing) tutorials
Stars: ✭ 1,188 (-51.33%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-97.91%)
ToiroA comparison tool of Japanese tokenizers
Stars: ✭ 95 (-96.11%)
Bible text gcnPytorch implementation of "Graph Convolutional Networks for Text Classification"
Stars: ✭ 90 (-96.31%)
Texting[ACL 2020] Tensorflow implementation for "Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks"
Stars: ✭ 103 (-95.78%)
CorenlpStanford CoreNLP: A Java suite of core NLP tools.
Stars: ✭ 8,248 (+237.89%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-95.29%)
Lingopackage lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (-95.37%)
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-94.67%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-95.45%)
TextFeatureSelectionPython library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (-98.28%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-96.23%)
Ml Classify Text JsMachine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-98.44%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-95.58%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (-94.18%)
BrowsecloudA web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (-94.14%)