Spacy: 💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+7275.17%)
Text mining resources: Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+20.13%)
Fastnlp: fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+719.13%)
Bible text gcn: Pytorch implementation of "Graph Convolutional Networks for Text Classification"
Stars: ✭ 90 (-69.8%)
Neuronblocks: NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Stars: ✭ 1,356 (+355.03%)
Monkeylearn Python: Official Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.
Stars: ✭ 143 (-52.01%)
Textblob Ar: Arabic support for textblob
Stars: ✭ 60 (-79.87%)
Kadot: Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-63.76%)
Ml Classify Text Js: Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-87.25%)
Monkeylearn Ruby: Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Stars: ✭ 76 (-74.5%)
Pyss3: A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-35.91%)
Nlp Recipes: Natural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+1840.6%)
Text Analytics With Python: Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+279.87%)
Spacy Streamlit: 👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (+20.81%)
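Hanlp: Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing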
Stars: ✭ 24,626 (+8163.76%)
Nlp Pretrained Model: A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-59.06%)
Catalyst: Accelerated deep learning R&D
Stars: ✭ 2,804 (+840.94%)
Wikipedia2vec: A tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (+119.8%)
Nlp In Practice: Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+165.1%)
Nlp Tutorial: A list of NLP (Natural Language Processing) tutorials
Stars: ✭ 1,188 (+298.66%)
Scdv: Text classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-81.88%)
Textvec: Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-43.96%)
Spark Nlp: State of the Art Natural Language Processing
Stars: ✭ 2,518 (+744.97%)
Texting: [ACL 2020] Tensorflow implementation for "Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks"
Stars: ✭ 103 (-65.44%)
Bert4doc Classification: Code and source for the paper "How to Fine-Tune BERT for Text Classification?"
Stars: ✭ 220 (-26.17%)
Pytorch Transformers Classification: Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Stars: ✭ 229 (-23.15%)
Nlpython: This repository contains the code related to Natural Language Processing using the Python scripting language. All of the code relates to my book "Python Natural Language Processing"
Stars: ✭ 265 (-11.07%)
Oie Resources: A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-5.03%)
Lingua Rs: 👄 The most accurate natural language detection library in the Rust ecosystem, suitable for long and short text alike
Stars: ✭ 260 (-12.75%)
Matterport3dsimulator: AI Research Platform for Reinforcement Learning from Real Panoramic Images.
Stars: ✭ 260 (-12.75%)
Medacy: 🏥 Medical Text Mining and Information Extraction with spaCy
Stars: ✭ 287 (-3.69%)
Languagecrunch: LanguageCrunch NLP server docker image
Stars: ✭ 281 (-5.7%)
Lda: LDA topic modeling for node.js
Stars: ✭ 262 (-12.08%)
Swem: The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"
Stars: ✭ 279 (-6.38%)
Bist Parser: Graph-based and Transition-based dependency parsers based on BiLSTMs
Stars: ✭ 257 (-13.76%)
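Ai Job Notes: Job-hunting guide for AI algorithm positions (covering preparation guides, practice-problem guides, internal referrals, a list of AI companies, and other materials)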
Stars: ✭ 3,191 (+970.81%)
Delft: a Deep Learning Framework for Text
Stars: ✭ 289 (-3.02%)
Trade Dst: Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743
Stars: ✭ 287 (-3.69%)
Fakenewscorpus: A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-14.43%)
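Articutapi: API of Articut, a Chinese word segmenter (with semantic part-of-speech tagging). Word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it achieves an F1-measure above 94% and a Recall above 96% on SIGHAN 2005.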
Stars: ✭ 252 (-15.44%)
Adaptnlp: An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
Stars: ✭ 278 (-6.71%)
Text2sql Data: A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (-3.69%)
Bluebert: BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
Stars: ✭ 273 (-8.39%)
Lbl2Vec: Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
Stars: ✭ 25 (-91.61%)
TextUnderstandingTsetlinMachine: Using the Tsetlin Machine to learn human-interpretable rules for high-accuracy text categorization with medical applications
Stars: ✭ 48 (-83.89%)
Pyswip: PySwip is a Python - SWI-Prolog bridge that lets you query SWI-Prolog from your Python programs. It features an (incomplete) SWI-Prolog foreign language interface, a utility class that makes querying with Prolog easy, and a Pythonic interface.
Stars: ✭ 276 (-7.38%)
WeSTClass: [CIKM 2018] Weakly-Supervised Neural Text Classification
Stars: ✭ 67 (-77.52%)
HiLAP: Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019
Stars: ✭ 116 (-61.07%)
Bert For Sequence Labeling And Text Classification: This is the template code to use BERT for sequence labeling and text classification, in order to facilitate BERT for more tasks. Currently, the template code includes CoNLL-2003 named entity identification, Snips slot filling, and intent prediction.
Stars: ✭ 293 (-1.68%)
Gector: Official implementation of the paper “GECToR – Grammatical Error Correction: Tag, Not Rewrite” // Published on BEA15 Workshop (co-located with ACL 2020) https://www.aclweb.org/anthology/2020.bea-1.16.pdf
Stars: ✭ 287 (-3.69%)
Clean Text: 🧹 Python package for text cleaning
Stars: ✭ 284 (-4.7%)
Nlp tasks: Natural Language Processing Tasks and References
Stars: ✭ 2,968 (+895.97%)
node-fasttext: Nodejs binding for fasttext representation and classification.
Stars: ✭ 39 (-86.91%)