DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+1633.75%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-41.49%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+331.58%)
Aidl kbA Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (-32.2%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+433.13%)
WordgcnACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (-28.79%)
Bert Embedding🔡 Token level embeddings from BERT model on mxnet and gluonnlp
Stars: ✭ 424 (+31.27%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+3325.7%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-65.63%)
Projects🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+22.91%)
Entity Recognition DatasetsA collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+175.85%)
Pytorch Sentiment AnalysisTutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+893.5%)
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (-4.64%)
Textblob ArArabic support for textblob
Stars: ✭ 60 (-81.42%)
Syntree2vecAn algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-97.21%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-41.8%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-67.18%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-66.56%)
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+58.82%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+121.36%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+3851.39%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+326.63%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-30.03%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (-13.93%)
CleoraCleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Stars: ✭ 303 (-6.19%)
Link GrammarThe CMU Link Grammar natural language parser
Stars: ✭ 286 (-11.46%)
Clean Text🧹 Python package for text cleaning
Stars: ✭ 284 (-12.07%)
Medical Datasetstracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (-8.36%)
Textractextract text from any document. no muss. no fuss.
Stars: ✭ 3,165 (+879.88%)
NlpSelected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Stars: ✭ 304 (-5.88%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-12.38%)
LanguagecrunchLanguageCrunch NLP server docker image
Stars: ✭ 281 (-13%)
GraphbrainLanguage, Knowledge, Cognition
Stars: ✭ 294 (-8.98%)
MeglassAn eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (-13%)
SwemThe Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"
Stars: ✭ 279 (-13.62%)
PyresparserA simple resume parser used for extracting information from resumes
Stars: ✭ 297 (-8.05%)
AdaptnlpAn easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
Stars: ✭ 278 (-13.93%)
TrankitTrankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Stars: ✭ 311 (-3.72%)
TextfoolerA Model for Natural Language Attack on Text Classification and Inference
Stars: ✭ 298 (-7.74%)
BluebertBlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
Stars: ✭ 273 (-15.48%)
PyswipPySwip is a Python - SWI-Prolog bridge enabling to query SWI-Prolog in your Python programs. It features an (incomplete) SWI-Prolog foreign language interface, a utility class that makes it easy querying with Prolog and also a Pythonic interface.
Stars: ✭ 276 (-14.55%)
LibpostalA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Stars: ✭ 3,312 (+925.39%)
Nlp tasksNatural Language Processing Tasks and References
Stars: ✭ 2,968 (+818.89%)
Autonlp🤗 AutoNLP: train state-of-the-art natural language processing models and deploy them in a scalable environment automatically
Stars: ✭ 263 (-18.58%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+1241.8%)
LtpLanguage Technology Platform
Stars: ✭ 3,648 (+1029.41%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (-12.07%)
Recurrent Entity NetworksTensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".
Stars: ✭ 276 (-14.55%)
TdcTherapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (-9.91%)
Nlp TutorialTutorial: Natural Language Processing in Python
Stars: ✭ 274 (-15.17%)
Chatbot nerchatbot_ner: Named Entity Recognition for chatbots.
Stars: ✭ 273 (-15.48%)
ZhihuThis repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (+923.84%)
AutogluonAutoGluon: AutoML for Text, Image, and Tabular Data
Stars: ✭ 3,920 (+1113.62%)