spacy conllPipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
Stars: ✭ 60 (+233.33%)
DframcyDataframe Integration with spaCy.
Stars: ✭ 74 (+311.11%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+9494.44%)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+6188.89%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (+150%)
PyinflectA python module for word inflections designed for use with spaCy.
Stars: ✭ 52 (+188.89%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+5438.89%)
NLP QuickbookNLP in Python with Deep Learning
Stars: ✭ 516 (+2766.67%)
ScispacyA full spaCy pipeline and models for scientific/biomedical documents.
Stars: ✭ 855 (+4650%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+9466.67%)
MordecaiFull text geoparsing as a Python library
Stars: ✭ 579 (+3116.67%)
DrFAQDrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.
Stars: ✭ 29 (+61.11%)
SmltarManuscript of the book "Supervised Machine Learning for Text Analysis in R" by Emil Hvitfeldt and Julia Silge
Stars: ✭ 125 (+594.44%)
Spacy Stanza💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Stars: ✭ 508 (+2722.22%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+122000%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (+466.67%)
Projects🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+2105.56%)
SMMTSocial Media Mining Toolkit (SMMT) main repository
Stars: ✭ 116 (+544.44%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (+405.56%)
Adam qasADAM - A Question Answering System. Inspired from IBM Watson
Stars: ✭ 330 (+1733.33%)
rita-dslA Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format
Stars: ✭ 60 (+233.33%)
Displacy💥 displaCy.js: An open-source NLP visualiser for the modern web
Stars: ✭ 311 (+1627.78%)
Ml Dl ScriptsThe repository provides usefull python scripts for ML and data analysis
Stars: ✭ 119 (+561.11%)
spacy readabilityspaCy pipeline component for adding text readability meta data to Doc objects.
Stars: ✭ 54 (+200%)
StopwordsMultilingual Stopword Lists in R
Stars: ✭ 89 (+394.44%)
spacy-clausieImplementation of the ClausIE information extraction system for python+spacy
Stars: ✭ 106 (+488.89%)
autonomioCore functionality for the Autonomio augmented intelligence workbench.
Stars: ✭ 27 (+50%)
SuperCombinators[Deprecated] A Swift parser combinator framework
Stars: ✭ 19 (+5.56%)
talks💥 Browser-based slides or PDFs of our talks and presentations
Stars: ✭ 91 (+405.56%)
finglishA Finglish to Persian converter.
Stars: ✭ 60 (+233.33%)
amrlibA python library that makes AMR parsing, generation and visualization simple.
Stars: ✭ 107 (+494.44%)
KonlpyPython package for Korean natural language processing.
Stars: ✭ 1,098 (+6000%)
intertextDetect and visualize text reuse
Stars: ✭ 97 (+438.89%)
presidio-researchThis package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
Stars: ✭ 62 (+244.44%)
NgramFast n-Gram Tokenization
Stars: ✭ 55 (+205.56%)
hello-nlpA natural language search microservice
Stars: ✭ 85 (+372.22%)
Lexisnexistools📰 Working with newspaper data from 'LexisNexis'
Stars: ✭ 59 (+227.78%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+1033.33%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (+138.89%)
Rust UnicUNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (+950%)
SdIntuitive find & replace CLI (sed alternative)
Stars: ✭ 2,755 (+15205.56%)
Gsoc2018 3gm💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (+100%)
Text DetectorTool which allow you to detect and translate text.
Stars: ✭ 173 (+861.11%)
frogFrog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+288.89%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (+83.33%)
VERSEVancouver Event and Relation System for Extraction
Stars: ✭ 13 (-27.78%)
bert-tensorflow-pytorch-spacy-conversionInstructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.
Stars: ✭ 26 (+44.44%)
clustextEasy, fast clustering of texts
Stars: ✭ 18 (+0%)
OreAn R interface to the Onigmo regular expression library
Stars: ✭ 54 (+200%)