Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+387.65%)
LexiconA data package containing lexicons and dictionaries for text analysis
Stars: ✭ 87 (-46.3%)
Friend.lyA social media platform with a friend recommendation engine based on personality trait extraction
Stars: ✭ 41 (-74.69%)
Open Semantic SearchOpen Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Stars: ✭ 386 (+138.27%)
GeniusEasily access song lyrics from Genius in a tibble.
Stars: ✭ 111 (-31.48%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (-16.67%)
PyphoneticsA Python 3 phonetics library.
Stars: ✭ 61 (-62.35%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-69.14%)
GraphbrainLanguage, Knowledge, Cognition
Stars: ✭ 294 (+81.48%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-29.01%)
TidytextText mining using tidy tools ✨📄✨
Stars: ✭ 975 (+501.85%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (-9.88%)
NlpplnNLP pipeline software using common workflow language
Stars: ✭ 31 (-80.86%)
Text predictorChar-level RNN LSTM text generator📄.
Stars: ✭ 99 (-38.89%)
AutophraseAutoPhrase: Automated Phrase Mining from Massive Text Corpora
Stars: ✭ 835 (+415.43%)
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+7693.83%)
BigartmFast topic modeling platform
Stars: ✭ 563 (+247.53%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (-48.77%)
KhcoderKH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (-22.22%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+120.99%)
NgramFast n-Gram Tokenization
Stars: ✭ 55 (-66.05%)
RplosR client for the PLoS Journals API
Stars: ✭ 289 (+78.4%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+962.96%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-73.46%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-8.64%)
Gsoc2018 3gm💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-77.78%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (-29.01%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-79.63%)
Tidy Text MiningManuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Stars: ✭ 961 (+493.21%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (-37.04%)
SpiderA configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+488.89%)
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (-9.88%)
BagofconceptsPython implementation of bag-of-concepts
Stars: ✭ 18 (-88.89%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-43.83%)
Rake NltkPython implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Stars: ✭ 793 (+389.51%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+1125.31%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+341.36%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (-47.53%)
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+216.67%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+966.05%)
LdavisR package for web-based interactive topic model visualization.
Stars: ✭ 466 (+187.65%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-55.56%)
ChemdataextractorAutomatically extract chemical information from scientific documents
Stars: ✭ 152 (-6.17%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+131.48%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+114.81%)
KonlpyPython package for Korean natural language processing.
Stars: ✭ 1,098 (+577.78%)
TokenizersFast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (-0.62%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-1.23%)
Textfeatures👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (-8.64%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-64.81%)