ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (-28.46%)
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (-78.69%)
textgoText preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Stars: ✭ 33 (-98.63%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (-70.46%)
clustextEasy, fast clustering of texts
Stars: ✭ 18 (-99.25%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-98.88%)
JoSH[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (-97.71%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (-91.86%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-98.01%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (-70.29%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-95.51%)
GeniusEasily access song lyrics from Genius in a tibble.
Stars: ✭ 111 (-95.39%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (-93.93%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (-42.09%)
Elmo TutorialA short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Stars: ✭ 145 (-93.98%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (-95.76%)
Text SummarizerPython Framework for Extractive Text Summarization
Stars: ✭ 96 (-96.01%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (-94.39%)
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-96.22%)
LexiconA data package containing lexicons and dictionaries for text analysis
Stars: ✭ 87 (-96.39%)
Multi rakeMultilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Stars: ✭ 162 (-93.27%)
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+424.55%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (-96.47%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (-95.22%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-93.85%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-95.39%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-93.35%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-95.6%)
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (-93.93%)
FastrtextR wrapper for fastText
Stars: ✭ 103 (-95.72%)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Stars: ✭ 168 (-93.02%)
Text predictorChar-level RNN LSTM text generator📄.
Stars: ✭ 99 (-95.89%)
Postgres Word2vecutils to use word embedding like word2vec vectors in a postgres database
Stars: ✭ 96 (-96.01%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+430.25%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-96.22%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (-28.25%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-96.47%)
DebiasweRemove problematic gender bias from word embeddings.
Stars: ✭ 175 (-92.73%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (-96.55%)
KhcoderKH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (-94.77%)
ChemdataextractorAutomatically extract chemical information from scientific documents
Stars: ✭ 152 (-93.69%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-97.01%)
Hash EmbeddingsPyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Stars: ✭ 126 (-94.77%)
ClustercatFast Word Clustering Software
Stars: ✭ 65 (-97.3%)
PyphoneticsA Python 3 phonetics library.
Stars: ✭ 61 (-97.47%)
Textblob ArArabic support for textblob
Stars: ✭ 60 (-97.51%)
TokenizersFast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (-93.31%)
MimickCode for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Stars: ✭ 152 (-93.69%)
Nlp overviewOverview of Modern Deep Learning Techniques Applied to Natural Language Processing
Stars: ✭ 1,104 (-54.13%)
KonlpyPython package for Korean natural language processing.
Stars: ✭ 1,098 (-54.38%)
Lstm Context EmbeddingsAugmenting word embeddings with their surrounding context using bidirectional RNN
Stars: ✭ 57 (-97.63%)