TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+42.11%)
BibleUtilitiesSet of utilities to scan, parse, and work with Bible references.
Stars: ✭ 20 (+5.26%)
misinfo📊 Tools to Perform ‘Misinformation’ Analysis on a Text Corpus (wrapper for methods in https://github.com/PDXBek/Misinformation)
Stars: ✭ 17 (-10.53%)
Guten-gutterStrips boilerplate from Project Gutenberg text files
Stars: ✭ 16 (-15.79%)
palladianPalladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Stars: ✭ 32 (+68.42%)
readerDistant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-5.26%)
readabilityFast readability scores for text data
Stars: ✭ 22 (+15.79%)
DeskBibleThe application for the study of the Bible on Android
Stars: ✭ 23 (+21.05%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-36.84%)
TabInOutFramework for information extraction from tables
Stars: ✭ 37 (+94.74%)
AravecAraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+1157.89%)
HumanPilotSpatial Transcriptomics human DLPFC pilot study part of the spatialLIBD project
Stars: ✭ 22 (+15.79%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+931.58%)
DartBible-Fluttercross-platform mobile bible app [Android & iOS / iPhone / iPad]; written in Dart programming language
Stars: ✭ 26 (+36.84%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+905.26%)
unboundbibleUnbound Bible is an open source and a free, multilingual Bible-reader program for Mac, Linux and Windows.
Stars: ✭ 25 (+31.58%)
BreadabilityReworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
Stars: ✭ 186 (+878.95%)
wdlRunRElastic, reproducible, and reusable genomic data science tools from R backed by cloud resources
Stars: ✭ 34 (+78.95%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+852.63%)
textlearnRA simple collection of well working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.
Stars: ✭ 16 (-15.79%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+742.11%)
extractnetA Dragnet that also extract author, headline, date, keywords from context
Stars: ✭ 52 (+173.68%)
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+66352.63%)
r-docker-tutorialA docker tutorial for reproducible research
Stars: ✭ 245 (+1189.47%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (+200%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (+668.42%)
v2Version 2 of the getBible API
Stars: ✭ 34 (+78.95%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (+610.53%)
KhcoderKH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (+563.16%)
epanetReaderRead text files in Epanet's .inp and .rpt formats into R
Stars: ✭ 18 (-5.26%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+110.53%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+505.26%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+42.11%)
GeniusEasily access song lyrics from Genius in a tibble.
Stars: ✭ 111 (+484.21%)
malay-datasetText corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+894.74%)
Text predictorChar-level RNN LSTM text generator📄.
Stars: ✭ 99 (+421.05%)
nasapowerAPI Client for NASA POWER Global Meteorology, Surface Solar Energy and Climatology in R
Stars: ✭ 79 (+315.79%)
PubMed-Best-MatchMachine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+89.47%)
sword-to-orgConvert Sword modules to Org-mode outlines
Stars: ✭ 32 (+68.42%)
woollyThe Text Mining Elixir
Stars: ✭ 48 (+152.63%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (+336.84%)
AdjutantRuns a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+210.53%)
PyphoneticsA Python 3 phonetics library.
Stars: ✭ 61 (+221.05%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-15.79%)
restaurant-finder-featureReviewsBuild a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (+10.53%)
sentometricsAn integrated framework in R for textual sentiment time series aggregation and prediction
Stars: ✭ 77 (+305.26%)
estrattoparsing fixed width files content made easy
Stars: ✭ 12 (-36.84%)
cusumcharterEasier CUSUM control charts. Returns simple CUSUM statistics, CUSUMs with control limit calculations, and function to generate faceted CUSUM Control Charts
Stars: ✭ 17 (-10.53%)
cranlogsDownload Logs from the RStudio CRAN Mirror
Stars: ✭ 70 (+268.42%)
mikropmlUser-Friendly R Package for Supervised Machine Learning Pipelines
Stars: ✭ 34 (+78.95%)
theographic-webA linked encyclopedia of biblical people, places, periods, and passages
Stars: ✭ 19 (+0%)