civicmineText mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (-5%)
woollyThe Text Mining Elixir
Stars: ✭ 48 (+140%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+390%)
Guten-gutterStrips boilerplate from Project Gutenberg text files
Stars: ✭ 16 (-20%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+3455%)
textreadrTools to uniformly read in text data including semi-structured transcripts
Stars: ✭ 65 (+225%)
BIT AINo description or website provided.
Stars: ✭ 41 (+105%)
jupyter-cacheA defined interface for working with a cache of executed jupyter notebooks
Stars: ✭ 28 (+40%)
crminer⛔ ARCHIVED ⛔ Fetch 'Scholary' Full Text from 'Crossref'
Stars: ✭ 17 (-15%)
03 Python Flow ControlFlow control is the order in which statements or blocks of code are executed at runtime based on a condition. Learn Conditional statements, Iterative statements, and Transfer statements
Stars: ✭ 207 (+935%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+125%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+535%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+200%)
2017-summer-workshopExercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (+65%)
learning2hash.github.ioWebsite for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io
Stars: ✭ 14 (-30%)
deafrica-sandbox-notebooksRepository for Digital Earth Africa Sandbox, including: Jupyter notebooks, scripts, tools and workflows for geospatial analysis with Open Data Cube and xarray
Stars: ✭ 108 (+440%)
Quantum-Computing-ResourcesThis repository contains the best resources for learning practical quantum computing. This repository will be updated frequently.
Stars: ✭ 60 (+200%)
NAGPythonExamplesExamples and demos showing how to call functions from the NAG Library for Python
Stars: ✭ 46 (+130%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+35%)
TeachingDataScienceCourse notes for Data Science related topics, prepared in LaTeX
Stars: ✭ 102 (+410%)
koshort(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.
Stars: ✭ 62 (+210%)
AdjutantRuns a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+195%)
clustextEasy, fast clustering of texts
Stars: ✭ 18 (-10%)
notebook-environmentsManage python virtual environments on the working notebook server
Stars: ✭ 43 (+115%)
rkThe remote Jupyter kernel/kernels administration utility
Stars: ✭ 53 (+165%)
gofastrMake a DocumentTermMatrix faster
Stars: ✭ 19 (-5%)
AravecAraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+1095%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+100%)
R.TeMiSR.TeMiS: R Text Mining Solution
Stars: ✭ 21 (+5%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+880%)
IOTA101IOTA Developer Essentials
Stars: ✭ 38 (+90%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+855%)
BreadabilityReworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
Stars: ✭ 186 (+830%)
KaggleKaggle Kernels (Python, R, Jupyter Notebooks)
Stars: ✭ 26 (+30%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+805%)
thrones2vecUsing Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").
Stars: ✭ 27 (+35%)
TokenizersFast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (+705%)
malay-datasetText corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+845%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+700%)
adam homeADAM python client and notebooks
Stars: ✭ 12 (-40%)
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+63030%)
notebook-free-notebookA professional, lock-in-free Jupyter dev env for coders, teams and non-trivial, large Jupyter projects
Stars: ✭ 38 (+90%)
Textfeatures👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (+640%)
misinfo📊 Tools to Perform ‘Misinformation’ Analysis on a Text Corpus (wrapper for methods in https://github.com/PDXBek/Misinformation)
Stars: ✭ 17 (-15%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (+630%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (+575%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-20%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+355%)
NEMO-examplesSimple configurations to study specific oceanic physical processes and be used as a tool for training
Stars: ✭ 14 (-30%)
sacred📖 Sacred texts in R
Stars: ✭ 19 (-5%)
SciCompforChemistsScientific Computing for Chemists text for teaching basic computing skills to chemistry students using Python, Jupyter notebooks, and the SciPy stack. This text makes use of a variety of packages including NumPy, SciPy, matplotlib, pandas, seaborn, NMRglue, SymPy, scikit-image, and scikit-learn.
Stars: ✭ 65 (+225%)
TabInOutFramework for information extraction from tables
Stars: ✭ 37 (+85%)