emperor-os(new released v2.5 LTS.2022-06-25) It has focused on developing an All in One operating system for programming, designing and data science.Emperor-OS has over 500 apps and important tools
Stars: ✭ 32 (-74.8%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (-33.07%)
PySPODA Python package for spectral proper orthogonal decomposition (SPOD).
Stars: ✭ 50 (-60.63%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-43.31%)
T-CorExImplementation of linear CorEx and temporal CorEx.
Stars: ✭ 31 (-75.59%)
LabelPropagationA NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (-20.47%)
KonlpyPython package for Korean natural language processing.
Stars: ✭ 1,098 (+764.57%)
readerDistant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-85.83%)
NgramFast n-Gram Tokenization
Stars: ✭ 55 (-56.69%)
tsamA python-based time series aggregation module (tsam) which can be used to reduce the number of time steps using typical periods or by decreasing the temporal resolution
Stars: ✭ 112 (-11.81%)
sacred📖 Sacred texts in R
Stars: ✭ 19 (-85.04%)
Gsoc2018 3gm💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-71.65%)
TabInOutFramework for information extraction from tables
Stars: ✭ 37 (-70.87%)
Tidy Text MiningManuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Stars: ✭ 961 (+656.69%)
SpiderA configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+651.18%)
rabbitmq-clustererThis project is ABANDONWARE. Use https://www.rabbitmq.com/cluster-formation.html instead.
Stars: ✭ 72 (-43.31%)
simon-frontend💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (-10.24%)
Rake NltkPython implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Stars: ✭ 793 (+524.41%)
candis🎀 A data mining suite for gene expression data.
Stars: ✭ 28 (-77.95%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+462.99%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-89.76%)
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+303.94%)
LdavisR package for web-based interactive topic model visualization.
Stars: ✭ 466 (+266.93%)
website-to-jsonConverts website to json using jQuery selectors
Stars: ✭ 37 (-70.87%)
gosquitogosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-80.31%)
FixedEffectjlrR interface for Fixed Effect Models
Stars: ✭ 20 (-84.25%)
MAL-MapCluster and visualize relationships between anime on MyAnimeList
Stars: ✭ 201 (+58.27%)
impfuzzyFuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (-47.24%)
R.TeMiSR.TeMiS: R Text Mining Solution
Stars: ✭ 21 (-83.46%)
imbalanced-ensembleClass-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库
Stars: ✭ 199 (+56.69%)
NlpythonThis repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+108.66%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (-53.54%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-55.91%)
sparseSparse matrix formats for linear algebra supporting scientific and machine learning applications
Stars: ✭ 136 (+7.09%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-74.02%)
FEATHERThe reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".
Stars: ✭ 34 (-73.23%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+11.81%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (-74.8%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (-53.54%)
Sampled-MinHashingA method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
Stars: ✭ 24 (-81.1%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (-68.5%)
readabilityFast readability scores for text data
Stars: ✭ 22 (-82.68%)
faytheAn experimental cluster brings Prometheus and OpenStack together
Stars: ✭ 18 (-85.83%)
TurboDataMinerThe objective of this Burp Suite extension is the flexible and dynamic extraction, correlation, and structured presentation of information from the Burp Suite project as well as the flexible and dynamic on-the-fly modification of outgoing or incoming HTTP requests using Python scripts. Thus, Turbo Data Miner shall aid in gaining a better and fas…
Stars: ✭ 46 (-63.78%)
NNetalgorithm for study: multi-layer-perceptron, cluster-graph, cnn, rnn, restricted boltzmann machine, bayesian network
Stars: ✭ 24 (-81.1%)
Semantic-Busobject flow treatment, data transformation
Stars: ✭ 49 (-61.42%)
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (-78.74%)
tsp-essayA fun study of some heuristics for the Travelling Salesman Problem.
Stars: ✭ 15 (-88.19%)