QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+329.17%)
nuts-mlFlow-based data pre-processing for deep learning
Stars: ✭ 32 (-33.33%)
Estadistica Con RApuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (+318.75%)
AsclepiusOpen Price Comparison for US Hospitals
Stars: ✭ 20 (-58.33%)
InstascrapePowerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (+320.83%)
hierarchical-clusteringA Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+29.17%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+297.92%)
merkalysisA marketing tool that helps you to market your products using organic marketing. This tool can potentially save you 1000s of dollars every year. The tool predicts the reach of your posts on social media and also suggests you hashtags for captions in such a way that it increases your reach.
Stars: ✭ 28 (-41.67%)
flask-angular-data-scienceRepository for a data science starter app using Flask, Angular and Docker. https://medium.com/@dvelsner/deploying-a-simple-machine-learning-model-in-a-modern-web-application-flask-angular-docker-a657db075280
Stars: ✭ 84 (+75%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (-33.33%)
ChefboostA Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Stars: ✭ 176 (+266.67%)
get smartiesDummy variable generation with fit/transform capabilities
Stars: ✭ 23 (-52.08%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+256.25%)
Semantic-Busobject flow treatment, data transformation
Stars: ✭ 49 (+2.08%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+252.08%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (+79.17%)
Pipelinethe `pipeline` shell command
Stars: ✭ 168 (+250%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-14.58%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+4002.08%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+26489.58%)
Sourced Cesource{d} Community Edition (CE)
Stars: ✭ 153 (+218.75%)
Alimusic🎼天池阿里音乐流行趋势预测大赛,项目中涵盖了从初赛到复赛的全部核心代码。复赛的聚合数据可以在百度网盘下载,更详细的思路介绍欢迎访问我的博客。
Stars: ✭ 147 (+206.25%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+204.17%)
scikitcrf NERPython library for custom entity recognition using Sklearn CRF
Stars: ✭ 17 (-64.58%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+36400%)
tymonAn AI Assistant More Than a Toolkit
Stars: ✭ 46 (-4.17%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+185.42%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+6466.67%)
TipdmTipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (+170.83%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+25%)
RavenRAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (+154.17%)
ReaperSocial media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+400%)
Kddcup 20206th Solution for 2020-KDDCUP Debiasing Challenge
Stars: ✭ 118 (+145.83%)
SSIM Seq2SeqSSIM - A Deep Learning Approach for Recovering Missing Time Series Sensor Data
Stars: ✭ 32 (-33.33%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+139.58%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+6256.25%)
BellaBella is a pure python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Stars: ✭ 112 (+133.33%)
mlBase machine learning image and environment.
Stars: ✭ 15 (-68.75%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+383.33%)
Gitlogg💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (+112.5%)
conferencias matutinas amloCSVs de las versiones estenográficas de las conferencias matutinas del Presidente Andres Manuel López Obrador ( Mañaneras AMLO )
Stars: ✭ 25 (-47.92%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+354.17%)
Automlpipeline.jlA package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+364.58%)
Decision-Tree-ImplementationA python 3 implementation of decision tree (machine learning classification algorithm) from scratch
Stars: ✭ 19 (-60.42%)
Deception-Detection-on-Amazon-reviews-datasetA SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-12.5%)
foreshadowAn automatic machine learning system
Stars: ✭ 29 (-39.58%)
dh-coreFunctional data science
Stars: ✭ 123 (+156.25%)
compvInsanely fast Open Source Computer Vision library for ARM and x86 devices (Up to #50 times faster than OpenCV)
Stars: ✭ 155 (+222.92%)