AutoTabular
Automatic machine learning for tabular data. ⚡🔥⚡
Stars: ✭ 51 (+13.33%)
mindware
An efficient open-source AutoML system for automating the machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
Stars: ✭ 34 (-24.44%)
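DataCon
🏆 DataCon Big Data Security Analysis Competition: champion source code for 2019 Track 2 (malicious code detection) and third-place source code for 2020 Track 5 (malicious code analysis)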
Stars: ✭ 69 (+53.33%)
EvolutionaryForest
An open source python library for automated feature engineering based on Genetic Programming
Stars: ✭ 56 (+24.44%)
Nlpython
This repository contains code for natural language processing in Python. All of the code relates to my book, "Python Natural Language Processing".
Stars: ✭ 265 (+488.89%)
50-days-of-Statistics-for-Data-Science
This repository consists of a 50-day program. All the statistics required for a complete understanding of data science will be uploaded to this repository.
Stars: ✭ 19 (-57.78%)
Kaggler
Code for Kaggle Data Science Competitions
Stars: ✭ 614 (+1264.44%)
gan tensorflow
Automatic feature engineering using Generative Adversarial Networks in TensorFlow.
Stars: ✭ 48 (+6.67%)
fastknn
Fast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (+42.22%)
clink
Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators that can be used in both C++ and Java runtimes.
Stars: ✭ 24 (-46.67%)
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+664.44%)
hrv-analysis
Package for Heart Rate Variability analysis in Python
Stars: ✭ 225 (+400%)
Hyperparameter hunter
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+1340%)
feature engine
Feature engineering package with sklearn-like functionality
Stars: ✭ 758 (+1584.44%)
dominance-analysis
This package can be used for dominance analysis or Shapley Value Regression to find the relative importance of predictors on a given dataset. It can also be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+146.67%)
Autodl
Automated Deep Learning without ANY human intervention. 1st-place solution for AutoDL.
Stars: ✭ 854 (+1797.78%)
anovos
Anovos - An open source library for scalable feature engineering using Apache Spark
Stars: ✭ 77 (+71.11%)
msda
Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and a prototype of explainable AI for anomaly detectors
Stars: ✭ 80 (+77.78%)
AutoTS
Automated Time Series Forecasting
Stars: ✭ 665 (+1377.78%)
icicle
Icicle Streaming Query Language
Stars: ✭ 16 (-64.44%)
sklearn-audio-classification
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
Stars: ✭ 31 (-31.11%)
feng
feng - feature engineering for machine-learning champions
Stars: ✭ 27 (-40%)
Open source demos
A collection of demos showcasing automated feature engineering and machine learning in diverse use cases
Stars: ✭ 391 (+768.89%)
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-2.22%)
Featexp
Feature exploration for supervised learning
Stars: ✭ 688 (+1428.89%)
autoencoders tensorflow
Automatic feature engineering using deep learning and Bayesian inference in TensorFlow.
Stars: ✭ 66 (+46.67%)
Predicting-Transportation-Modes-of-GPS-Trajectories
Understanding transportation mode from GPS (Global Positioning System) traces is an essential topic in the data mobility domain. In this paper, a framework is proposed to predict transportation modes. This framework follows a sequence of five steps: (i) data preparation, where GPS points are grouped in trajectory samples; (ii) point features gen…
Stars: ✭ 37 (-17.78%)
Protr
Comprehensive toolkit for generating various numerical features of protein sequences
Stars: ✭ 30 (-33.33%)
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1260%)
Featuretools
An open source python library for automated feature engineering
Stars: ✭ 5,891 (+12991.11%)
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+20%)
NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+1671.11%)
Feagen
(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Stars: ✭ 33 (-26.67%)
zca
ZCA whitening in python
Stars: ✭ 29 (-35.56%)
cortana-intelligence-customer360
This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
Stars: ✭ 22 (-51.11%)
PubMed-Best-Match
Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (-20%)
Feature Selection
Feature selector based on a self-selected algorithm, loss function, and validation method
Stars: ✭ 534 (+1086.67%)
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-48.89%)
mistql
A miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.
Stars: ✭ 260 (+477.78%)
skrobot
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of the scikit-learn framework.
Stars: ✭ 22 (-51.11%)
featurewiz
Use advanced feature engineering strategies and select the best features from your data set with a single line of code.
Stars: ✭ 229 (+408.89%)
kaggle-berlin
Material of the Kaggle Berlin meetup group!
Stars: ✭ 36 (-20%)
Awesome Feature Engineering
A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (+862.22%)
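EngineX
Engine X - a real-time AI decision engine, rules engine, risk-control engine, and data-flow engine. Rules are configured through a visual interface with no tedious development, which saves labor, improves efficiency, enables real-time monitoring, reduces error rates, and allows adjustment at any time. Supports rule sets, scorecards, decision trees, name-list database management, machine learning models, third-party data integration, custom development, and more.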
Stars: ✭ 369 (+720%)
Drugs Recommendation Using Reviews
Analyzing drug descriptions, conditions, and reviews, then recommending drugs for each patient health condition using deep learning models.
Stars: ✭ 35 (-22.22%)
Mljar Supervised
Automated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+2035.56%)
Kaggle Quora Question Pairs
Kaggle: Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)
Stars: ✭ 705 (+1466.67%)
Market-Mix-Modeling
Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (-31.11%)