Kaggle Quora Question PairsKaggle:Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)
Stars: ✭ 705 (+2511.11%)
Mljar SupervisedAutomated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+3459.26%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+100%)
EvalmlEvalML is an AutoML library written in python.
Stars: ✭ 145 (+437.04%)
KagglerCode for Kaggle Data Science Competitions
Stars: ✭ 614 (+2174.07%)
Hanzi char featurizer汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese characters (pronunciation features, glyph features) as features for deep learning
Stars: ✭ 187 (+592.59%)
NlpythonThis repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+881.48%)
NniAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+39522.22%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (+29.63%)
mistqlA miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.
Stars: ✭ 260 (+862.96%)
Machine Learning Workflow With PythonThis is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+481.48%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+3062.96%)
GeomancerAutomated feature engineering for geospatial data
Stars: ✭ 194 (+618.52%)
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+2300%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+707.41%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+1174.07%)
Auto ml[UNMAINTAINED] Automated machine learning for analytics & production
Stars: ✭ 1,559 (+5674.07%)
cortana-intelligence-customer360This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
Stars: ✭ 22 (-18.52%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (+218.52%)
gan tensorflowAutomatic feature engineering using Generative Adversarial Networks using TensorFlow.
Stars: ✭ 48 (+77.78%)
RemixautomlR package for automation of machine learning, forecasting, feature engineering, model evaluation, model interpretation, data generation, and recommenders.
Stars: ✭ 159 (+488.89%)
Feagen(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Stars: ✭ 33 (+22.22%)
LightautomlLAMA - automatic model creation framework
Stars: ✭ 196 (+625.93%)
ProtrComprehensive toolkit for generating various numerical features of protein sequences
Stars: ✭ 30 (+11.11%)
AlbedoA recommender system for discovering GitHub repos, built with Apache Spark
Stars: ✭ 149 (+451.85%)
FeatexpFeature exploration for supervised learning
Stars: ✭ 688 (+2448.15%)
FeaturetoolsAn open source python library for automated feature engineering
Stars: ✭ 5,891 (+21718.52%)
Fe4ml Zh📖 [译] 面向机器学习的特征工程
Stars: ✭ 2,323 (+8503.7%)
Feature SelectionFeatures selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (+1877.78%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+9440.74%)
Awesome Feature EngineeringA curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (+1503.7%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-44.44%)
Open source demosA collection of demos showcasing automated feature engineering and machine learning in diverse use cases
Stars: ✭ 391 (+1348.15%)
Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (+355.56%)
HyperactiveA hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (+574.07%)
AlinkAlink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Stars: ✭ 2,936 (+10774.07%)
feature engineFeature engineering package with sklearn like functionality
Stars: ✭ 758 (+2707.41%)
NyaggleCode for Kaggle and Offline Competitions
Stars: ✭ 209 (+674.07%)
mindwareAn efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
Stars: ✭ 34 (+25.93%)
BlurrData transformations for the ML era
Stars: ✭ 96 (+255.56%)
AutofeatLinear Prediction Model with Automated Feature Engineering and Selection Capabilities
Stars: ✭ 178 (+559.26%)
Home Credit Default RiskDefault risk prediction for Home Credit competition - Fast, scalable and maintainable SQL-based feature engineering pipeline
Stars: ✭ 68 (+151.85%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+3.7%)
tsflexFlexible time series feature extraction & processing
Stars: ✭ 252 (+833.33%)
TsfelAn intuitive library to extract features from time series
Stars: ✭ 202 (+648.15%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+7618.52%)
TpotA Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Stars: ✭ 8,378 (+30929.63%)