TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+8583.33%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (+83.33%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (+45.83%)
autoencoders tensorflowAutomatic feature engineering using deep learning and Bayesian inference using TensorFlow.
Stars: ✭ 66 (+175%)
TsfelAn intuitive library to extract features from time series
Stars: ✭ 202 (+741.67%)
Predicting-Transportation-Modes-of-GPS-TrajectoriesUnderstanding transportation mode from GPS (Global Positioning System) traces is an essential topic in the data mobility domain. In this paper, a framework is proposed to predict transportation modes. This framework follows a sequence of five steps: (i) data preparation, where GPS points are grouped in trajectory samples; (ii) point features gen…
Stars: ✭ 37 (+54.17%)
Mljar SupervisedAutomated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+3904.17%)
Machine Learning Workflow With PythonThis is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+554.17%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2450%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+3458.33%)
tsflexFlexible time series feature extraction & processing
Stars: ✭ 252 (+950%)
NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+3220.83%)
Kaggle Quora Question PairsKaggle:Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)
Stars: ✭ 705 (+2837.5%)
zcaZCA whitening in python
Stars: ✭ 29 (+20.83%)
EvalmlEvalML is an AutoML library written in python.
Stars: ✭ 145 (+504.17%)
PubMed-Best-MatchMachine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+50%)
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+2600%)
exemplary-ml-pipelineExemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-4.17%)
GeomancerAutomated feature engineering for geospatial data
Stars: ✭ 194 (+708.33%)
KagglerCode for Kaggle Data Science Competitions
Stars: ✭ 614 (+2458.33%)
skrobotskrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (-8.33%)
kaggle-berlinMaterial of the Kaggle Berlin meetup group!
Stars: ✭ 36 (+50%)
LycorisA lightweight and easy-to-use deep learning framework with neural architecture search.
Stars: ✭ 180 (+650%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+16.67%)
ContinuumA clean and simple data loading library for Continual Learning
Stars: ✭ 136 (+466.67%)
River🌊 Online machine learning in Python
Stars: ✭ 2,980 (+12316.67%)
Fwumious wabbitFwumious Wabbit, fast on-line machine learning toolkit written in Rust
Stars: ✭ 96 (+300%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+1333.33%)
RoadmapGitBook: OSCP RoadMap
Stars: ✭ 89 (+270.83%)
Hanzi char featurizer汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese characters (pronunciation features, glyph features) as features for deep learning
Stars: ✭ 187 (+679.17%)
HyperganComposable GAN framework with api and user interface
Stars: ✭ 1,104 (+4500%)
NlpythonThis repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+1004.17%)
Vowpal wabbitVowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Stars: ✭ 7,815 (+32462.5%)
Auto ml[UNMAINTAINED] Automated machine learning for analytics & production
Stars: ✭ 1,559 (+6395.83%)
OnlinemoocVue前台 + Django3.1 + DjangoRestful Framework + Ant Design Pro V4后台 开发的在线教育网站及后台管理
Stars: ✭ 587 (+2345.83%)
Boost CookbookOnline examples from "Boost C++ Application Development Cookbook":
Stars: ✭ 306 (+1175%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+808.33%)
Competitive-Feature-LearningOnline feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (+33.33%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+125%)
NniAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+44475%)
data aggregationThis repository contains the code for the CVPR 2020 paper "Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving"
Stars: ✭ 26 (+8.33%)
cortana-intelligence-customer360This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
Stars: ✭ 22 (-8.33%)
mistqlA miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.
Stars: ✭ 260 (+983.33%)
fengfeng - feature engineering for machine-learning champions
Stars: ✭ 27 (+12.5%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-37.5%)
NyaggleCode for Kaggle and Offline Competitions
Stars: ✭ 209 (+770.83%)
AutofeatLinear Prediction Model with Automated Feature Engineering and Selection Capabilities
Stars: ✭ 178 (+641.67%)
Home Credit Default RiskDefault risk prediction for Home Credit competition - Fast, scalable and maintainable SQL-based feature engineering pipeline
Stars: ✭ 68 (+183.33%)
icicleIcicle Streaming Query Language
Stars: ✭ 16 (-33.33%)