exemplary-ml-pipelineExemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (+21.05%)
feature engineFeature engineering package with sklearn like functionality
Stars: ✭ 758 (+3889.47%)
featurewizUse advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (+1105.26%)
Market-Mix-ModelingMarket Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+63.16%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+47.37%)
dominance-analysisThis package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+484.21%)
NlpythonThis repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+1294.74%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+1710.53%)
mistqlA miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.
Stars: ✭ 260 (+1268.42%)
autoencoders tensorflowAutomatic feature engineering using deep learning and Bayesian inference using TensorFlow.
Stars: ✭ 66 (+247.37%)
fastknnFast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (+236.84%)
Competitive-Feature-LearningOnline feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (+68.42%)
Machine Learning Workflow With PythonThis is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+726.32%)
NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+4094.74%)
skrobotskrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (+15.79%)
BlurrData transformations for the ML era
Stars: ✭ 96 (+405.26%)
ProtrComprehensive toolkit for generating various numerical features of protein sequences
Stars: ✭ 30 (+57.89%)
Awesome Feature EngineeringA curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (+2178.95%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (+352.63%)
NniAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+56205.26%)
TsfelAn intuitive library to extract features from time series
Stars: ✭ 202 (+963.16%)
pyHSICLassoVersatile Nonlinear Feature Selection Algorithm for High-dimensional Data
Stars: ✭ 125 (+557.89%)
msdaLibrary for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector
Stars: ✭ 80 (+321.05%)
Feature SelectionFeatures selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (+2710.53%)
gan tensorflowAutomatic feature engineering using Generative Adversarial Networks using TensorFlow.
Stars: ✭ 48 (+152.63%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+1047.37%)
tsflexFlexible time series feature extraction & processing
Stars: ✭ 252 (+1226.32%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-21.05%)
LimboLibrary for VLSI CAD Design Useful parsers and solvers' api are implemented.
Stars: ✭ 84 (+342.11%)
federated pcaFederated Principal Component Analysis Revisited!
Stars: ✭ 30 (+57.89%)
ParametricUMAP paperParametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Stars: ✭ 132 (+594.74%)
Python Computer Vision from ScratchThis repository explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply…
Stars: ✭ 219 (+1052.63%)
mrmrmRMR (minimum-Redundancy-Maximum-Relevance) for automatic feature selection at scale.
Stars: ✭ 170 (+794.74%)
Machine LearningA repository of resources for understanding the concepts of machine learning/deep learning.
Stars: ✭ 29 (+52.63%)
Stock-Selection-a-FrameworkThis project demonstrates how to apply machine learning algorithms to distinguish "good" stocks from the "bad" stocks.
Stars: ✭ 239 (+1157.89%)
walWAL enables programmable waveform analysis.
Stars: ✭ 36 (+89.47%)
mosesStreaming, Memory-Limited, r-truncated SVD Revisited!
Stars: ✭ 19 (+0%)
DRComparisonComparison of dimensionality reduction methods
Stars: ✭ 29 (+52.63%)
AutoTabularAutomatic machine learning for tabular data. ⚡🔥⚡
Stars: ✭ 51 (+168.42%)
sefA Python Library for Similarity-based Dimensionality Reduction
Stars: ✭ 24 (+26.32%)
enstopEnsemble topic modelling with pLSA
Stars: ✭ 104 (+447.37%)
uapcaUncertainty-aware principal component analysis.
Stars: ✭ 16 (-15.79%)
L0LearnEfficient Algorithms for L0 Regularized Learning
Stars: ✭ 74 (+289.47%)
EvolutionaryForestAn open source python library for automated feature engineering based on Genetic Programming
Stars: ✭ 56 (+194.74%)
zcaZCA whitening in python
Stars: ✭ 29 (+52.63%)
video featuresExtract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
Stars: ✭ 225 (+1084.21%)
BallStatistical Inference and Sure Independence Screening via Ball Statistics
Stars: ✭ 22 (+15.79%)
topometryA comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (+236.84%)
pykicadLibrary for working with KiCAD file formats
Stars: ✭ 46 (+142.11%)
gdstkGdstk (GDSII Tool Kit) is a C++/Python library for creation and manipulation of GDSII and OASIS files.
Stars: ✭ 171 (+800%)
SIFT-BoFFeature extraction by using SITF+BoF.
Stars: ✭ 22 (+15.79%)