All Projects → 50-days-of-Statistics-for-Data-Science → Similar Projects or Alternatives

329 Open source projects that are alternatives of or similar to 50-days-of-Statistics-for-Data-Science

exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (+21.05%)
feature engine
Feature engineering package with sklearn like functionality
Stars: ✭ 758 (+3889.47%)
featurewiz
Use advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (+1105.26%)
Market-Mix-Modeling
Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+63.16%)
FIFA-2019-Analysis
This is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+47.37%)
dominance-analysis
This package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+484.21%)
Nlpython
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+1294.74%)
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+1710.53%)
mistql
A miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.
Stars: ✭ 260 (+1268.42%)
autoencoders tensorflow
Automatic feature engineering using deep learning and Bayesian inference using TensorFlow.
Stars: ✭ 66 (+247.37%)
Bike-Sharing-Demand-Kaggle
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Stars: ✭ 33 (+73.68%)
fastknn
Fast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (+236.84%)
Competitive-Feature-Learning
Online feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (+68.42%)
Machine Learning Workflow With Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+726.32%)
NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+4094.74%)
skrobot
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (+15.79%)
Blurr
Data transformations for the ML era
Stars: ✭ 96 (+405.26%)
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (+1163.16%)
Protr
Comprehensive toolkit for generating various numerical features of protein sequences
Stars: ✭ 30 (+57.89%)
Awesome Feature Engineering
A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (+2178.95%)
Feature Engineering And Feature Selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Stars: ✭ 526 (+2668.42%)
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (+352.63%)
Nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+56205.26%)
The Building Data Genome Project
A collection of non-residential buildings for performance analysis and algorithm benchmarking
Stars: ✭ 117 (+515.79%)
Tsfel
An intuitive library to extract features from time series
Stars: ✭ 202 (+963.16%)
pyHSICLasso
Versatile Nonlinear Feature Selection Algorithm for High-dimensional Data
Stars: ✭ 125 (+557.89%)
msda
Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector
Stars: ✭ 80 (+321.05%)
Feature Selection
Features selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (+2710.53%)
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+6084.21%)
Mutual labels:  eda, feature-extraction
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (+636.84%)
Mutual labels:  eda, feature-engineering
gan tensorflow
Automatic feature engineering using Generative Adversarial Networks using TensorFlow.
Stars: ✭ 48 (+152.63%)
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+1047.37%)
tsflex
Flexible time series feature extraction & processing
Stars: ✭ 252 (+1226.32%)
Data-Science
Using Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-21.05%)
ReinforcementLearning Sutton-Barto Solutions
Solutions and figures for problems from Reinforcement Learning: An Introduction Sutton&Barto
Stars: ✭ 20 (+5.26%)
Mutual labels:  feature-engineering
Limbo
Library for VLSI CAD Design Useful parsers and solvers' api are implemented.
Stars: ✭ 84 (+342.11%)
Mutual labels:  eda
federated pca
Federated Principal Component Analysis Revisited!
Stars: ✭ 30 (+57.89%)
Mutual labels:  dimensionality-reduction
ParametricUMAP paper
Parametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Stars: ✭ 132 (+594.74%)
Mutual labels:  dimensionality-reduction
Python Computer Vision from Scratch
This repository explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply…
Stars: ✭ 219 (+1052.63%)
Mutual labels:  feature-extraction
mrmr
mRMR (minimum-Redundancy-Maximum-Relevance) for automatic feature selection at scale.
Stars: ✭ 170 (+794.74%)
Mutual labels:  feature-selection
pyAudioProcessing
Audio feature extraction and classification
Stars: ✭ 165 (+768.42%)
Mutual labels:  feature-extraction
Machine Learning
A repository of resources for understanding the concepts of machine learning/deep learning.
Stars: ✭ 29 (+52.63%)
Mutual labels:  dimensionality-reduction
Stock-Selection-a-Framework
This project demonstrates how to apply machine learning algorithms to distinguish "good" stocks from the "bad" stocks.
Stars: ✭ 239 (+1157.89%)
Mutual labels:  feature-selection
wal
WAL enables programmable waveform analysis.
Stars: ✭ 36 (+89.47%)
Mutual labels:  eda
moses
Streaming, Memory-Limited, r-truncated SVD Revisited!
Stars: ✭ 19 (+0%)
Mutual labels:  dimensionality-reduction
DRComparison
Comparison of dimensionality reduction methods
Stars: ✭ 29 (+52.63%)
Mutual labels:  dimensionality-reduction
AutoTabular
Automatic machine learning for tabular data. ⚡🔥⚡
Stars: ✭ 51 (+168.42%)
Mutual labels:  feature-engineering
featuretoolsOnSpark
A simplified version of featuretools for Spark
Stars: ✭ 24 (+26.32%)
Mutual labels:  feature-engineering
sef
A Python Library for Similarity-based Dimensionality Reduction
Stars: ✭ 24 (+26.32%)
Mutual labels:  dimensionality-reduction
enstop
Ensemble topic modelling with pLSA
Stars: ✭ 104 (+447.37%)
Mutual labels:  dimensionality-reduction
uapca
Uncertainty-aware principal component analysis.
Stars: ✭ 16 (-15.79%)
Mutual labels:  dimensionality-reduction
L0Learn
Efficient Algorithms for L0 Regularized Learning
Stars: ✭ 74 (+289.47%)
Mutual labels:  feature-selection
EvolutionaryForest
An open source python library for automated feature engineering based on Genetic Programming
Stars: ✭ 56 (+194.74%)
Mutual labels:  feature-engineering
zca
ZCA whitening in python
Stars: ✭ 29 (+52.63%)
Mutual labels:  feature-engineering
video features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
Stars: ✭ 225 (+1084.21%)
Mutual labels:  feature-extraction
Ball
Statistical Inference and Sure Independence Screening via Ball Statistics
Stars: ✭ 22 (+15.79%)
Mutual labels:  feature-selection
topometry
A comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (+236.84%)
Mutual labels:  dimensionality-reduction
pykicad
Library for working with KiCAD file formats
Stars: ✭ 46 (+142.11%)
Mutual labels:  eda
gdstk
Gdstk (GDSII Tool Kit) is a C++/Python library for creation and manipulation of GDSII and OASIS files.
Stars: ✭ 171 (+800%)
Mutual labels:  eda
SIFT-BoF
Feature extraction by using SITF+BoF.
Stars: ✭ 22 (+15.79%)
Mutual labels:  feature-extraction
1-60 of 329 similar projects