msdaLibrary for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector
Stars: ✭ 80 (+280.95%)
pyHSICLassoVersatile Nonlinear Feature Selection Algorithm for High-dimensional Data
Stars: ✭ 125 (+495.24%)
MlrMachine Learning in R
Stars: ✭ 1,542 (+7242.86%)
laravel-rolloutA package to integrate rollout into your Laravel project.
Stars: ✭ 23 (+9.52%)
GraphOfDocsGraphOfDocs: Representing multiple documents as a single graph
Stars: ✭ 13 (-38.1%)
autogbt-altAn experimental Python package that reimplements AutoGBT using LightGBM and Optuna.
Stars: ✭ 76 (+261.9%)
fast retrainingShow how to perform fast retraining with LightGBM in different business cases
Stars: ✭ 56 (+166.67%)
datascienvdatascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (+152.38%)
AutoTabularAutomatic machine learning for tabular data. ⚡🔥⚡
Stars: ✭ 51 (+142.86%)
qbso-fsPython implementation of QBSO-FS : a Reinforcement Learning based Bee Swarm Optimization metaheuristic for Feature Selection problem.
Stars: ✭ 47 (+123.81%)
FEASTA FEAture Selection Toolbox for C/C+, Java, and Matlab/Octave.
Stars: ✭ 67 (+219.05%)
exemplary-ml-pipelineExemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (+9.52%)
kaggle-plasticcSolution to Kaggle's PLAsTiCC Astronomical Classification Competition
Stars: ✭ 50 (+138.1%)
stackgbm🌳 Stacked Gradient Boosting Machines
Stars: ✭ 24 (+14.29%)
py ml utilsPython utilities for Machine Learning competitions
Stars: ✭ 29 (+38.1%)
50-days-of-Statistics-for-Data-ScienceThis repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded in this repository.
Stars: ✭ 19 (-9.52%)
adaptAwesome Domain Adaptation Python Toolbox
Stars: ✭ 46 (+119.05%)
dominance-analysisThis package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+428.57%)
NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+3695.24%)
Apartment-Interest-PredictionPredict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text and images.
Stars: ✭ 17 (-19.05%)
KaggleKaggle Kernels (Python, R, Jupyter Notebooks)
Stars: ✭ 26 (+23.81%)
Kaggle-Competition-SberbankTop 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-housing-market
Stars: ✭ 31 (+47.62%)
neptune-client📒 Experiment tracking tool and model registry
Stars: ✭ 348 (+1557.14%)
mltbMachine Learning Tool Box
Stars: ✭ 25 (+19.05%)
mrmrmRMR (minimum-Redundancy-Maximum-Relevance) for automatic feature selection at scale.
Stars: ✭ 170 (+709.52%)
featurewizUse advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (+990.48%)
L0LearnEfficient Algorithms for L0 Regularized Learning
Stars: ✭ 74 (+252.38%)
LightGBM.jlLightGBM.jl provides a high-performance Julia interface for Microsoft's LightGBM.
Stars: ✭ 40 (+90.48%)
BallStatistical Inference and Sure Independence Screening via Ball Statistics
Stars: ✭ 22 (+4.76%)
zoofszoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.
Stars: ✭ 142 (+576.19%)
skrobotskrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (+4.76%)
lightgbmExplainerAn R package that makes lightgbm models fully interpretable (take reference from https://github.com/AppliedDataSciencePartners/xgboostExplainer)
Stars: ✭ 22 (+4.76%)
bessBest Subset Selection algorithm for Regression, Classification, Count, Survival analysis
Stars: ✭ 14 (-33.33%)
stgPython/R library for feature selection in neural nets. ("Feature selection using Stochastic Gates", ICML 2020)
Stars: ✭ 47 (+123.81%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+33.33%)
ml-pipelineUsing Kafka-Python to illustrate a ML production pipeline
Stars: ✭ 90 (+328.57%)
JLBoost.jlA 100%-Julia implementation of Gradient-Boosting Regression Tree algorithms
Stars: ✭ 65 (+209.52%)
recsys2019The complete code and notebooks used for the ACM Recommender Systems Challenge 2019
Stars: ✭ 26 (+23.81%)
fsfcFeature Selection for Clustering
Stars: ✭ 80 (+280.95%)
Stock-Selection-a-FrameworkThis project demonstrates how to apply machine learning algorithms to distinguish "good" stocks from the "bad" stocks.
Stars: ✭ 239 (+1038.1%)
docker-kaggle-ko머신러닝/딥러닝(PyTorch, TensorFlow) 전용 도커입니다. 한글 폰트, 한글 자연어처리 패키지(konlpy), 형태소 분석기, Timezone 등의 설정 등을 추가 하였습니다.
Stars: ✭ 46 (+119.05%)
feature engineFeature engineering package with sklearn like functionality
Stars: ✭ 758 (+3509.52%)
TextFeatureSelectionPython library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (+100%)
GPScode for "A global pathway selection algorithm for the reduction of detailed chemical kinetic mechanisms" (Gao et al., CNF'16)
Stars: ✭ 18 (-14.29%)
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+15876.19%)
lleavesCompiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.
Stars: ✭ 132 (+528.57%)
Market-Mix-ModelingMarket Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+47.62%)
CASTDeveloper Version of the R package CAST: Caret Applications for Spatio-Temporal models
Stars: ✭ 65 (+209.52%)
mlforecastScalable machine 🤖 learning for time series forecasting.
Stars: ✭ 96 (+357.14%)