KagglerCode for Kaggle Data Science Competitions
Stars: ✭ 614 (+3131.58%)
Kaggle-Quora-Question-PairsThis is our team's solution report, which achieves top 10% (305/3307) in this competition.
Stars: ✭ 58 (+205.26%)
Home Credit Default RiskDefault risk prediction for Home Credit competition - Fast, scalable and maintainable SQL-based feature engineering pipeline
Stars: ✭ 68 (+257.89%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (+352.63%)
Kaggle Quora Question PairsKaggle:Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)
Stars: ✭ 705 (+3610.53%)
NyaggleCode for Kaggle and Offline Competitions
Stars: ✭ 209 (+1000%)
question-pairA siamese LSTM to detect sentence/question pairs.
Stars: ✭ 25 (+31.58%)
kaggle-berlinMaterial of the Kaggle Berlin meetup group!
Stars: ✭ 36 (+89.47%)
Machine Learning Workflow With PythonThis is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+726.32%)
fastknnFast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (+236.84%)
LightautomlLAMA - automatic model creation framework
Stars: ✭ 196 (+931.58%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-21.05%)
digit recognizerCNN digit recognizer implemented in Keras Notebook, Kaggle/MNIST (0.995).
Stars: ✭ 27 (+42.11%)
EvolutionaryForestAn open source python library for automated feature engineering based on Genetic Programming
Stars: ✭ 56 (+194.74%)
PyData-Pseudolabelling-KeynoteAccompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)
Stars: ✭ 23 (+21.05%)
AutoTSAutomated Time Series Forecasting
Stars: ✭ 665 (+3400%)
Quantitative-Big-Imaging-2018(Latest semester at https://github.com/kmader/Quantitative-Big-Imaging-2019) The material for the Quantitative Big Imaging course at ETHZ for the Spring Semester 2018
Stars: ✭ 50 (+163.16%)
msdaLibrary for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector
Stars: ✭ 80 (+321.05%)
kaggledatasetsCollection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (+131.58%)
kaggle-camera-model-identificationCode for reproducing 2nd place solution for Kaggle competition IEEE's Signal Processing Society - Camera Model Identification
Stars: ✭ 64 (+236.84%)
zcaZCA whitening in python
Stars: ✭ 29 (+52.63%)
data-science-learning📊 All of courses, assignments, exercises, mini-projects and books that I've done so far in the process of learning by myself Machine Learning and Data Science.
Stars: ✭ 32 (+68.42%)
Data-Science-ProjectsData Science projects on various problem statements and datasets using Data Analysis, Machine Learning Algorithms, Deep Learning Algorithms, Natural Language Processing, Business Intelligence concepts by Python
Stars: ✭ 28 (+47.37%)
bisemanticText pair classification
Stars: ✭ 12 (-36.84%)
skrobotskrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (+15.79%)
AskQuoraQuora Q&A right from the command-line
Stars: ✭ 14 (-26.32%)
AutoXAutoX is an efficient automl tool, which is mainly aimed at data mining tasks with tabular data.
Stars: ✭ 431 (+2168.42%)
dominance-analysisThis package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+484.21%)
dku-kaggle-class단국대 SW중심대학 2020년도 오픈소스SW설계 - 캐글뽀개기 수업 일정 및 강의자료
Stars: ✭ 48 (+152.63%)
kdsb17Gaussian Mixture Convolutional AutoEncoder applied to CT lung scans from the Kaggle Data Science Bowl 2017
Stars: ✭ 18 (-5.26%)
Kaggle-Avito-NNThe 18th Place Solution to Avito Demand Prediction Challenge
Stars: ✭ 25 (+31.58%)
kaggle-champsCode for the CHAMPS Predicting Molecular Properties Kaggle competition
Stars: ✭ 49 (+157.89%)
NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Stars: ✭ 797 (+4094.74%)
anovosAnovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+305.26%)
Fill-the-GAP[ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle
Stars: ✭ 13 (-31.58%)
PubMed-Best-MatchMachine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+89.47%)
clinkClink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators that can be used in both C++ and Java runtime.
Stars: ✭ 24 (+26.32%)
fengfeng - feature engineering for machine-learning champions
Stars: ✭ 27 (+42.11%)
InstahelpInstahelp is a Q&A portal website similar to Quora
Stars: ✭ 21 (+10.53%)
Data-Science-ArticlesA collection of my blogs on Data Science and Machine learning.
Stars: ✭ 66 (+247.37%)
StoreItemDemand(117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.
Stars: ✭ 24 (+26.32%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+47.37%)