Allstate capstone: Allstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-96.08%)
Sci Pype: A machine learning API with native Redis caching and export and import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. It can run from a Docker container or directly from the repository.
Stars: ✭ 90 (-95.1%)
Cape Python: Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-93.19%)
Geni: A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-91.72%)
Chefboost: A lightweight decision tree framework for Python supporting regular algorithms: ID3, C4.5, CART, CHAID, and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest, and AdaBoost; with categorical feature support
Stars: ✭ 176 (-90.41%)
Pyspark Cheatsheet: 🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-94.11%)
Sigmoidal ai: Tutorials on Python, Data Science, Machine Learning, and Deep Learning - Sigmoidal
Stars: ✭ 103 (-94.39%)
Dtreeviz: A Python library for decision tree visualization and model interpretation.
Stars: ✭ 1,857 (+1.2%)
handson-ml: Jupyter notebooks containing the examples and exercises from the book "Hands-On Machine Learning".
Stars: ✭ 285 (-84.47%)
Eli5: A library for debugging/inspecting machine learning classifiers and explaining their predictions
Stars: ✭ 2,477 (+34.99%)
Auto ml: [UNMAINTAINED] Automated machine learning for analytics & production
Stars: ✭ 1,559 (-15.04%)
Spark Py Notebooks: Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-27.08%)
Python Bigdata: Data science and Big Data with Python
Stars: ✭ 112 (-93.9%)
Spark Alchemy: Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-93.35%)
Traffic: A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-92.48%)
Isolation Forest: A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-92.43%)
Autograd.jl: Julia port of the Python autograd package.
Stars: ✭ 147 (-91.99%)
Uncertainty Metrics: An easy-to-use interface for measuring uncertainty and robustness.
Stars: ✭ 145 (-92.1%)
Scilab: Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-92.48%)
Bodywork Core: Deploy machine learning projects developed in Python to Kubernetes. Accelerated MLOps 🚀
Stars: ✭ 145 (-92.1%)
Machine Learning And Data Science: This repository contains all my work related to Machine Learning, AI, and Data Science. This includes my graduate projects, machine learning competition code, algorithm implementations, and reading material.
Stars: ✭ 137 (-92.53%)
Testovoe: Home assignments for data science positions
Stars: ✭ 149 (-91.88%)
Quicksql: A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (-0.76%)
Accelerator: The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-92.53%)
Efficient Apriori: An efficient Python implementation of the Apriori algorithm.
Stars: ✭ 145 (-92.1%)
Apache Spark Node: Node.js bindings for Apache Spark DataFrame APIs
Stars: ✭ 136 (-92.59%)
Data Science Wg: SF Brigade's Data Science Working Group.
Stars: ✭ 135 (-92.64%)
Nd4j: Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (-5.07%)
2016 Ml Contest: Machine learning contest - October 2016 TLE
Stars: ✭ 135 (-92.64%)
Pandasschema: A validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-92.64%)
Horovod: Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+550.84%)
Qlik Py Tools: Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (-92.64%)
Labs: Labs for the Foundations of Applied Mathematics curriculum.
Stars: ✭ 150 (-91.83%)
Machine Learning: 🌎 Machine learning tutorials (mainly in Python 3)
Stars: ✭ 1,924 (+4.85%)
Beyond Jupyter: 🐍💻📊 All material from the PyCon.DE 2018 talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. slides, video, Udemy MOOC & other references)
Stars: ✭ 135 (-92.64%)
Hermione: ML made simple
Stars: ✭ 135 (-92.64%)
Scalable Data Science: Scalable Data Science, a set of courses on big data using Apache Spark on Databricks, and their mathematical, statistical, and computational foundations using SageMath.
Stars: ✭ 142 (-92.26%)
Blockchain2graph: Blockchain2graph extracts blockchain data (Bitcoin) and inserts it into a graph database (Neo4j).
Stars: ✭ 134 (-92.7%)
Accelerators: Data science and AI solution accelerator suite that provides templates for prototyping, reporting, and presenting data science analytics of specific domains
Stars: ✭ 134 (-92.7%)
M2cgen: Transform ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies
Stars: ✭ 1,962 (+6.92%)
Stumpy: STUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+10.03%)
Datasciencer: A curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (-5.89%)
Automl Gs: Provide an input CSV and a target field to predict, and generate a model plus the code to run it.
Stars: ✭ 1,766 (-3.76%)
Tntorch: Tensor Network Learning with PyTorch
Stars: ✭ 133 (-92.75%)
Project kojak: Training a Neural Network to Detect Gestures and Control Smart Home Devices with OpenCV in Python
Stars: ✭ 147 (-91.99%)
Pycwt: A Python module for continuous wavelet spectral analysis. It includes a collection of routines for wavelet transform and statistical analysis via the FFT algorithm. The module also includes cross-wavelet transforms, wavelet coherence tests, and sample scripts.
Stars: ✭ 146 (-92.04%)
Spark Authorizer: A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-92.32%)
Pecan: The Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox.
Stars: ✭ 132 (-92.81%)
Doddle Model: 🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (-92.26%)
Seq2seq tutorial: Code for the Medium article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-92.81%)
Rpy2: Interface to use R from Python
Stars: ✭ 132 (-92.81%)