Machine Learning With Python - Practice and tutorial-style notebooks covering a wide variety of machine learning techniques
Stars: ✭ 2,197 (-61.16%)
Dat8 - General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (-73.2%)
Spark With Python - Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-97.35%)
Griffon Vm - Griffon Data Science Virtual Machine
Stars: ✭ 128 (-97.74%)
Spark Py Notebooks - Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-76.34%)
Data Science Ipython Notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+289.82%)
Benchm Ml - A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (-67.56%)
Rsparkling - RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-98.85%)
User Machine Learning Tutorial - useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
Stars: ✭ 393 (-93.05%)
Hyperlearn - 50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-78.71%)
Optimus - 🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-82.57%)
Pixiedust - Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (-82.36%)
W2v - Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-98.87%)
Mli Resources - H2O.ai Machine Learning Interpretability Resources
Stars: ✭ 428 (-92.43%)
Python Bigdata - Data science and Big Data with Python
Stars: ✭ 112 (-98.02%)
Cortx - CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-92.47%)
H2o Tutorials - Tutorials and training material for the H2O Machine Learning Platform
Stars: ✭ 1,305 (-76.93%)
Courses - Quizzes and assignments from Coursera courses
Stars: ✭ 454 (-91.97%)
Flaml - A fast and lightweight AutoML library.
Stars: ✭ 205 (-96.38%)
Gwu data mining - Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-96.16%)
Machine-Learning-Models - In this repository I implement machine learning methods, from simple to complex, and try to build template-style code.
Stars: ✭ 30 (-99.47%)
Mlj.jl - A Julia machine learning framework
Stars: ✭ 982 (-82.64%)
Spark Movie Lens - An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-86.83%)
25daysinmachinelearning - I will keep updating this repository with content and materials for learning machine learning and statistics with Python
Stars: ✭ 53 (-99.06%)
H1st - The AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery mfg, avionics, automotive cybersecurity, and more.
Stars: ✭ 697 (-87.68%)
Agile data code 2 - Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-92.7%)
Spark R Notebooks - R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-98.07%)
Isl Python - Solutions to labs and exercises from An Introduction to Statistical Learning, as Jupyter Notebooks.
Stars: ✭ 108 (-98.09%)
Pythondata - Repo for code published on pythondata.com
Stars: ✭ 113 (-98%)
bigdata-fun - A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-99.75%)
Ray - An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Stars: ✭ 18,547 (+227.92%)
scoruby - Ruby Scoring API for PMML
Stars: ✭ 69 (-98.78%)
HyperGBM - A full pipeline AutoML tool for tabular data
Stars: ✭ 172 (-96.96%)
Interpretable machine learning with python - Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (-90.63%)
Mydatascienceportfolio - Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-95.99%)
Datasciencevm - Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-97.29%)
aut - The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-98.04%)
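leaflet heatmap - A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap-drawing step is moved to offline computation and analysis. After the data is computed in parallel with Apache Spark, Apache Spark is also used to draw the heatmap, and leafletjs then loads the OpenStreetMap layer and the heatmap layer for good interactivity. The drawing is currently implemented with Apache Spark; perhaps because Apache Spark is not well suited to this kind of computation, or because I did not design the algorithm well, the parallel computation is slower than single-machine computation. The Apache Spark heatmap drawing and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .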
Stars: ✭ 13 (-99.77%)
Pba - Efficient Learning of Augmentation Policy Schedules
Stars: ✭ 461 (-91.85%)
Autogluon - AutoGluon: AutoML for Text, Image, and Tabular Data
Stars: ✭ 3,920 (-30.69%)
Adanet - Fast and flexible AutoML with learning guarantees.
Stars: ✭ 3,340 (-40.95%)
Pycaret - An open-source, low-code machine learning library in Python
Stars: ✭ 4,594 (-18.78%)
Scanner - Efficient video analysis at scale
Stars: ✭ 569 (-89.94%)
Automlpipeline.jl - A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (-96.06%)
Koalas - Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (-46.18%)
Ml Workspace - 🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (-58.68%)
Trino - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (-19.01%)