Python BigdataData science and Big Data with Python
Stars: ✭ 112 (-88.78%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-93.59%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+466.73%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-58.62%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+34.07%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-77.25%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-1.2%)
Sigma coding youtubeThis is a collection of all the code that can be found on my YouTube channel Sigma Coding.
Stars: ✭ 611 (-38.78%)
Nteract📘 The interactive computing suite for you! ✨
Stars: ✭ 5,713 (+472.44%)
Zero To Mastery MlAll course materials for the Zero to Mastery Machine Learning and Data Science course.
Stars: ✭ 631 (-36.77%)
Fastai2Temporary home for fastai v2 while it's being developed
Stars: ✭ 630 (-36.87%)
H1stThe AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery mfg, avionics, automotive cybersecurity, and more.
Stars: ✭ 697 (-30.16%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-29.46%)
Machine learning refinedNotes, examples, and Python demos for the textbook "Machine Learning Refined" (published by Cambridge University Press).
Stars: ✭ 750 (-24.85%)
Industry Machine LearningA curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Stars: ✭ 6,077 (+508.92%)
CourseraQuiz & Assignment of Coursera
Stars: ✭ 774 (-22.44%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-98.4%)
Har Keras CoremlHuman Activity Recognition (HAR) with Keras and CoreML
Stars: ✭ 23 (-97.7%)
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (-43.99%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-45.79%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-36.57%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-36.57%)
TsfreshAutomatic extraction of relevant features from time series:
Stars: ✭ 6,077 (+508.92%)
JustenoughscalaforsparkA tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (-46.09%)
FeatexpFeature exploration for supervised learning
Stars: ✭ 688 (-31.06%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-25.35%)
Hitchhikers GuideThe Hitchhiker's Guide to Data Science for Social Good
Stars: ✭ 732 (-26.65%)
Intro To PythonAn intro to Python & programming for wanna-be data scientists
Stars: ✭ 536 (-46.29%)
LambdaschooldatascienceCompleted assignments and coding challenges from the Lambda School Data Science program.
Stars: ✭ 22 (-97.8%)
MdsModern Data Science
Stars: ✭ 19 (-98.1%)
ResourcesPyMC3 educational resources
Stars: ✭ 930 (-6.81%)
Tiledb VcfEfficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-97.39%)
Awesome Google ColabGoogle Colaboratory Notebooks and Repositories (by @firmai)
Stars: ✭ 863 (-13.53%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+734.57%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-13.43%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-98.6%)
Python for mlbrief introduction to Python for machine learning
Stars: ✭ 29 (-97.09%)
Mlnet WorkshopML.NET Workshop to predict car sales prices
Stars: ✭ 29 (-97.09%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (-4.41%)
Python TrainingPython training for business analysts and traders
Stars: ✭ 972 (-2.61%)
Intro PythonPython pour Statistique et Science des Données -- Syntaxe, Trafic de Données, Graphes, Programmation, Apprentissage
Stars: ✭ 21 (-97.9%)
Docker Iocaml DatascienceDockerfile of Jupyter (IPython notebook) and IOCaml (OCaml kernel) with libraries for data science and machine learning
Stars: ✭ 30 (-96.99%)