This repository is my attempt to help aspiring data scientists gain the data visualization skills they need to progress in their careers. It includes every type of plot offered by Seaborn, applied to random datasets.

Stars: ✭ 114 (-91.48%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Pandas Videos

Jupyter notebook and datasets from the pandas Q&A video series

Stars: ✭ 1,716 (+28.25%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Show ast

An IPython notebook plugin for visualizing ASTs.

Stars: ✭ 76 (-94.32%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

Youtube Like Predictor

YouTube Like Count Predictions using Machine Learning

Stars: ✭ 137 (-89.76%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (-90.43%)

Mutual labels: jupyter-notebook, data-science, big-data

Data Science Portfolio

A Portfolio of my Data Science Projects

Stars: ✭ 149 (-88.86%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Mmlspark

Simple and Distributed Machine Learning

Stars: ✭ 2,899 (+116.67%)

Mutual labels: spark, pyspark, big-data

Scalable Data Science Platform

Content for architecting a data science platform for products using Luigi, Spark & Flask.

Stars: ✭ 158 (-88.19%)

Mutual labels: jupyter-notebook, data-science, spark

Handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

Stars: ✭ 158 (-88.19%)

Mutual labels: jupyter-notebook, spark, pyspark

Data Science Resources

👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

Stars: ✭ 171 (-87.22%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Data Science Your Way

Ways of doing Data Science Engineering and Machine Learning in R and Python

Stars: ✭ 530 (-60.39%)

Mutual labels: jupyter-notebook, data-science, notebook

Digital Signal Processing Lecture

Digital Signal Processing - Theory and Computational Examples

Stars: ✭ 532 (-60.24%)

Mutual labels: ipython, jupyter-notebook, notebook

Amazing Feature Engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

Stars: ✭ 218 (-83.71%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Spark Practice

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (-85.05%)

Mutual labels: jupyter-notebook, spark, pyspark

Ipyexperiments

jupyter/ipython experiment containers for GPU and general RAM re-use

Stars: ✭ 128 (-90.43%)

Mutual labels: ipython, jupyter-notebook, notebook

Pachyderm

Reproducible Data Science at Scale!

Stars: ✭ 5,305 (+296.49%)

Mutual labels: data-science, data-analysis, big-data

Ipywebrtc

WebRTC for Jupyter notebook/lab

Stars: ✭ 171 (-87.22%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

Pysparkgeoanalysis

🌐 Interactive Workshop on GeoAnalysis using PySpark

Stars: ✭ 63 (-95.29%)

Mutual labels: jupyter-notebook, spark, pyspark

Signals And Systems Lecture

Continuous- and Discrete-Time Signals and Systems - Theory and Computational Examples

Stars: ✭ 166 (-87.59%)

Mutual labels: ipython, jupyter-notebook, notebook

Notebooks Statistics And Machinelearning

Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog

Stars: ✭ 270 (-79.82%)

Mutual labels: jupyter-notebook, data-science, ipython-notebook

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (-91.7%)

Mutual labels: big-data, spark, pyspark

Datascience course

Data Science course in Portuguese

Stars: ✭ 294 (-78.03%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Cortx

CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.

Stars: ✭ 426 (-68.16%)

Mutual labels: jupyter-notebook, big-data, bigdata

Tutorials

CatBoost tutorials repository

Stars: ✭ 563 (-57.92%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

Notebooks

A collection of Jupyter/IPython notebooks

Stars: ✭ 78 (-94.17%)

Mutual labels: jupyter-notebook, data-science, ipython-notebook

Datacamp

🍧 A repository that contains courses I have taken on DataCamp

Stars: ✭ 69 (-94.84%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Zat

Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark

Stars: ✭ 303 (-77.35%)

Mutual labels: jupyter-notebook, spark, data-analysis

Nbstripout

strip output from Jupyter and IPython notebooks

Stars: ✭ 738 (-44.84%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

Vscodejupyter

Jupyter for Visual Studio Code

Stars: ✭ 337 (-74.81%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

The Elements Of Statistical Learning Python Notebooks

A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book

Stars: ✭ 405 (-69.73%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.

Stars: ✭ 3,081 (+130.27%)

Mutual labels: data-science, spark, notebook

Hyperlearn

50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster

Stars: ✭ 1,204 (-10.01%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+1547.83%)

Mutual labels: data-science, spark, big-data

Jupyter pivottablejs

Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js

Stars: ✭ 428 (-68.01%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Jupyterlab Lsp

Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol

Stars: ✭ 796 (-40.51%)

Mutual labels: ipython, jupyter-notebook, notebook

Jupytemplate

Templates for jupyter notebooks

Stars: ✭ 85 (-93.65%)

Mutual labels: jupyter-notebook, data-science, notebook

Spark Tdd Example

A simple Spark TDD example

Stars: ✭ 23 (-98.28%)

Mutual labels: jupyter-notebook, spark, pyspark

Skdata

Python tools for data analysis

Stars: ✭ 16 (-98.8%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Pyspark Setup Demo

Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks

Stars: ✭ 24 (-98.21%)

Mutual labels: jupyter-notebook, big-data, pyspark

Dataflowjavasdk

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Stars: ✭ 854 (-36.17%)

Mutual labels: data-science, data-analysis, big-data

Sciblog support

Support content for my blog

Stars: ✭ 694 (-48.13%)

Mutual labels: jupyter-notebook, data-science, big-data

Ipython Dashboard

A standalone, lightweight web server for building and sharing graphs created in IPython. Built for data science and data analysis practitioners, it aims to provide interactive visualization, collaborative dashboards, and real-time streaming graphs.

Stars: ✭ 664 (-50.37%)

Mutual labels: ipython, data-science, notebook

Pandas Profiling

Create HTML profiling reports from pandas DataFrame objects

Stars: ✭ 8,329 (+522.5%)

Mutual labels: jupyter-notebook, data-science, data-analysis

Ansible Jupyterhub

Ansible role to set up a JupyterHub server (deprecated)

Stars: ✭ 14 (-98.95%)

Mutual labels: ipython, jupyter-notebook, ipython-notebook

Lambdaschooldatascience

Completed assignments and coding challenges from the Lambda School Data Science program.