All Projects → jadianes → Data Science Your Way

jadianes / Data Science Your Way

Licence: other
Ways of doing Data Science Engineering and Machine Learning in R and Python

Programming Languages

python
139335 projects - #7 most used programming language
r
7636 projects

Projects that are alternatives of or similar to Data Science Your Way

Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-79.43%)
Mutual labels:  jupyter-notebook, data-science, jupyter, notebook, exploratory-data-analysis
Intro To Python
An intro to Python & programming for wanna-be data scientists
Stars: ✭ 536 (+1.13%)
Mutual labels:  jupyter-notebook, data-science, jupyter, tutorial
Quantitative Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-32.83%)
Mutual labels:  jupyter-notebook, data-science, jupyter, notebook
Nteract
📘 The interactive computing suite for you! ✨
Stars: ✭ 5,713 (+977.92%)
Mutual labels:  jupyter-notebook, data-science, jupyter, notebook
Jupytemplate
Templates for jupyter notebooks
Stars: ✭ 85 (-83.96%)
Mutual labels:  jupyter-notebook, data-science, jupyter, notebook
Allstate capstone
Allstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-86.42%)
Mutual labels:  jupyter-notebook, data-science, notebook, tutorial
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+1471.51%)
Mutual labels:  jupyter-notebook, data-science, jupyter, exploratory-data-analysis
Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (-69.25%)
Mutual labels:  jupyter-notebook, data-science, jupyter, tutorial
Hands On Nltk Tutorial
The hands-on NLTK tutorial for NLP in Python
Stars: ✭ 419 (-20.94%)
Mutual labels:  jupyter-notebook, jupyter, notebook, tutorial
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+459.62%)
Mutual labels:  data-science, jupyter, notebook
Gophernotes
The Go kernel for Jupyter notebooks and nteract.
Stars: ✭ 3,100 (+484.91%)
Mutual labels:  jupyter-notebook, data-science, jupyter
Code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-45.85%)
Mutual labels:  jupyter-notebook, data-science, exploratory-data-analysis
Datavisualization
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Stars: ✭ 234 (-55.85%)
Mutual labels:  jupyter-notebook, exploratory-data-analysis, tutorial
Beakerx
Beaker Extensions for Jupyter Notebook
Stars: ✭ 2,594 (+389.43%)
Mutual labels:  jupyter-notebook, jupyter, notebook
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (-47.55%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Applied Reinforcement Learning
Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-56.79%)
Mutual labels:  jupyter-notebook, jupyter, notebook
Scikit Learn Videos
Jupyter notebooks from the scikit-learn video series
Stars: ✭ 3,254 (+513.96%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Julia Dataframes Tutorial
A tutorial on Julia DataFrames package
Stars: ✭ 318 (-40%)
Mutual labels:  jupyter-notebook, tutorial, data-frame
Functional intro to python
[tutorial]A functional, Data Science focused introduction to Python
Stars: ✭ 228 (-56.98%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Tensorwatch
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Stars: ✭ 3,191 (+502.08%)
Mutual labels:  jupyter-notebook, data-science, jupyter

Data Science Engineering, your way

Join the chat at https://gitter.im/jadianes/data-science-your-way

An introduction to different Data Science engineering concepts and Applications using Python and R

These series of tutorials on Data Science engineering will try to compare how different concepts in the discipline can be implemented in the two dominant ecosystems nowadays: R and Python.

We will do this from a neutral point of view. Our opinion is that each environment has good and bad things, and any data scientist should know how to use both in order to be as prepared as posible for job market or to start personal project.

To get a feeling of what is going on regarding this hot topic, we refer the reader to DataCamp's Data Science War infographic. Their infographic explores what the strengths of R are over Python and vice versa, and aims to provide a basic comparison between these two programming languages from a data science and statistics perspective.

Far from being a repetition from the previous, our series of tutorials will go hands-on into how to actually perform different data science taks such as working with data frames, doing aggregations, or creating different statistical models such in the areas of supervised and unsupervised learning.

We will use real-world datasets, and we will build some real data products. This will help us to quickly transfer what we learn here to actual data analysis situations.

If your are interested in Big Data products, then you might find interesting our series of tutorials on using Apache Spark and Python or using R on Apache Spark (SparkR).

Tutorials

This is a growing list of tutorials explaining concepts and applications in Python and R.

Introduction to Data Frames

An introduction to the basic data structure and how to use it in Python/Pandas and R.

Exploratory Data Analysis

About this important task in any data science engineering project.

Dimensionality Reduction and Clustering

About using Principal Component Analysis and k-means Clustering to better represent and understand our data.

Text Mining and Sentiment Classification

How to use text mining techniques to analyse the positive or non-positive sentiment of text documents using just linear methods.

Applications

These are some of the applications we have built using the concepts explained in the tutorials.

A web-based Sentiment Classifier using R and Shiny

How to build a web applications where we can upload text documents to be sentiment-analysed using the R-based framework Shiny.

Building Data Products with Python

Using a wine reviews and recommendations website as a leitmotif, this series of tutorials, with its own separate repository tagged by lessons, digs into how to use Python technologies such as Django, Pandas, or Scikit-learn, in order to build data products.

Red Wine Quality Data analysis with R

Using R and ggplot2, we perform Exploratory Data Analysis of this reference dataset about wine quality.

Information Retrieval algorithms with Python

Where we show our own implementation of a couple of Information Retrieval algorithms: vector space model, and tf-idf.

Kaggle - The Analytics Edge (Spring 2015)

My solution to this Kaggle competition. It was part of the edX MOOC The Analitics Edge. I highly recommend this on-line course. It is one of the most applied I have ever taken about using R for data anlysis and machine learning.

Contributing

Contributions are welcome! For bug reports or requests please submit an issue.

Contact

Feel free to contact me to discuss any issues, questions, or comments.

License

This repository contains a variety of content; some developed by Jose A. Dianes, and some from third-parties. The third-party content is distributed under the license provided by those parties.

The content developed by Jose A. Dianes is distributed under the following license:

Copyright 2016 Jose A Dianes

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].