All Projects → CermakM → Jupyter Datatables

CermakM / Jupyter Datatables

Licence: mit
Jupyter Notebook extension leveraging pandas DataFrames by integrating DataTables and ChartJS.

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Jupyter Datatables

Alphalens
Performance analysis of predictive (alpha) stock factors
Stars: ✭ 2,130 (+1577.17%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Choochoo
Training Diary
Stars: ✭ 186 (+46.46%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Cheatsheets.pdf
📚 Various cheatsheets in PDF
Stars: ✭ 159 (+25.2%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Py
Repository to store sample python programs for python learning
Stars: ✭ 4,154 (+3170.87%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Or Pandas
【运筹OR帷幄|数据科学】pandas教程系列电子书
Stars: ✭ 492 (+287.4%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Opendatawrangling
공공데이터 분석
Stars: ✭ 148 (+16.54%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (+28.35%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1370.87%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Geopython
Notebooks and libraries for spatial/geo Python explorations
Stars: ✭ 268 (+111.02%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Jupyter Tips And Tricks
Using Project Jupyter for data science.
Stars: ✭ 245 (+92.91%)
Mutual labels:  jupyter-notebook, jupyter, pandas
California Coronavirus Data
The Los Angeles Times' independent tally of coronavirus cases in California.
Stars: ✭ 188 (+48.03%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-84.25%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+6458.27%)
Mutual labels:  jupyter-notebook, jupyter, pandas
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-72.44%)
Mutual labels:  jupyter-notebook, jupyter, pandas
100 Pandas Puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+988.19%)
Mutual labels:  jupyter-notebook, pandas
Sigmoidal ai
Tutoriais de Python, Data Science, Machine Learning e Deep Learning - Sigmoidal
Stars: ✭ 103 (-18.9%)
Mutual labels:  jupyter-notebook, pandas
Prml
PRML algorithms implemented in Python
Stars: ✭ 10,206 (+7936.22%)
Mutual labels:  jupyter-notebook, jupyter
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-14.17%)
Mutual labels:  jupyter-notebook, jupyter
Scipy2017 Jupyter Widgets Tutorial
Notebooks for the SciPy 2017 tutorial "The Jupyter Interactive Widget Ecosystem"
Stars: ✭ 102 (-19.69%)
Mutual labels:  jupyter-notebook, jupyter
Nb pdf template
A more accurate representation of jupyter notebooks when converting to pdfs.
Stars: ✭ 109 (-14.17%)
Mutual labels:  jupyter-notebook, jupyter

Jupyter DataTables

Jupyter Notebook extension to leverage pandas DataFrames by integrating DataTables JS.


About

Data scientists and in fact many developers work with pd.DataFrame on daily basis to interpret data to process them. In my typical workflow. The common workflow is to display the dataframe, take a look at the data schema and then produce multiple plots to check the distribution of the data to have a clearer picture, perhaps search some data in the table, etc...

What if those distribution plots were part of the standard DataFrame and we had the ability to quickly search through the table with minimal effort? What if it was the default representation?

The jupyter-datatables uses jupyter-require to draw the table.


Installation

pip install jupyter-datatables

Usage

import numpy as np
import pandas as pd

from jupyter_datatables import init_datatables_mode

init_datatables_mode()

That's it, your default pandas representation will now use Jupyter DataTables!

df = pd.DataFrame(np.abs(np.random.randn(50, 5)), columns=list(string.ascii_uppercase[:5]))

Jupyter Datatables table representation


In most cases, you don't need to worry too much about the size of your data. Jupyter DataTables calculates required sample size based on a confidence interval (by default this would be 0.95) and margin of error and ceils it to the highest 'smart' value.

For example, for a data containing 100,000 samples, given 0.975 confidence interval and 0.02 margin of error, the Jupyter DataTables would calculate that 3044 samples are required and it would round it up to 4000.

Jupyter Datatables long table sample size

With additional note:

Sample size: 4,000 out of 100,000


We can also handle wide tables with ease.

df = pd.DataFrame(np.abs(np.random.randn(50, 20)), columns=list(string.ascii_uppercase[:20]))

Jupyter Datatables wide table representation


As per 0.3.0, there is a support for interactive tooltips:

Jupyter Datatables wide table representation

And also support for custom indices including Date type:

dft = pd.DataFrame({'A': np.random.rand(5),
                    'B': [1, 1, 3, 2, 1],
                    'C': 'This is a very long sentence that should automatically be trimmed',
                    'D': [pd.Timestamp('20010101'), pd.Timestamp('20010102'), pd.Timestamp('20010103'), pd.Timestamp('20010104'), pd.Timestamp('20010105')],
                    'E': pd.Series([1.0] * 5).astype('float32'),
                    'F': [False, True, False, False, True],
                   })

dft.D = dft.D.apply(pd.to_datetime)
dft.set_index('D', inplace=True)

Jupyter Datatables wide table representation



Current status and future plans:

Check out the Project Board where we track issues and TODOs for our Jupyter tooling!


Author: Marek Cermak [email protected], @AICoE

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].