tutorialsShort programming tutorials pertaining to data analysis.
Infinite Stories with DataThis repo consists of my analysis of random datasets using various statistical and visualization techniques.
DataProfilerWhat's in your data? Extract schema, statistics and entities from datasets
ttbbeerAn R Dataset Package for US Beer Statistics From TTB 🍺
ipaddressData analysis of IP addresses and networks
ipychartThe power of Chart.js with Python
dsrIntroduction to Data Science with R (2017)
dask-awkwardNative Dask collection for awkward arrays, and the library to use it.
antzANTz immersive 3D data visualization engine
transbigdataA Python package developed for transportation spatio-temporal big data processing, analysis and visualization.
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
hnnThe Human Neocortical Neurosolver (HNN) is a software tool that gives researchers/clinicians the ability to develop/test hypotheses on circuit mechanisms underlying EEG/MEG data.
copulaeMultivariate data modelling with Copulas in Python
python-notebooksA collection of Jupyter Notebooks used in conferences or just to have some snippets.
spectrochempySpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Data-Science-101Notes and tutorials on how to use Python, pandas, seaborn, NumPy, matplotlib, and SciPy for data science.
nebulaA distributed block-based data storage and compute engine
DatscanDatScan is an initiative to build an open-source CMS capable of solving any problem using data analysis, with the help of various modules and a vast standardized module library
heidiheidi : tidy data in Haskell
CC33ZComputer Science course
mercuryMercury - data visualization and discovery with JavaScript, similar to Apache Zeppelin and Jupyter
hacking-datascienceNotebooks and design assets related to my publication 'hacking-datascience' on Medium.
tempoAPI for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
whyqddata wrangling simplicity, complete audit transparency, and at speed
PracticalMachineLearningA collection of ML-related material including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not as in free food) or open source.
plottrA flexible plotting and data analysis tool.
UliEngineeringA Python library for calculations performed in electronics engineering
Bitcoin Analysis-Python Bitcoin is a widely used cryptocurrency in the digital market. It is decentralised, meaning it is not owned by a government or any company. Transactions are simple and easy because it doesn't belong to any country. Records are stored in the blockchain. The Bitcoin price is variable and it is widely used, so it is important to predict the price of it f…
labplotLabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
datastationApp to easily query, script, and visualize data from every database, file, and API.
arpesMirror of PyARPES (gitlab/lanzara-group/python-arpes), the open source ARPES analysis framework
R-data-wranglingMaterials for my R data workshop. https://cengel.github.io/R-data-wrangling/
python-eodhistoricaldataDownload data from EOD historical data https://eodhistoricaldata.com/ using Python, Requests and Pandas.
miniqlA tiny JSON-based query language inspired by GraphQL
mlgaugeA simple library to benchmark the performance of machine learning methods across different datasets.
cellpyextract and tweak data from electrochemical tests of cells
radarWe are moving to GitLab: https://gitlab.com/radar-parlamentar/radar.
ML-CaPsuleML-capsule is a project for beginners and experienced data science enthusiasts who don't have a mentor or guidance and wish to learn machine learning. Using our repo, they can learn ML, DL, and many related technologies through real-world projects and become interview-ready.
nbodykitAnalysis kit for large-scale structure datasets, the massively parallel way
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
dataframeStructured data processing in Kotlin
bowGo data analysis / manipulation library built on top of Apache Arrow
icdTools for working with ICD codes and comorbidities
bruceR📦 BRoadly Useful Convenient and Efficient R functions that BRing Users Concise and Elegant R data analyses.
datartDatart is a next generation Data Visualization Open Platform