datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+1576%)
DatatableA Python package for manipulating 2-dimensional tabular data structures
Stars: ✭ 1,166 (+4564%)
playgroundA place to play programming
Stars: ✭ 21 (-16%)
xarray-beamDistributed Xarray with Apache Beam
Stars: ✭ 83 (+232%)
kobe-every-shot-everA Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+164%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+380%)
js-symbol-treeTurn any collection of objects into its own efficient tree or linked list using Symbol
Stars: ✭ 86 (+244%)
CS basicsMy CS learning : algorithm, data structure, and system design | #SE
Stars: ✭ 21 (-16%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+140%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+5304%)
labplotLabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
Stars: ✭ 107 (+328%)
mlforecastScalable machine 🤖 learning for time series forecasting.
Stars: ✭ 96 (+284%)
dask-pytorch-ddpdask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
Stars: ✭ 50 (+100%)
mercuryMercury - data visualize and discovery with Javascript, such as apache zeppelin and jupyter
Stars: ✭ 29 (+16%)
linked-blocking-multi-queueA concurrent collection that extends the existing Java concurrent collection library, offering an optionally-bounded blocking "multi-queue" based on linked nodes.
Stars: ✭ 41 (+64%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+544%)
trieTrie (a.k.a. prefix tree) C# implementation. Has constant-time string prefix lookup.
Stars: ✭ 84 (+236%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-36%)
CPCompetitive Coding
Stars: ✭ 25 (+0%)
array-keyed-mapJS datastructure, like Map, but the keys are arrays
Stars: ✭ 29 (+16%)
Go PatriciaA generic patricia trie (also called radix tree) implemented in Go (Golang)
Stars: ✭ 243 (+872%)
UliEngineeringA python library for calculations perfomed in electronics engineering
Stars: ✭ 35 (+40%)
DenormalizrDenormalize data normalized with normalizr
Stars: ✭ 231 (+824%)
Bitcoin Analysis-Python Bitcoin is widely used cryptocurrency for digital market. It is decentralised that means it is not own by government or any other company.Transactions are simple and easy as it doesn’t belong to any country.Records data are stored in Blockchain.Bitcoin price is variable and it is widely used so it is important to predict the price of it f…
Stars: ✭ 42 (+68%)
DatscanDatScan is an initiative to build an open-source CMS that will have the capability to solve any problem using data Analysis just with the help of various modules and a vast standardized module library
Stars: ✭ 13 (-48%)
datajoint-pythonRelational data pipelines for the science lab
Stars: ✭ 140 (+460%)
python-notebooksA collection of Jupyter Notebooks used in conferences or just to have some snippets.
Stars: ✭ 14 (-44%)
heidiheidi : tidy data in Haskell
Stars: ✭ 24 (-4%)
nanostackSmall middleware stack library
Stars: ✭ 39 (+56%)
transbigdataA Python package develop for transportation spatio-temporal big data processing, analysis and visualization.
Stars: ✭ 195 (+680%)
prefect-saturnPython client for using Prefect Cloud with Saturn Cloud
Stars: ✭ 15 (-40%)
CC33ZCurso de Ciência da Computação
Stars: ✭ 50 (+100%)
daskperimentReproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.
Stars: ✭ 25 (+0%)
spectrochempySpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (+36%)
dask-sqlDistributed SQL Engine in Python using Dask
Stars: ✭ 271 (+984%)
hacking-datascienceNotebooks and design assets related to my publication 'hacking-datascience' on Medium.
Stars: ✭ 41 (+64%)
lazycluster🎛 Distributed machine learning made simple.
Stars: ✭ 43 (+72%)
CPTH🌟 Competitive Programming Template Headers | With documentation, CI tests and Codecov
Stars: ✭ 23 (-8%)
tempoAPI for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+748%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-24%)
qhub🪴 Nebari - your open source data science platform
Stars: ✭ 175 (+600%)
website-oldThe Frictionless Data website.
Stars: ✭ 31 (+24%)
coiled-resourcesNotebooks that support blog posts and tech talks on Dask / Coiled.
Stars: ✭ 33 (+32%)
hnnThe Human Neocortical Neurosolver (HNN) is a software tool that gives researchers/clinicians the ability to develop/test hypotheses on circuit mechanisms underlying EEG/MEG data.
Stars: ✭ 62 (+148%)
graphchain⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (+152%)
plottrA flexible plotting and data analysis tool.
Stars: ✭ 32 (+28%)
StaticvecImplements a fixed-capacity stack-allocated Vec alternative backed by an array, using const generics.
Stars: ✭ 236 (+844%)
nebulaA distributed block-based data storage and compute engine
Stars: ✭ 127 (+408%)
C Macro CollectionsEasy to use, header only, macro generated, generic and type-safe Data Structures in C
Stars: ✭ 192 (+668%)
antzANTz immersive 3D data visualization engine
Stars: ✭ 25 (+0%)
copulaeMultivariate data modelling with Copulas in Python
Stars: ✭ 96 (+284%)