CockroachCockroachDB - the open source, cloud-native distributed SQL database.
Stars: ✭ 22,700 (+116.64%)
GpdbGreenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Stars: ✭ 4,928 (-52.97%)
Best Of Ml Python🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Stars: ✭ 6,057 (-42.19%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (-52.7%)
TeachingTeaching Materials for Dr. Waleed A. Yousef
Stars: ✭ 435 (-95.85%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (-94.97%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (-95.07%)
CorfudbA cluster consistency platform
Stars: ✭ 539 (-94.86%)
Facebook data analyzerAnalyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more
Stars: ✭ 515 (-95.08%)
Yugabyte DbThe high-performance distributed SQL database for global, internet-scale apps.
Stars: ✭ 5,890 (-43.79%)
Facebook ArchiveJust some fun you can have with facebook's archive data
Stars: ✭ 63 (-99.4%)
Isp Data PollutionISP Data Pollution to Protect Private Browsing History with Obfuscation
Stars: ✭ 425 (-95.94%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (-42.53%)
FakerFaker is a pure Elixir library for generating fake data.
Stars: ✭ 673 (-93.58%)
Octo CliCLI tool to expose data from any database as a serverless web service.
Stars: ✭ 653 (-93.77%)
EventqlDistributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (-89.3%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (-92.95%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-93.28%)
EngsoccerdataEnglish and European soccer results 1871-2020
Stars: ✭ 615 (-94.13%)
LpfmpointsEvolution of LPFM Stations
Stars: ✭ 19 (-99.82%)
Mithril DataA rich data model library for Mithril javascript framework
Stars: ✭ 17 (-99.84%)
GraphGraph is a semantic database that is used to create data-driven applications.
Stars: ✭ 855 (-91.84%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-99.75%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-96.06%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-90.59%)
BubblyA python package for plotting animated and interactive bubble charts using Plotly
Stars: ✭ 37 (-99.65%)
SoccergraphrSoccer Analytics in R using OPTA data
Stars: ✭ 42 (-99.6%)
DataconfsA list of conferences connected with data worldwide.
Stars: ✭ 36 (-99.66%)
RqliteThe lightweight, distributed relational database built on SQLite
Stars: ✭ 9,147 (-12.7%)
Ds and ml projectsData Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (-99.47%)
SaberWindow-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-99.67%)
VerticapyVerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-99.44%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-87.23%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-99.45%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (-90.77%)
MuzeComposable data visualisation library for web with a data-first approach now powered by WebAssembly
Stars: ✭ 1,153 (-89%)
SeabornStatistical data visualization in Python
Stars: ✭ 9,007 (-14.04%)
Etl with pythonETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-99.35%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-99.3%)
MagicboxA platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-99.3%)
Ac D3Javascript Library for building Audiovisual Charts in D3
Stars: ✭ 76 (-99.27%)
AethosAutomated Data Science and Machine Learning library to optimize workflow.
Stars: ✭ 94 (-99.1%)
MachineMachine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-99.26%)
LivechartAndroid library to draw beautiful and rich line charts.
Stars: ✭ 78 (-99.26%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (-88.27%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-88.18%)
NycdbDatabase of NYC Housing Data
Stars: ✭ 94 (-99.1%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-99.34%)
DeveeldbDeveelDB is a complete SQL database system, primarly developed for .NET/Mono frameworks
Stars: ✭ 80 (-99.24%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-99.23%)
DatabenchData analysis tool.
Stars: ✭ 82 (-99.22%)