awesome-bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 11,093 (+5.87%)
Data Science Resources: You can learn about what data science is and why it's important in today's world. Are you interested in data science? 🔋
Stars: ✭ 171 (-98.37%)
Datasheets: Read data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-94.34%)
Graphia: A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-99.36%)
greycat: GreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (-99.01%)
Just Dashboard: 📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-85.58%)
Data Science Hacks: Tips and tricks to help you become a better data scientist, for all levels from beginner to advanced. The hacks cover Python, Jupyter Notebook, pandas, and so on.
Stars: ✭ 273 (-97.39%)
Linkedingiveaway: You can learn about anything over here: what giveaways I do and why they're important in today's world. Are you interested in giveaways? 🔋
Stars: ✭ 67 (-99.36%)
the-apache-ignite-book: All code samples, scripts and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above
Stars: ✭ 65 (-99.38%)
Chord: Python package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Stars: ✭ 217 (-97.93%)
Tennis Crystal Ball: Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-98.98%)
Tensorbase: TensorBase BE is building a high-performance, cloud-neutral big data warehouse for SMEs, fully in Rust.
Stars: ✭ 440 (-95.8%)
Trino: Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (-56.28%)
Datacleaner: The premier open source Data Quality solution
Stars: ✭ 391 (-96.27%)
Openrefine: OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (-18.58%)
Nsdb: Natural Series Database
Stars: ✭ 49 (-99.53%)
Superset: Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+306.89%)
Gspread Pandas: A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-97.84%)
Metabase: The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+155.8%)
Rqlite: The lightweight, distributed relational database built on SQLite
Stars: ✭ 9,147 (-12.7%)
Griddb: GridDB is a next-generation open source database that makes time series IoT and big data fast and easy.
Stars: ✭ 1,587 (-84.85%)
Interference: Open source distributed database with a base JPA implementation and event processing support
Stars: ✭ 57 (-99.46%)
Pycm: Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (-89.73%)
Datacomparer: dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-99.45%)
Traildb: TrailDB is an efficient tool for storing and querying series of events
Stars: ✭ 1,029 (-90.18%)
Pulsar Spark: When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-99.48%)
Soccergraphr: Soccer Analytics in R using OPTA data
Stars: ✭ 42 (-99.6%)
Ds and ml projects: Data Science & Machine Learning projects and tutorials in Python, from beginner to advanced level.
Stars: ✭ 56 (-99.47%)
Verticapy: VerticaPy is a Python library that exposes scikit-like functionality to conduct data science projects on data stored in Vertica, thus taking advantage of Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-99.44%)
Django Databrowse: Databrowse is a Django application that lets you browse your data.
Stars: ✭ 41 (-99.61%)
Eventql: Distributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (-89.3%)
Seaborn: Statistical data visualization in Python
Stars: ✭ 9,007 (-14.04%)
Toolbox: A Java Toolbox for Scalable Probabilistic Machine Learning
Stars: ✭ 105 (-99%)
Muze: Composable data visualisation library for web with a data-first approach, now powered by WebAssembly
Stars: ✭ 1,153 (-89%)
Startr: A template for data journalism in R
Stars: ✭ 69 (-99.34%)
Countly Sdk Cordova: Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-99.34%)
Locopy: Loading/unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-99.3%)
Data Polygamy: Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Stars: ✭ 39 (-99.63%)
Facebook Archive: Just some fun you can have with Facebook's archive data
Stars: ✭ 63 (-99.4%)
Etl with python: ETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-99.35%)
Codesearchnet: Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-86.85%)
Deveeldb: DeveelDB is a complete SQL database system, primarily developed for .NET/Mono frameworks
Stars: ✭ 80 (-99.24%)
Livechart: Android library to draw beautiful and rich line charts.
Stars: ✭ 78 (-99.26%)
Dex: The Data Explorer, a data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-88.18%)
Machine: Machine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-99.26%)
Gopup: Data interfaces: Baidu, Google, Toutiao, and Weibo indices; macroeconomic data; interest rate data; currency exchange rates; "Qianlima" (high-potential) and unicorn companies; Xinwen Lianbo news broadcast transcripts; film and TV box office data; university lists; epidemic data…
Stars: ✭ 1,229 (-88.27%)
Openml R: R package to interface with OpenML
Stars: ✭ 81 (-99.23%)
Paxosstore: PaxosStore has been deployed in WeChat production for more than two years, providing storage services for the core businesses of the WeChat backend. PaxosStore now runs on thousands of machines and is able to handle billions of peak TPS.
Stars: ✭ 1,278 (-87.8%)
Flyte: Accelerate your ML and Data workflows to production. Flyte is a production-grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, Freenome and others, and is truly open source.
Stars: ✭ 1,242 (-88.15%)
D3vue: A D3 Plugin for VueJS
Stars: ✭ 87 (-99.17%)
Ac D3: JavaScript Library for building Audiovisual Charts in D3
Stars: ✭ 76 (-99.27%)
Databench: Data analysis tool.
Stars: ✭ 82 (-99.22%)
Basketball analytics: Repository containing various scripts and work with basketball statistics
Stars: ✭ 88 (-99.16%)