Lite Virtual ListVirtual list component library supporting waterfall flow based on vue
Stars: ✭ 223 (+374.47%)
Mobydq🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (+161.7%)
Attic PredictionioPredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,522 (+26542.55%)
TreevizTree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (+102.13%)
SigmfThe Signal Metadata Format Specification
Stars: ✭ 120 (+155.32%)
KeyviKeyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 161 (+242.55%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+3344.68%)
UsqlU-SQL Examples and Issue Tracking
Stars: ✭ 221 (+370.21%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (+144.68%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+27468.09%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+3114.89%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+44770.21%)
AmbariMirror of Apache Ambari
Stars: ✭ 1,576 (+3253.19%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (+225.53%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (+134.04%)
Awkward 0.xManipulate arrays of complex data structures as easily as Numpy.
Stars: ✭ 216 (+359.57%)
FiliEasily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.
Stars: ✭ 151 (+221.28%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (+112.77%)
Couchdb DockerSemi-official Apache CouchDB Docker images
Stars: ✭ 194 (+312.77%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+197.87%)
HelicalinsightHelical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.
Stars: ✭ 214 (+355.32%)
KuduMirror of Apache Kudu
Stars: ✭ 1,360 (+2793.62%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+106.38%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+2746.81%)
ReefMirror of Apache REEF
Stars: ✭ 92 (+95.74%)
Bitcoin Value Predictor[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (+93.62%)
Belajarpython.comOpen Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (+200%)
Parquet MrApache Parquet
Stars: ✭ 1,278 (+2619.15%)
Kafka UiOpen-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+389.36%)
PanoptesA Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (+70.21%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (+70.21%)
IotdbApache IoTDB
Stars: ✭ 1,221 (+2497.87%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (+310.64%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (+195.74%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+68.09%)
PoseidonA search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+3714.89%)
Vue Virtual Scroll List⚡️A vue component support big amount data list with high render performance and efficient.
Stars: ✭ 3,201 (+6710.64%)
Selinon An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (+404.26%)
LabsResearch on distributed system
Stars: ✭ 73 (+55.32%)
BookkeeperApache Bookkeeper
Stars: ✭ 1,178 (+2406.38%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+191.49%)
AppdocsApplication Performance Optimization Summary
Stars: ✭ 1,169 (+2387.23%)
GunAn open source cybersecurity protocol for syncing decentralized graph data.
Stars: ✭ 15,172 (+32180.85%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (+46.81%)