Presto: The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+5607.93%)
PyRasgo: Helper code to interact with Rasgo via our SDK, PyRasgo
Stars: ✭ 39 (-82.82%)
Spark.jl: Julia binding for Apache Spark
Stars: ✭ 153 (-32.6%)
Vue Virtual Scroll List: ⚡️ A Vue component that supports very large data lists with high, efficient render performance.
Stars: ✭ 3,201 (+1310.13%)
Quantitative-Big-Imaging-2018: (Latest semester at https://github.com/kmader/Quantitative-Big-Imaging-2019) The material for the Quantitative Big Imaging course at ETHZ for the Spring Semester 2018
Stars: ✭ 50 (-77.97%)
Parquetviewer: Simple Windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-36.12%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience to help with the creation, editing, and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+8.81%)
Hydrograph: A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-36.56%)
Aws Etl Orchestrator: A serverless architecture for orchestrating ETL jobs in arbitrarily complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+7.93%)
Belajarpython.com: Open Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (-37.89%)
Kafka Ui: Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+1.32%)
Poseidon: A search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+689.87%)
Accelerator: The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-39.65%)
Eland: Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+3.52%)
mmtf-spark: Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-91.19%)
Hama: Mirror of Apache Hama
Stars: ✭ 129 (-43.17%)
Lite Virtual List: Vue-based virtual list component library supporting waterfall flow
Stars: ✭ 223 (-1.76%)
bagri: XML/Document DB on top of distributed cache
Stars: ✭ 40 (-82.38%)
Azuredatalake: Samples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-43.61%)
Usql: U-SQL Examples and Issue Tracking
Stars: ✭ 221 (-2.64%)
Griffon Vm: Griffon Data Science Virtual Machine
Stars: ✭ 128 (-43.61%)
dislib: The Distributed Computing library for Python, implemented using the PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-82.82%)
Mobydq: 🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-45.81%)
Awkward 0.x: Manipulate arrays of complex data structures as easily as NumPy.
Stars: ✭ 216 (-4.85%)
Report: Automated, configurable reporting platform. Demo: http://58.87.112.247/report (account: visitor, password: 123456)
Stars: ✭ 123 (-45.81%)
Kudu: Mirror of Apache Kudu
Stars: ✭ 1,360 (+499.12%)
Sigmf: The Signal Metadata Format Specification
Stars: ✭ 120 (-47.14%)
Helicalinsight: Helical Insight software is the world’s first Open Source Business Intelligence framework, which helps you make sense of your data and make well-informed decisions.
Stars: ✭ 214 (-5.73%)
Drill: Apache Drill is a distributed MPP query layer for self-describing data
Stars: ✭ 1,619 (+613.22%)
Clustering4Ever: C4E, a JVM-friendly library written in Scala for both local and distributed (Spark) clustering.
Stars: ✭ 126 (-44.49%)
Amazon S3 Find And Forget: Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-49.34%)
Just Dashboard: 📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+565.64%)
awesome-dbt: A curated list of awesome dbt resources
Stars: ✭ 520 (+129.07%)
Ambari: Mirror of Apache Ambari
Stars: ✭ 1,576 (+594.27%)
Data Science Live Book: An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-14.98%)
Bigdataclass: Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-51.54%)
couchdb-pkg: Apache CouchDB Packaging support files
Stars: ✭ 24 (-89.43%)
Gun: An open source cybersecurity protocol for syncing decentralized graph data.
Stars: ✭ 15,172 (+6583.7%)
Vizuka: Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-55.95%)
Flume: Mirror of Apache Flume
Stars: ✭ 2,200 (+869.16%)
Keyvi: Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (-24.67%)
Attic Predictionio: PredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,522 (+5416.3%)
Dvid: Distributed, Versioned, Image-oriented Dataservice
Stars: ✭ 174 (-23.35%)
leetspeek: Open and collaborative content from leet hackers!
Stars: ✭ 11 (-95.15%)
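awesome-coder-resources: A refueling station on your programming journey! [Continuously updated; stars welcome, and come back often.] [Contents: programming, learning, and reading resources, open-source projects, interview questions, websites, books, blogs, tutorials, and more]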
Stars: ✭ 54 (-76.21%)
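cdp-service: CDP data platform that helps enterprises fully understand their customers and deliver precisely targeted, personalized marketing.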
Stars: ✭ 30 (-86.78%)