jsixA hobby operating system for x86_64, boots with UEFI.
Stars: ✭ 60 (-1.64%)
Daudit🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
Stars: ✭ 108 (+77.05%)
Simple It EnglishSimple-IT-English: smart wordbook from community for community
Stars: ✭ 233 (+281.97%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (+75.41%)
SemiCode-OSThe World First Linux Distribution for Programmers and Web Developers
Stars: ✭ 16 (-73.77%)
TdengineAn open-source big data platform designed and optimized for the Internet of Things (IoT).
Stars: ✭ 17,434 (+28480.33%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+72.13%)
kievA set of tools to do distributed logging for Ruby web applications
Stars: ✭ 46 (-24.59%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+252.46%)
processorA compiler, assembler, and processor.
Stars: ✭ 24 (-60.66%)
Biglassobiglasso: Extending Lasso Model Fitting to Big Data in R
Stars: ✭ 87 (+42.62%)
ShifuAn end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+239.34%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (+40.98%)
sushiElk Audio OS Plugin host and DAW
Stars: ✭ 78 (+27.87%)
Athena CliPresto-like CLI tool for AWS Athena
Stars: ✭ 85 (+39.34%)
Awesome Learning实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+222.95%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (+31.15%)
hayabusaHayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data
Stars: ✭ 43 (-29.51%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (+21.31%)
FlinkxBased on Apache Flink. support data synchronization/integration and streaming SQL computation.
Stars: ✭ 2,651 (+4245.9%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (+13.11%)
lgrepCLI for searching logstash and other elasticsearch based systems
Stars: ✭ 12 (-80.33%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1516.39%)
Java Notes☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据
Stars: ✭ 160 (+162.3%)
Aws Auto Terminate Idle EmrAWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-65.57%)
intersect一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集
Stars: ✭ 54 (-11.48%)
Javainterview最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (+152.46%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+1422.95%)
workflUXAn open-source, cloud-ready web application for simplified deployment of big data workflows.
Stars: ✭ 26 (-57.38%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-91.8%)
AthenacliAthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (+147.54%)
Kube BatchA batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
Stars: ✭ 804 (+1218.03%)
ptt-studyabroad-api🔎 Search articles with personalized results on ptt/studyabroad
Stars: ✭ 57 (-6.56%)
GearpumpLightweight real-time big data streaming engine over Akka
Stars: ✭ 745 (+1121.31%)
PoliAn easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (+2932.79%)
VaexOut-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+11036.07%)
pidi-osA minimalistic operating system
Stars: ✭ 35 (-42.62%)
BigartmFast topic modeling platform
Stars: ✭ 563 (+822.95%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+129.51%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+668.85%)
Bigdataie大数据博客、笔试题、教程、项目、面经的整理
Stars: ✭ 445 (+629.51%)
TwitworkMonitor twitter stream
Stars: ✭ 133 (+118.03%)
Circosjsd3 library to build circular graphs
Stars: ✭ 436 (+614.75%)
SEACSysteme d'exploitation
Stars: ✭ 22 (-63.93%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+2721.31%)
JigsawJigsaw七巧板 provides a set of web components based on Angular5/8/9+. The main purpose of Jigsaw is to help the application developers to construct complex & intensive interacting & user friendly web pages. Jigsaw is supporting the development of all applications of Big Data Product of ZTE.
Stars: ✭ 354 (+480.33%)
rubbanKibana Automatic Index Pattern Discovery and Other Elastic Stack Curating Tasks
Stars: ✭ 49 (-19.67%)
Liteflowliteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (+83.61%)
VolcanoA Cloud Native Batch System (Project under CNCF)
Stars: ✭ 2,114 (+3365.57%)
bigquery-data-lineageReference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (+83.61%)
Dissecting-Xv6Xv6 installation , Adding System Calls
Stars: ✭ 9 (-85.25%)
CS Offer后台开发基础知识总结(春招/秋招)
Stars: ✭ 352 (+477.05%)
Every Single Day I TldrA daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (+308.2%)