A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.

Stars: ✭ 86 (-39.01%)

Mutual labels: bigdata

Sre Interview Prep Guide

Site Reliability Engineer Interview Preparation Guide

Stars: ✭ 2,446 (+1634.75%)

Mutual labels: study

Athena Cli

Presto-like CLI tool for AWS Athena

Stars: ✭ 85 (-39.72%)

Mutual labels: bigdata

Liteflow

liteflow是一个基于任务版本来实现的分布式任务流调度系统

Stars: ✭ 112 (-20.57%)

Mutual labels: bigdata

Hudi Resources

汇总Apache Hudi相关资料

Stars: ✭ 79 (-43.97%)

Mutual labels: bigdata

Feast

Feature Store for Machine Learning

Stars: ✭ 2,576 (+1726.95%)

Mutual labels: big-data

Labs

Research on distributed system

Stars: ✭ 73 (-48.23%)

Mutual labels: big-data

Setl

A simple Spark-powered ETL framework that just works 🍺

Stars: ✭ 79 (-43.97%)

Mutual labels: big-data

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (-0.71%)

Mutual labels: bigdata

Spark Website

Apache Spark Website

Stars: ✭ 75 (-46.81%)

Mutual labels: big-data

Lambda Arch

Applying Lambda Architecture with Spark, Kafka, and Cassandra.

Stars: ✭ 111 (-21.28%)

Mutual labels: bigdata

Njupt Yellow Page

😋南京邮电大学黄页

Stars: ✭ 74 (-47.52%)

Mutual labels: study

Richdem

High-performance Terrain and Hydrology Analysis

Stars: ✭ 127 (-9.93%)

Mutual labels: big-data

Apache Spark Hands On

Educational notes,Hands on problems w/ solutions for hadoop ecosystem

Stars: ✭ 74 (-47.52%)

Mutual labels: bigdata

Books

技术书籍等

Stars: ✭ 110 (-21.99%)

Mutual labels: bigdata

Attic Apex Malhar

Mirror of Apache Apex malhar

Stars: ✭ 131 (-7.09%)

Mutual labels: big-data

Bookkeeper

Apache Bookkeeper

Stars: ✭ 1,178 (+735.46%)

Mutual labels: big-data

Fpart

Sort files and pack them into partitions

Stars: ✭ 127 (-9.93%)

Mutual labels: bigdata

Technical Interview Megarepo

Study materials for SE/CS technical interviews

Stars: ✭ 1,480 (+949.65%)

Mutual labels: study

Httpperfectguide

http 완벽가이드 책 스터디 모임

Stars: ✭ 72 (-48.94%)

Mutual labels: study

Flinkstreamsql

基于开源的flink，对其实时sql进行扩展；主要实现了流与维表的join，支持原生flink SQL所有的语法

Stars: ✭ 1,682 (+1092.91%)

Mutual labels: bigdata

My Journey In The Data Science World

📢 Ready to learn or review your knowledge!

Stars: ✭ 1,175 (+733.33%)

Mutual labels: big-data

Volcano

A Cloud Native Batch System (Project under CNCF)

Stars: ✭ 2,114 (+1399.29%)

Mutual labels: bigdata

Appdocs

Application Performance Optimization Summary

Stars: ✭ 1,169 (+729.08%)

Mutual labels: big-data

Carbondata

Mirror of Apache CarbonData

Stars: ✭ 1,158 (+721.28%)

Mutual labels: big-data

Daudit

🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!

Stars: ✭ 108 (-23.4%)

Mutual labels: bigdata

Hazelcast Cpp Client

Hazelcast IMDG C++ Client

Stars: ✭ 67 (-52.48%)

Mutual labels: big-data

Software Development Resources

Curated list of Software Development resources

Stars: ✭ 67 (-52.48%)

Mutual labels: study

Sparkling Graph

SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.

Stars: ✭ 139 (-1.42%)

Mutual labels: big-data

Open Source Handbook

⭐️ Open source projects for all skill levels

Stars: ✭ 131 (-7.09%)

Mutual labels: big-data

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (-10.64%)

Mutual labels: bigdata

Awesome Bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

Stars: ✭ 10,478 (+7331.21%)

Mutual labels: bigdata

Flink Shaded

Apache Flink shaded artifacts repository

Stars: ✭ 67 (-52.48%)

Mutual labels: big-data

Ng Docs

非常适合初学Angular的同学阅读的一份文档. 包含Angular API、Rxjs、Zorro(还没做)、在线测验(还没做)等.

Stars: ✭ 66 (-53.19%)

Mutual labels: study

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (-24.11%)

Mutual labels: big-data

Rsparkling

RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)

Stars: ✭ 65 (-53.9%)

Mutual labels: big-data

Mobydq

🐳 Tool to automate data quality checks on data pipelines

Stars: ✭ 123 (-12.77%)

Mutual labels: big-data

Automation

code generator

Stars: ✭ 65 (-53.9%)

Mutual labels: study

Cloud Volume

Read and write Neuroglancer datasets programmatically.

Stars: ✭ 63 (-55.32%)

Mutual labels: big-data

Spark Doc Zh

Apache Spark 官方文档中文版

Stars: ✭ 1,126 (+698.58%)

Mutual labels: big-data

61-120 of 629 similar projects

‹

›

next*5