MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+4545%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+2245%)
DparkPython clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+13240%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+70%)
Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+4185%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+1125%)
workflUXAn open-source, cloud-ready web application for simplified deployment of big data workflows.
Stars: ✭ 26 (+30%)
TdengineAn open-source big data platform designed and optimized for the Internet of Things (IoT).
Stars: ✭ 17,434 (+87070%)
5Lectures and computer labs storage for IW5 course at FIT VUT.
Stars: ✭ 32 (+60%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+6655%)
Simple It EnglishSimple-IT-English: smart wordbook from community for community
Stars: ✭ 233 (+1065%)
bigquery-data-lineageReference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (+460%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+975%)
IndexMetarhia educational program index 📖
Stars: ✭ 2,045 (+10125%)
ShifuAn end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+935%)
Awesome Learning实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+885%)
study-snap📓📲 Flutter app for managing study materials in form of photos.
Stars: ✭ 34 (+70%)
AOSVLecture notes for Advanced Operating Systems and Virtualization course at Sapienza University of Rome
Stars: ✭ 21 (+5%)
FlinkxBased on Apache Flink. support data synchronization/integration and streaming SQL computation.
Stars: ✭ 2,651 (+13155%)
Java Notes☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据
Stars: ✭ 160 (+700%)
ssd1616 lectures about "Software Systems Design" presented in Innopolis University in 2021 for 3rd year BSc students
Stars: ✭ 44 (+120%)
Javainterview最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (+670%)
Every Single Day I TldrA daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (+1145%)
PersonNotes个人笔记集中营,快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧
Stars: ✭ 61 (+205%)
codefoundryExamples for gauravbytes.com
Stars: ✭ 57 (+185%)
Hadoop Attack LibraryA collection of pentest tools and resources targeting Hadoop environments
Stars: ✭ 228 (+1040%)
awesome-coder-resources编程路上加油站!------【持续更新中...欢迎star,欢迎常回来看看......】【内容:编程/学习/阅读资源,开源项目,面试题,网站,书,博客,教程等等】
Stars: ✭ 54 (+170%)
Node HbaseAsynchronous HBase client for NodeJs using REST
Stars: ✭ 226 (+1030%)
Flink Boot懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管理,同时将微服务以及WEB开发领域中经常用到的框架集成进来,进一步提升开发速度。比如集成Mybatis ORM框架,Hibernate Validator校验框架,Spring Retry重试框架等,具体见下面的脚手架特性。
Stars: ✭ 209 (+945%)
intersect一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集
Stars: ✭ 54 (+170%)
Coursera DlScript for downloading Coursera.org videos and naming them.
Stars: ✭ 8,609 (+42945%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (+815%)
AthenacliAthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (+655%)
Astronomical TechniquesIntroduction to astronomy research featuring short video lectures with (incomplete) Jupyter notebooks
Stars: ✭ 29 (+45%)
NmflibraryMATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (+665%)
gomrjobgomrjob - a Go Framework for Hadoop Map Reduce Jobs
Stars: ✭ 39 (+95%)
HudiUpserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (+12830%)
moodle-downloaderA 4.9 stars rated chrome extension for batch downloading Moodle resources 💾
Stars: ✭ 68 (+240%)
AvroApache Avro is a data serialization system.
Stars: ✭ 2,005 (+9925%)
GeckoDownloadManager🐸 Gecko Download Manager is a Chrome Extension that improves downloading lectures 💾 from the Echo360 System.
Stars: ✭ 44 (+120%)
PoliAn easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (+9150%)
dsp-theoryTheory of digital signal processing (DSP): signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc.
Stars: ✭ 643 (+3115%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+600%)
TwitworkMonitor twitter stream
Stars: ✭ 133 (+565%)