Node Sinek🎩 Most advanced high level Node.js Kafka client
Stars: ✭ 262 (+509.3%)
Java Studyjava-study 是本人学习Java过程中记录的一些代码!从Java基础的数据类型、jdk1.8的Lambda、Stream和日期的使用、 IO流、数据集合、多线程使用、并发编程、23种设计模式示例代码、常用的工具类, 以及一些常用框架,netty、mina、springboot、kafka、storm、zookeeper、redis、elasticsearch、hbase、hive等等。
Stars: ✭ 571 (+1227.91%)
PucketBucketing and partitioning system for Parquet
Stars: ✭ 29 (-32.56%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+504.65%)
SparklearningLearning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (+1197.67%)
Scrapy ClusterThis Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Stars: ✭ 921 (+2041.86%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (+1186.05%)
GatkOfficial code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+2230.23%)
Mongodb consistent backupA tool for performing consistent backups of MongoDB Clusters or Replica Sets
Stars: ✭ 255 (+493.02%)
Dncdnc 去中心化 开源社区 轻联盟 dncto.com QQ群 779699538
Stars: ✭ 551 (+1181.4%)
keralaDistributed KV Streams
Stars: ✭ 16 (-62.79%)
Cs Video CoursesList of Computer Science courses with video lectures.
Stars: ✭ 27,209 (+63176.74%)
Book本项目收藏这些年来看过或者听过的一些不错的书籍,在整理文件时看见这些,发现删掉有点可惜,放着又太浪费空间,本着分享的原则,就把它们共享出来,一方面给需要的读者提供这些书籍,另一方面也是一种像知识库的积累吧
Stars: ✭ 47 (+9.3%)
GoroseGoRose(go orm), a mini database ORM for golang, which inspired by the famous php framwork laravle's eloquent. It will be friendly for php developer and python or ruby developer. Currently provides six major database drivers: mysql,sqlite3,postgres,oracle,mssql, Clickhouse.
Stars: ✭ 947 (+2102.33%)
JustenoughscalaforsparkA tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+1151.16%)
coreThe XP Framework is an all-purpose, object oriented PHP framework.
Stars: ✭ 13 (-69.77%)
Es Cqrs Shopping CartA resilient and scalable shopping cart system designed using Event Sourcing (ES) and Command Query Responsibility Segregation (CQRS)
Stars: ✭ 19 (-55.81%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-41.86%)
LopqTraining of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (+1132.56%)
dllibdllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (-25.58%)
skytableSkytable is an extremely fast, secure and reliable real-time NoSQL database with automated snapshots and TLS
Stars: ✭ 696 (+1518.6%)
spark-data-sourcesDeveloping Spark External Data Sources using the V2 API
Stars: ✭ 36 (-16.28%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-55.81%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-62.79%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (+1083.72%)
Covid19TrackerA Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (+51.16%)
HeraclesHigh performance HBase / Spark SQL engine
Stars: ✭ 27 (-37.21%)
blogblog entries
Stars: ✭ 39 (-9.3%)
MagellanGeo Spatial Data Analytics on Spark
Stars: ✭ 507 (+1079.07%)
swoole-futures⏳ Futures, Streams & Async/Await for PHP's Swoole asynchronous run-time.
Stars: ✭ 100 (+132.56%)
KafkacenterKafkaCenter is a unified platform for Kafka cluster management and maintenance, producer / consumer monitoring, and use of ecological components.
Stars: ✭ 896 (+1983.72%)
awesome-internalsA curated list of awesome resources and learning materials in the field of X internals
Stars: ✭ 78 (+81.4%)
BrodApache Kafka client library for Erlang/Elixir
Stars: ✭ 501 (+1065.12%)
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-41.86%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+2220.93%)
stream-snitchEvent emitter for watching text streams with regex patterns
Stars: ✭ 19 (-55.81%)
DebeziumChange data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Stars: ✭ 5,937 (+13706.98%)
CasperA compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (+4.65%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (+216.28%)
ZerocodeA community-developed, free, open source, microservices API automation and load testing framework built using JUnit core runners for Http REST, SOAP, Security, Database, Kafka and much more. Zerocode Open Source enables you to create, change, orchestrate and maintain your automated test cases declaratively with absolute ease.
Stars: ✭ 482 (+1020.93%)
web-streams-polyfillWeb Streams, based on the WHATWG spec reference implementation
Stars: ✭ 198 (+360.47%)
SparkmeasureThis is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (+755.81%)
SidekickHigh Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (+751.16%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-74.42%)
Kafka Connect JdbcKafka Connect connector for JDBC-compatible databases
Stars: ✭ 698 (+1523.26%)
KafdropKafka UI and Monitoring Tool
Stars: ✭ 366 (+751.16%)