NettythriftThrift on Netty, support TCP/HTTP/WebSocket at same port. support multiple Protocols at same time. multil Simple Clients with Connection Pool.
Stars: ✭ 60 (+106.9%)
Bdp Dataplatform大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+1472.41%)
ThriftApache Thrift
Stars: ✭ 8,821 (+30317.24%)
Docker Spark ClusterA simple spark standalone cluster for your testing environment purposses
Stars: ✭ 261 (+800%)
FinagleA fault tolerant, protocol-agnostic RPC system
Stars: ✭ 8,126 (+27920.69%)
Spark RedisA connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (+2565.52%)
Thrift2flowConverts Thrift specs into Flow JavaScript type definitions
Stars: ✭ 39 (+34.48%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+796.55%)
Thriftclientpoola thrift client connection pool & simple thrift use demo by golang
Stars: ✭ 32 (+10.34%)
Bigdataie大数据博客、笔试题、教程、项目、面经的整理
Stars: ✭ 445 (+1434.48%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+10396.55%)
Spark Jupyter AwsA guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (+793.1%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+751.72%)
Big Data Rosetta CodeCode snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (+775.86%)
Neo4j Spark ConnectorNeo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Stars: ✭ 245 (+744.83%)
RecommendationsystemBook recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (+741.38%)
Hadoop Docker基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
Stars: ✭ 238 (+720.69%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+22168.97%)
Book本项目收藏这些年来看过或者听过的一些不错的书籍,在整理文件时看见这些,发现删掉有点可惜,放着又太浪费空间,本着分享的原则,就把它们共享出来,一方面给需要的读者提供这些书籍,另一方面也是一种像知识库的积累吧
Stars: ✭ 47 (+62.07%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (+682.76%)
Dji Firmware ToolsTools for handling firmwares of DJI products, with focus on quadcopters.
Stars: ✭ 424 (+1362.07%)
Spark WorkshopApache Spark™ and Scala Workshops
Stars: ✭ 224 (+672.41%)
spark-http-streamspark structured streaming via HTTP communication
Stars: ✭ 17 (-41.38%)
Sagemaker SparkA Spark library for Amazon SageMaker.
Stars: ✭ 219 (+655.17%)
ChroniclerScala toolchain for InfluxDB
Stars: ✭ 24 (-17.24%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+644.83%)
daf-kyloKylo integration with PDND (previously DAF).
Stars: ✭ 20 (-31.03%)
FeatranA Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (+1348.28%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (+606.9%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+9896.55%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+2468.97%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+7741.38%)
spark-data-sourcesDeveloping Spark External Data Sources using the V2 API
Stars: ✭ 36 (+24.14%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (+544.83%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (+531.03%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-58.62%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (+506.9%)
Covid19TrackerA Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (+124.14%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (+503.45%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1324.14%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+8582.76%)
SparkV🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (-17.24%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+7086.21%)
Cdhprojecthadoop各组件使用,持续更新
Stars: ✭ 733 (+2427.59%)
odbc2parquetA command line tool to query an ODBC data source and write the result into a parquet file.
Stars: ✭ 95 (+227.59%)
PystoreFast data store for Pandas time-series data
Stars: ✭ 325 (+1020.69%)
IMCtermiteEnables extraction of measurement data from binary files with extension 'raw' used by proprietary software imcFAMOS/imcSTUDIO and facilitates its storage in open source file formats
Stars: ✭ 20 (-31.03%)
SparklearningLearning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (+1824.14%)
SparklintA tool for monitoring and tuning Spark jobs for efficiency.
Stars: ✭ 316 (+989.66%)
Thrift.jlThrift for Julia
Stars: ✭ 25 (-13.79%)
skeinA tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+341.38%)