Storm Camel ExampleReal-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-86.21%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-63.05%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-31.03%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (-61.58%)
ShifuAn end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+1.97%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-59.61%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-54.68%)
Every Single Day I TldrA daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (+22.66%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (-67.98%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-48.28%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+357.64%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-90.15%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-83.25%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-92.12%)
flokkrDocumentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (-85.22%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+559.11%)
SparktutorialSource code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-48.28%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-45.32%)
hadoopofficeHadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-72.41%)
Docker Spark ClusterA simple spark standalone cluster for your testing environment purposses
Stars: ✭ 261 (+28.57%)
Ytk LearnYtk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (+66.01%)
Big Data Rosetta CodeCode snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (+25.12%)
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+1778.33%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+702.96%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-36.95%)
MarmarayGeneric Data Ingestion & Dispersal Library for Hadoop
Stars: ✭ 414 (+103.94%)
AlluxioAlluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+2549.75%)
Pdf编程电子书,电子书,编程书籍,包括C,C#,Docker,Elasticsearch,Git,Hadoop,HeadFirst,Java,Javascript,jvm,Kafka,Linux,Maven,MongoDB,MyBatis,MySQL,Netty,Nginx,Python,RabbitMQ,Redis,Scala,Solr,Spark,Spring,SpringBoot,SpringCloud,TCPIP,Tomcat,Zookeeper,人工智能,大数据类,并发编程,数据库类,数据挖掘,新面试题,架构设计,算法系列,计算机类,设计模式,软件测试,重构优化,等更多分类
Stars: ✭ 12,009 (+5815.76%)
Kafka Storm StarterCode examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+258.62%)
Bdp Dataplatform大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+124.63%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-97.54%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+747.78%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (-44.33%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+708.87%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-25.12%)
Hadoop CommonMirror of Apache Hadoop common
Stars: ✭ 155 (-23.65%)
Hive Jdbc Uber JarHive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (-7.39%)
Interview写在2019年后的蚂蚁、头条、拼多多的面试总结
Stars: ✭ 155 (-23.65%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-13.79%)
NmflibraryMATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (-24.63%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-24.14%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+1020.2%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-7.88%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-14.29%)
Javainterview最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (-24.14%)
Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+930.54%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1140.39%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+884.24%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-24.63%)
Interview QuestionsList of all the Interview questions practiced from online resources and books
Stars: ✭ 187 (-7.88%)
InterviewEverything you need to prepare for your technical interview
Stars: ✭ 14,788 (+7184.73%)