DigitrecognizerJava Convolutional Neural Network example for Hand Writing Digit Recognition
Stars: ✭ 23 (-89.73%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-21.87%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+308.93%)
TacticaldataprepKnowledge Review: Tactical Data Preparation (Python and R)
Stars: ✭ 19 (-91.52%)
ParquetviewerSimple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-35.27%)
Sense Extension RecipesA collection of recipes to speed up development of Qlik Sense Visualization Extensions.
Stars: ✭ 17 (-92.41%)
Spark ExcelA Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-3.57%)
Sparkling WaterSparkling Water provides H2O functionality inside Spark cluster
Stars: ✭ 887 (+295.98%)
Docker SparkApache Spark docker image
Stars: ✭ 1,396 (+523.21%)
Bigdataguide大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (+264.73%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+377.68%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-32.14%)
DeequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Stars: ✭ 2,020 (+801.79%)
Spark Submit UiThis is a based on playframwork for submit spark app
Stars: ✭ 53 (-76.34%)
Coding Now学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
Stars: ✭ 750 (+234.82%)
SparkctrCTR prediction model based on spark(LR, GBDT, DNN)
Stars: ✭ 740 (+230.36%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+696.88%)
FramelessExpressive types for Spark.
Stars: ✭ 717 (+220.09%)
ScannsA scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-15.18%)
Pyspark StubsApache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (-56.25%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+677.68%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+182.59%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-56.7%)
Dev SetupmacOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Stars: ✭ 5,590 (+2395.54%)
React WorkshopA step-by-step workshop for learning React fundamentals while building an app
Stars: ✭ 171 (-23.66%)
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+173.66%)
Aspnetcore For BeginnersHalf day workshop for developers who are completely new to .NET Core and ASP.NET ASP.NET
Stars: ✭ 96 (-57.14%)
UnityplaygroundA collection of simple scripts to create 2D physics game, intended for giving workshops to a young audience
Stars: ✭ 603 (+169.2%)
Scalable Data ScienceScalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (-36.61%)
Kubeadm WorkshopShowcasing a bare-metal multi-platform kubeadm setup with persistent storage and monitoring
Stars: ✭ 593 (+164.73%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-58.93%)
AlluxioAlluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+2301.34%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-8.48%)
SparklearningLearning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (+149.11%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (+146.88%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-36.61%)
OpenscoringREST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
Stars: ✭ 536 (+139.29%)
Go Web WorkshopBuild Web Applications with Go on App Engine
Stars: ✭ 515 (+129.91%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+830.36%)
Awesome SparkA curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+373.66%)
ZparkioBoiler plate framework to use Spark and ZIO together.
Stars: ✭ 121 (-45.98%)
Spark Sklearn(Deprecated) Scikit-learn integration package for Apache Spark
Stars: ✭ 1,055 (+370.98%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-79.02%)