RoffildlibraryLibrary for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (-65.57%)
DetEditA graphical user interface for annotating and editing events detected in long-term acoustic monitoring data
Stars: ✭ 20 (-89.07%)
Spark LucenerddSpark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-37.7%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-67.21%)
PoliAn easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (+910.93%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-86.34%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (-67.21%)
dllibdllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (-82.51%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-68.31%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-70.49%)
confluent-spark-avroSpark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-90.16%)
blogblog entries
Stars: ✭ 39 (-78.69%)
Liteflowliteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (-38.8%)
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-86.34%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+851.91%)
CasperA compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (-75.41%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-70.49%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (-25.68%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (+743.72%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-39.34%)
Spark Submit UiThis is a based on playframwork for submit spark app
Stars: ✭ 53 (-71.04%)
tpch-sparkTPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (-65.57%)
frovedisFramework of vectorized and distributed data analytics
Stars: ✭ 59 (-67.76%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-72.68%)
ElephasDistributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+731.15%)
kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-82.51%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-74.32%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-22.4%)
Spark FfmFFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-44.81%)
proteicStreaming and static data visualization for the modern web.
Stars: ✭ 37 (-79.78%)
Spark TdaSparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-75.41%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+914.21%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-91.26%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-39.89%)
TwitworkMonitor twitter stream
Stars: ✭ 133 (-27.32%)
Sparkling WaterSparkling Water provides H2O functionality inside Spark cluster
Stars: ✭ 887 (+384.7%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+1244.26%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-2.19%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-4.92%)