local-docker-dbA bunch o' Docker Compose files used to quickly spin up local databases.
Stars: ✭ 251 (+1155%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+385%)
spark-word2vecA parallel implementation of word2vec based on Spark
Stars: ✭ 24 (+20%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+6590%)
docker-symfonyDocker Symfony (PHP-FPM - NGINX - MySQL - MailHog - Redis - RabbitMQ)
Stars: ✭ 32 (+60%)
tengoGo La Tengo: a MySQL automation library
Stars: ✭ 27 (+35%)
Ammonite SparkRun spark calculations from Ammonite
Stars: ✭ 88 (+340%)
spark-data-sourcesDeveloping Spark External Data Sources using the V2 API
Stars: ✭ 36 (+80%)
LEMPerLEMPer Stack is terminal-based LEMP / LNMP installer and manager for Debian & Ubuntu cloud or virtual server (vps) and on-premise (bare metal).
Stars: ✭ 171 (+755%)
CuesheetA framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (+330%)
database connections⚙️Demonstration code to connect R on MacOS to various database flavors.
Stars: ✭ 18 (-10%)
Hops ExamplesExamples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (+320%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (+310%)
MySQL ModuleMySQL connector to Godot Engine.
Stars: ✭ 30 (+50%)
MleapMLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+6060%)
nifi-stateless-operatorAn Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes
Stars: ✭ 52 (+160%)
Spark GbtlrHybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (+305%)
larawellMonolithic docker container to run your Laravel apps: MariaDB/Redis/Nginx/PHP7.0-Fpm with properly configured cron and queue
Stars: ✭ 14 (-30%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (+290%)
smolderHL7 Apache Spark Datasource
Stars: ✭ 33 (+65%)
Ds CheatsheetsList of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+47160%)
database-journalDatabases: Concepts, commands, codes, interview questions and more...
Stars: ✭ 50 (+150%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (+270%)
akka-microserviceExample of a microservice with Scala, Akka, Spray and Camel/ActiveMQ
Stars: ✭ 45 (+125%)
swaggerqlEasily and simply convert SQL database into a REST API with Swagger documentation
Stars: ✭ 40 (+100%)
Kamu CliNext generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (+245%)
nifi-fdsMirror of Apache NiFi Flow Design System
Stars: ✭ 25 (+25%)
lib mysqludf redisProvides Mysql UDF commands to synchronize data from Mysql to Redis.
Stars: ✭ 20 (+0%)
Fast MrmrAn improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (+235%)
friendicaFriendica Communications Platform
Stars: ✭ 1,048 (+5140%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (+225%)
Developer-ExamplesThis repository contains samples applications demonstrating the power of MariaDB!
Stars: ✭ 35 (+75%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+220%)
SparkV🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (+20%)
RoffildlibraryLibrary for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (+215%)
exposed-upsertUpsert DSL extension for Exposed, Kotlin SQL framework
Stars: ✭ 21 (+5%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (+200%)
Spark Fast TestsApache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Stars: ✭ 249 (+1145%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (+200%)
HyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+1130%)
dllibdllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (+60%)
confluent-spark-avroSpark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-10%)
dgiot-dashboardDG-IoT平台行业应用扩展插件 DG-IoT for application plugin
Stars: ✭ 229 (+1045%)
kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (+60%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+805%)
CogStack-NiFiBuilding data processing pipelines for documents processing with NLP using Apache NiFi and related services
Stars: ✭ 22 (+10%)