All Projects → Spark On Kubernetes Helm → Similar Projects or Alternatives

1163 Open source projects that are alternatives of or similar to Spark On Kubernetes Helm

something to help you spark

Stars: ✭ 61 (-33.7%)

Mutual labels: spark

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Stars: ✭ 411 (+346.74%)

Mutual labels: jupyter

Urhox

Urho3D extension library

Stars: ✭ 13 (-85.87%)

Mutual labels: spark

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (+341.3%)

Mutual labels: spark

Home

ApacheCN 开源组织：公告、介绍、成员、活动、交流方式

Stars: ✭ 1,199 (+1203.26%)

Mutual labels: spark

Ai Lab

All-in-one AI container for rapid prototyping

Stars: ✭ 406 (+341.3%)

Mutual labels: jupyter

Mlfeature

Feature engineering toolkit for Spark MLlib.

Stars: ✭ 12 (-86.96%)

Mutual labels: spark

Big data architect skills

一个大数据架构师应该掌握的技能

Stars: ✭ 400 (+334.78%)

Mutual labels: spark

Waimak

Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.

Stars: ✭ 60 (-34.78%)

Mutual labels: spark

Iceberg

Iceberg is a table format for large, slow-moving tabular data

Stars: ✭ 393 (+327.17%)

Mutual labels: spark

Sparkjni

A heterogeneous Apache Spark framework.

Stars: ✭ 11 (-88.04%)

Mutual labels: spark

Ipykernel

IPython Kernel for Jupyter

Stars: ✭ 386 (+319.57%)

Mutual labels: jupyter

Sci Pype

A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.

Stars: ✭ 90 (-2.17%)

Mutual labels: jupyter

Redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Stars: ✭ 20,147 (+21798.91%)

Mutual labels: spark

Helm Notmuch

Search emails with Notmuch and Helm

Stars: ✭ 10 (-89.13%)

Mutual labels: helm

Bigdl

Building Large-Scale AI Applications for Distributed Big Data

Stars: ✭ 3,813 (+4044.57%)

Mutual labels: spark

Zemberek Nlp Server

Zemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu

Stars: ✭ 60 (-34.78%)

Mutual labels: spark

Charts

Localized Helm charts from Helm Hub to China

Stars: ✭ 376 (+308.7%)

Mutual labels: helm

Vds

Verteego Data Suite

Stars: ✭ 9 (-90.22%)

Mutual labels: jupyter

Go Api Boilerplate

Go Server/API boilerplate using best practices DDD CQRS ES gRPC

Stars: ✭ 373 (+305.43%)

Mutual labels: helm

Spark Website

Apache Spark Website

Stars: ✭ 75 (-18.48%)

Mutual labels: spark

Helm S3

Helm plugin that allows to set up a chart repository in AWS S3.

Stars: ✭ 372 (+304.35%)

Mutual labels: helm

Python Ml

Stars: ✭ 8 (-91.3%)

Mutual labels: jupyter

Dockerspawner

Spawns JupyterHub single user servers in Docker containers

Stars: ✭ 368 (+300%)

Mutual labels: jupyter

Ipyleaflet

A Jupyter - Leaflet.js bridge

Stars: ✭ 1,103 (+1098.91%)

Mutual labels: jupyter

Sidekick

High Performance HTTP Sidecar Load Balancer

Stars: ✭ 366 (+297.83%)

Mutual labels: spark

Tiledb Vcf

Efficient variant-call data storage and retrieval library using the TileDB storage library.

Stars: ✭ 26 (-71.74%)

Mutual labels: spark

Metorikku

A simplified, lightweight ETL Framework based on Apache Spark

Stars: ✭ 361 (+292.39%)

Mutual labels: spark

Neo4jupyter

A quick visualization tool for Jupyter and Neo4J

Stars: ✭ 85 (-7.61%)

Mutual labels: jupyter

Quantitative Notebooks

Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy

Stars: ✭ 356 (+286.96%)

Mutual labels: jupyter

Spark Swagger

Spark (http://sparkjava.com/) support for Swagger (https://swagger.io/)

Stars: ✭ 25 (-72.83%)

Mutual labels: spark

Kubespawner

Kubernetes spawner for JupyterHub

Stars: ✭ 353 (+283.7%)

Mutual labels: jupyter

Pyspark Examples

Code examples on Apache Spark using python

Stars: ✭ 58 (-36.96%)

Mutual labels: spark

Helm Push

Helm plugin to push chart package to ChartMuseum

Stars: ✭ 343 (+272.83%)

Mutual labels: helm

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (+909.78%)

Mutual labels: spark

Sparklens

Qubole Sparklens tool for performance tuning Apache Spark

Stars: ✭ 345 (+275%)

Mutual labels: spark

Cleanframes

type-class based data cleansing library for Apache Spark SQL

Stars: ✭ 75 (-18.48%)

Mutual labels: spark

Landscaper

Deprecated. Takes a set of Helm Chart references with values (a desired state), and realizes this in a Kubernetes cluster

Stars: ✭ 342 (+271.74%)

Mutual labels: helm

Pyspark Setup Demo

Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks

Stars: ✭ 24 (-73.91%)

Mutual labels: jupyter

Jupyterlab Dash

An Extension for the Interactive development of Dash apps in JupyterLab

Stars: ✭ 342 (+271.74%)

Mutual labels: jupyter

Pega Helm Charts

Orchestrate a Pega Platform™ deployment by using Docker, Kubernetes, and Helm to take advantage of Pega Platform Cloud Choice flexibility.

Stars: ✭ 58 (-36.96%)

Mutual labels: helm

Hide code

Code, prompt and output hiding for Jupyter/IPython notebooks.

Stars: ✭ 339 (+268.48%)

Mutual labels: jupyter

Digitrecognizer

Java Convolutional Neural Network example for Hand Writing Digit Recognition

Stars: ✭ 23 (-75%)

Mutual labels: spark

Ammonite Spark

Run spark calculations from Ammonite

Stars: ✭ 88 (-4.35%)

Mutual labels: spark

Wirbelsturm

Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.

Stars: ✭ 332 (+260.87%)

Mutual labels: spark

Thinkbayes2

Text and code for the forthcoming second edition of Think Bayes, by Allen Downey.

Stars: ✭ 918 (+897.83%)

Mutual labels: jupyter

Kubedog

Library to watch and follow kubernetes resources in CI/CD deploy pipelines

Stars: ✭ 326 (+254.35%)

Mutual labels: helm

Jupyterlab Python Bytecode

JupyterLab extension to explore CPython Bytecode

Stars: ✭ 57 (-38.04%)

Mutual labels: jupyter

Homemade Machine Learning

🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained

Stars: ✭ 18,594 (+20110.87%)

Mutual labels: jupyter

Kylo

Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.

Stars: ✭ 916 (+895.65%)

Mutual labels: spark

Jupyter Edu Book

Teaching and Learning with Jupyter

Stars: ✭ 325 (+253.26%)

Mutual labels: jupyter

Dataspherestudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+1198.91%)

Mutual labels: spark

Weblogsanalysissystem

A big data platform for analyzing web access logs

Stars: ✭ 37 (-59.78%)

Mutual labels: spark

Cp Helm Charts

The Confluent Platform Helm charts enable you to deploy Confluent Platform services on Kubernetes for development, test, and proof of concept environments.

Stars: ✭ 539 (+485.87%)

Mutual labels: helm

Intro To Python

An intro to Python & programming for wanna-be data scientists

Stars: ✭ 536 (+482.61%)

Mutual labels: jupyter

Awesome Pulsar

A curated list of Pulsar tools, integrations and resources.

Stars: ✭ 57 (-38.04%)

Mutual labels: spark

Flux2

Open and extensible continuous delivery solution for Kubernetes. Powered by GitOps Toolkit.