Silexsomething to help you spark
Stars: ✭ 61 (-33.7%)
Pytorch Original TransformerMy implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
Stars: ✭ 411 (+346.74%)
UrhoxUrho3D extension library
Stars: ✭ 13 (-85.87%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+341.3%)
HomeApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (+1203.26%)
Ai LabAll-in-one AI container for rapid prototyping
Stars: ✭ 406 (+341.3%)
MlfeatureFeature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-86.96%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-34.78%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+327.17%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-88.04%)
IpykernelIPython Kernel for Jupyter
Stars: ✭ 386 (+319.57%)
Sci PypeA Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.
Stars: ✭ 90 (-2.17%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+21798.91%)
Helm NotmuchSearch emails with Notmuch and Helm
Stars: ✭ 10 (-89.13%)
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+4044.57%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (-34.78%)
ChartsLocalized Helm charts from Helm Hub to China
Stars: ✭ 376 (+308.7%)
VdsVerteego Data Suite
Stars: ✭ 9 (-90.22%)
Go Api BoilerplateGo Server/API boilerplate using best practices DDD CQRS ES gRPC
Stars: ✭ 373 (+305.43%)
Helm S3Helm plugin that allows to set up a chart repository in AWS S3.
Stars: ✭ 372 (+304.35%)
DockerspawnerSpawns JupyterHub single user servers in Docker containers
Stars: ✭ 368 (+300%)
IpyleafletA Jupyter - Leaflet.js bridge
Stars: ✭ 1,103 (+1098.91%)
SidekickHigh Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (+297.83%)
Tiledb VcfEfficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-71.74%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+292.39%)
Neo4jupyterA quick visualization tool for Jupyter and Neo4J
Stars: ✭ 85 (-7.61%)
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (+286.96%)
Spark SwaggerSpark (http://sparkjava.com/) support for Swagger (https://swagger.io/)
Stars: ✭ 25 (-72.83%)
KubespawnerKubernetes spawner for JupyterHub
Stars: ✭ 353 (+283.7%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-36.96%)
Helm Push Helm plugin to push chart package to ChartMuseum
Stars: ✭ 343 (+272.83%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+909.78%)
SparklensQubole Sparklens tool for performance tuning Apache Spark
Stars: ✭ 345 (+275%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-18.48%)
LandscaperDeprecated. Takes a set of Helm Chart references with values (a desired state), and realizes this in a Kubernetes cluster
Stars: ✭ 342 (+271.74%)
Pyspark Setup DemoDemo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-73.91%)
Jupyterlab DashAn Extension for the Interactive development of Dash apps in JupyterLab
Stars: ✭ 342 (+271.74%)
Pega Helm ChartsOrchestrate a Pega Platform™ deployment by using Docker, Kubernetes, and Helm to take advantage of Pega Platform Cloud Choice flexibility.
Stars: ✭ 58 (-36.96%)
Hide codeCode, prompt and output hiding for Jupyter/IPython notebooks.
Stars: ✭ 339 (+268.48%)
DigitrecognizerJava Convolutional Neural Network example for Hand Writing Digit Recognition
Stars: ✭ 23 (-75%)
Ammonite SparkRun spark calculations from Ammonite
Stars: ✭ 88 (-4.35%)
WirbelsturmWirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (+260.87%)
Thinkbayes2Text and code for the forthcoming second edition of Think Bayes, by Allen Downey.
Stars: ✭ 918 (+897.83%)
KubedogLibrary to watch and follow kubernetes resources in CI/CD deploy pipelines
Stars: ✭ 326 (+254.35%)
Homemade Machine Learning🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained
Stars: ✭ 18,594 (+20110.87%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+895.65%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+1198.91%)
Cp Helm ChartsThe Confluent Platform Helm charts enable you to deploy Confluent Platform services on Kubernetes for development, test, and proof of concept environments.
Stars: ✭ 539 (+485.87%)
Intro To PythonAn intro to Python & programming for wanna-be data scientists
Stars: ✭ 536 (+482.61%)
Awesome PulsarA curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-38.04%)
Flux2Open and extensible continuous delivery solution for Kubernetes. Powered by GitOps Toolkit.
Stars: ✭ 1,281 (+1292.39%)
Nbinclude.jlimport code from IJulia Jupyter notebooks into Julia programs
Stars: ✭ 90 (-2.17%)
XpediteA non-sampling profiler purpose built to measure and optimize performance of ultra low latency/real time systems
Stars: ✭ 89 (-3.26%)
CuesheetA framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-6.52%)