
677 open source projects that are alternatives to or similar to Mare

Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (+6945.45%)
Mutual labels:  spark
Lopq
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (+4718.18%)
Mutual labels:  spark
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+8145.45%)
Mutual labels:  spark
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+4563.64%)
Mutual labels:  spark
Coding Now
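Study notes, along with eBooks and video resources I have gone through and a collection of blogs, websites, and tools I consider worthwhile. Covers the major big data components, Python machine learning and data analysis, Linux, operating systems, algorithms, networking, and more.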
Stars: ✭ 750 (+6718.18%)
Mutual labels:  spark
Coursera Uw Machine Learning Clustering Retrieval
Stars: ✭ 25 (+127.27%)
Mutual labels:  mapreduce
Bigdata
💎🔥 Big data study notes
Stars: ✭ 488 (+4336.36%)
Mutual labels:  mapreduce
Sparkctr
CTR prediction model based on Spark (LR, GBDT, DNN)
Stars: ✭ 740 (+6627.27%)
Mutual labels:  spark
Awesome Scientific Computing
😎 Curated list of awesome software for numerical analysis and scientific computing
Stars: ✭ 476 (+4227.27%)
Mutual labels:  scientific-computing
Edge
Extreme-scale Discontinuous Galerkin Environment (EDGE)
Stars: ✭ 18 (+63.64%)
Mutual labels:  scientific-computing
Spark
Cross-platform real-time collaboration client optimized for business and organizations.
Stars: ✭ 471 (+4181.82%)
Mutual labels:  spark
Cdhproject
Usage of the various Hadoop components, continuously updated
Stars: ✭ 733 (+6563.64%)
Mutual labels:  spark
Poliastro
poliastro - 🚀 Astrodynamics in Python
Stars: ✭ 462 (+4100%)
Mutual labels:  scientific-computing
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+7600%)
Mutual labels:  spark
Frameless
Expressive types for Spark.
Stars: ✭ 717 (+6418.18%)
Mutual labels:  spark
Spark Structured Streaming Book
The Internals of Spark Structured Streaming
Stars: ✭ 371 (+3272.73%)
Mutual labels:  spark
Datafusion
DataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+5454.55%)
Mutual labels:  spark
Mlpack
mlpack: a scalable C++ machine learning library
Stars: ✭ 3,859 (+34981.82%)
Mutual labels:  scientific-computing
Bigdataie
A collection of big data blogs, written test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (+3945.45%)
Mutual labels:  spark
Hail
Scalable genomic data analysis.
Stars: ✭ 706 (+6318.18%)
Mutual labels:  spark
High Performance Spark Examples
Examples for High Performance Spark
Stars: ✭ 436 (+3863.64%)
Mutual labels:  spark
Boxx
Tool-box for efficient build and debug in Python. Especially for Scientific Computing and Computer Vision.
Stars: ✭ 429 (+3800%)
Mutual labels:  scientific-computing
Reflow
A language and runtime for distributed, incremental data processing in the cloud
Stars: ✭ 706 (+6318.18%)
Mutual labels:  scientific-computing
Deepxde
Deep learning library for solving differential equations and more
Stars: ✭ 420 (+3718.18%)
Mutual labels:  scientific-computing
Big Data Scala Spark
Coursera's big data course with Scala and Spark
Stars: ✭ 16 (+45.45%)
Mutual labels:  spark
Librmath.js
Pure JavaScript implementation of R's statistical "core" numerical library, libRmath.so
Stars: ✭ 425 (+3763.64%)
Mutual labels:  scientific-computing
Learn Julia The Hard Way
Learn Julia the hard way!
Stars: ✭ 679 (+6072.73%)
Mutual labels:  scientific-computing
Featran
A Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (+3718.18%)
Mutual labels:  spark
Sidekick
High Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (+3227.27%)
Mutual labels:  spark
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (+3718.18%)
Mutual labels:  spark
Distributed Computing
distributed_computing includes MapReduce, a KV store, etc.
Stars: ✭ 654 (+5845.45%)
Mutual labels:  mapreduce
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+3654.55%)
Mutual labels:  spark
Ocaml Odepack
Binding to the ODEPACK FORTRAN library
Stars: ✭ 6 (-45.45%)
Mutual labels:  scientific-computing
Marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
Stars: ✭ 414 (+3663.64%)
Mutual labels:  spark
Corral
🐎 A serverless MapReduce framework written for AWS Lambda
Stars: ✭ 648 (+5790.91%)
Mutual labels:  mapreduce
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+3590.91%)
Mutual labels:  spark
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (+109.09%)
Mutual labels:  spark
Big data architect skills
Skills a big data architect should master
Stars: ✭ 400 (+3536.36%)
Mutual labels:  spark
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+5654.55%)
Mutual labels:  spark
Amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+3445.45%)
Mutual labels:  scientific-computing
Bigdataguide
Big data learning: learn big data from scratch, including study videos for each stage of learning and interview materials
Stars: ✭ 817 (+7327.27%)
Mutual labels:  spark
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+183054.55%)
Mutual labels:  spark
Vexcl
VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+5590.91%)
Mutual labels:  scientific-computing
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+34563.64%)
Mutual labels:  spark
Core
The core source repository for the Cherab project.
Stars: ✭ 26 (+136.36%)
Mutual labels:  scientific-computing
Wedatasphere
WeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+3281.82%)
Mutual labels:  spark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+51318.18%)
Mutual labels:  spark
Sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (+3245.45%)
Mutual labels:  spark
Itk
Insight Toolkit (ITK) -- Official Repository. ITK builds on a proven, spatially-oriented architecture for processing, segmentation, and registration of scientific images in two, three, or more dimensions.
Stars: ✭ 801 (+7181.82%)
Mutual labels:  scientific-computing
Loopy
A code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+3236.36%)
Mutual labels:  scientific-computing
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+50018.18%)
Mutual labels:  spark
Owl
Owl - OCaml Scientific and Engineering Computing @ http://ocaml.xyz
Stars: ✭ 919 (+8254.55%)
Mutual labels:  scientific-computing
Kyuubi
Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (+3200%)
Mutual labels:  spark
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+7109.09%)
Mutual labels:  spark
Mongo Spark
The MongoDB Spark Connector
Stars: ✭ 588 (+5245.45%)
Mutual labels:  spark
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+3181.82%)
Mutual labels:  spark
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+3190.91%)
Mutual labels:  spark
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+7081.82%)
Mutual labels:  scientific-computing
Pygam
[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (+5072.73%)
Mutual labels:  scientific-computing
Sparklearning
Learning Apache Spark, including code and data. Most parts can run locally.
Stars: ✭ 558 (+4972.73%)
Mutual labels:  spark