All Projects → FIW_KRT → Similar Projects or Alternatives

624 Open source projects that are alternatives of or similar to FIW_KRT

Datahub
The Metadata Platform for the Modern Data Stack
Stars: ✭ 4,232 (+23411.11%)
Mutual labels:  big-data
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+8477.78%)
Mutual labels:  big-data
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+16005.56%)
Mutual labels:  big-data
smart-city-analytics
Analyze large data sets collected from a long-range IoT system that uses LoRaWAN networking
Stars: ✭ 28 (+55.56%)
Mutual labels:  notebook
bigstatsr
R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (+672.22%)
Mutual labels:  big-data
mmtf-workshop-2018
Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (+177.78%)
Mutual labels:  big-data
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+16811.11%)
Mutual labels:  big-data
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-22.22%)
Mutual labels:  big-data
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (+494.44%)
Mutual labels:  big-data
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+516.67%)
Mutual labels:  big-data
yildiz
🦄🌟 Graph Database layer on top of Google Bigtable
Stars: ✭ 24 (+33.33%)
Mutual labels:  big-data
predictionio-template-java-ecom-recommender
PredictionIO E-Commerce Recommendation Engine Template (Java-based parallelized engine)
Stars: ✭ 36 (+100%)
Mutual labels:  big-data
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (+461.11%)
Mutual labels:  big-data
ibmpairs
open source tools for interaction with IBM PAIRS:
Stars: ✭ 23 (+27.78%)
Mutual labels:  big-data
Cboard
An easy to use, self-service open BI reporting and BI dashboard platform.
Stars: ✭ 2,795 (+15427.78%)
Mutual labels:  big-data
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+405.56%)
Mutual labels:  big-data
Graph sampling
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (+450%)
Mutual labels:  big-data
predictionio-template-attribute-based-classifier
PredictionIO Classification Engine Template (Scala-based parallelized engine)
Stars: ✭ 38 (+111.11%)
Mutual labels:  big-data
gorilla-repl
A fork of Jony Epsilon's rich REPL for Clojure in the notebook style.
Stars: ✭ 22 (+22.22%)
Mutual labels:  notebook
vxquery
Mirror of Apache VXQuery
Stars: ✭ 19 (+5.56%)
Mutual labels:  big-data
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+60961.11%)
Mutual labels:  big-data
awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+427.78%)
Mutual labels:  big-data
Hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+1266.67%)
Mutual labels:  big-data
hotmap
WebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-27.78%)
Mutual labels:  big-data
Orc
An ORC file format reader and writer for Go.
Stars: ✭ 97 (+438.89%)
Mutual labels:  big-data
egis
Egis - a handy Ruby interface for AWS Athena
Stars: ✭ 38 (+111.11%)
Mutual labels:  big-data
javaer-mind
Java 程序员进阶学习的思维导图
Stars: ✭ 66 (+266.67%)
Mutual labels:  big-data
big-sorter
Java library that sorts very large files of records by splitting into smaller sorted files and merging
Stars: ✭ 49 (+172.22%)
Mutual labels:  big-data
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (+433.33%)
Mutual labels:  big-data
Trafodion
Apache Trafodion
Stars: ✭ 242 (+1244.44%)
Mutual labels:  big-data
Beam
Apache Beam is a unified programming model for Batch and Streaming
Stars: ✭ 5,149 (+28505.56%)
Mutual labels:  big-data
bftkv
A distributed key-value storage that's tolerant to Byzantine fault.
Stars: ✭ 27 (+50%)
Mutual labels:  big-data
Treeviz
Tree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (+427.78%)
Mutual labels:  big-data
v6.dooring.public
可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Stars: ✭ 323 (+1694.44%)
Mutual labels:  big-data
ytpriv
YT metadata exporter
Stars: ✭ 28 (+55.56%)
Mutual labels:  big-data
predictionio-sdk-php
PredictionIO PHP SDK
Stars: ✭ 269 (+1394.44%)
Mutual labels:  big-data
Hazelcast Python Client
Hazelcast IMDG Python Client
Stars: ✭ 92 (+411.11%)
Mutual labels:  big-data
couchdb-mango
Mirror of Apache CouchDB Mango
Stars: ✭ 34 (+88.89%)
Mutual labels:  big-data
Selinon
An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (+1216.67%)
Mutual labels:  big-data
couchdb-couch-plugins
Mirror of Apache CouchDB
Stars: ✭ 14 (-22.22%)
Mutual labels:  big-data
Smart Array To Tree
Convert large amounts of data array to tree fastly
Stars: ✭ 91 (+405.56%)
Mutual labels:  big-data
clusterdock
clusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (+44.44%)
Mutual labels:  big-data
computer-vision-notebooks
👁️ An authorial set of fundamental Python recipes on Computer Vision and Digital Image Processing.
Stars: ✭ 89 (+394.44%)
Mutual labels:  notebook
opendc
Collaborative Datacenter Simulation and Exploration for Everybody
Stars: ✭ 40 (+122.22%)
Mutual labels:  big-data
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+355.56%)
Mutual labels:  big-data
subsemble
subsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (+122.22%)
Mutual labels:  big-data
Books
整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习 、算法等。
Stars: ✭ 222 (+1133.33%)
Mutual labels:  big-data
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+18538.89%)
Mutual labels:  big-data
Uproot4
ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (+344.44%)
Mutual labels:  big-data
MLBD
Materials for "Machine Learning on Big Data" course
Stars: ✭ 20 (+11.11%)
Mutual labels:  big-data
Codex
A free note-taking software for programmers and Computer Science students
Stars: ✭ 242 (+1244.44%)
Mutual labels:  notebook
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+338.89%)
Mutual labels:  big-data
predictionio-sdk-java
PredictionIO Java SDK
Stars: ✭ 107 (+494.44%)
Mutual labels:  big-data
shifting
A privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (+72.22%)
Mutual labels:  big-data
data-viz-utils
Functions for easily making publication-quality figures with matplotlib.
Stars: ✭ 16 (-11.11%)
Mutual labels:  big-data
text-rnn-tensorflow
Tutorial: Multi-layer Recurrent Neural Networks (LSTM, RNN) for text models in Python using TensorFlow.
Stars: ✭ 22 (+22.22%)
Mutual labels:  notebook
jupyterlab plotly
This repository is deprecated. The extension has moved to https://github.com/jupyterlab/jupyter-renderers
Stars: ✭ 16 (-11.11%)
Mutual labels:  notebook
notebooks
A docker-based starter kit for machine learning via jupyter notebooks. Designed for those who just want a runtime environment and get on with machine learning. Docker tags:
Stars: ✭ 29 (+61.11%)
Mutual labels:  notebook
Poseidon
A search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+9861.11%)
Mutual labels:  big-data
Onlinestats.jl
Single-pass algorithms for statistics
Stars: ✭ 507 (+2716.67%)
Mutual labels:  big-data
301-360 of 624 similar projects