380 open source projects that are alternatives to or similar to Flume

Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (-76.95%)
Mutual labels:  big-data
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (-99.27%)
Mutual labels:  big-data
Mobydq
🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-94.41%)
Mutual labels:  big-data
FIW KRT
Families In the Wild: A Kinship Recognition Toolbox.
Stars: ✭ 18 (-99.18%)
Mutual labels:  big-data
Stream Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+108%)
Mutual labels:  big-data
shifting
A privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (-98.59%)
Mutual labels:  big-data
Attic Predictionio Template Recommender
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
Stars: ✭ 78 (-96.45%)
Mutual labels:  big-data
HadoopDedup
🍉 Large-scale deduplication of massive data sets, based on Hadoop and HBase
Stars: ✭ 27 (-98.77%)
Mutual labels:  big-data
Redislite
Redis in a python module.
Stars: ✭ 464 (-78.91%)
Mutual labels:  big-data
hazelcast-csharp-client
Hazelcast .NET Client
Stars: ✭ 98 (-95.55%)
Mutual labels:  big-data
Attic Predictionio
PredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,522 (+469.18%)
Mutual labels:  big-data
corpusexplorer2.0
Corpus linguistics has never been so easy...
Stars: ✭ 16 (-99.27%)
Mutual labels:  big-data
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-79.36%)
Mutual labels:  big-data
big-data-engineering-indonesia
A curated list of big data engineering tools, resources and communities.
Stars: ✭ 26 (-98.82%)
Mutual labels:  big-data
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+346.77%)
Mutual labels:  big-data
merkle-db
High-scalability analytics database built on immutable merkle-trees
Stars: ✭ 44 (-98%)
Mutual labels:  big-data
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+902.18%)
Mutual labels:  big-data
metriql
The metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (-89.68%)
Mutual labels:  big-data
Report
Automated, configurable reporting platform. Demo: http://58.87.112.247/report (account: visitor, password: 123456)
Stars: ✭ 123 (-94.41%)
Mutual labels:  big-data
dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-98.23%)
Mutual labels:  big-data
Circosjs
d3 library to build circular graphs
Stars: ✭ 436 (-80.18%)
Mutual labels:  big-data
phoenix-queryserver
Apache Phoenix Query Server
Stars: ✭ 33 (-98.5%)
Mutual labels:  big-data
Bookkeeper
Apache Bookkeeper
Stars: ✭ 1,178 (-46.45%)
Mutual labels:  big-data
cdp-service
CDP data platform that helps enterprises fully understand their customers and deliver precisely targeted, personalized marketing.
Stars: ✭ 30 (-98.64%)
Mutual labels:  big-data
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (-80.91%)
Mutual labels:  big-data
sgd
An R package for large scale estimation with stochastic gradient descent
Stars: ✭ 55 (-97.5%)
Mutual labels:  big-data
Belajarpython.com
Open Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (-93.59%)
Mutual labels:  big-data
ytpriv
YT metadata exporter
Stars: ✭ 28 (-98.73%)
Mutual labels:  big-data
Opendata.cern.ch
Source code for the CERN Open Data portal
Stars: ✭ 411 (-81.32%)
Mutual labels:  big-data
scikit-learn-intelex
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (-59.68%)
Mutual labels:  big-data
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-96.77%)
Mutual labels:  big-data
predictionio-sdk-python
PredictionIO Python SDK
Stars: ✭ 199 (-90.95%)
Mutual labels:  big-data
Mockneat
MockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.
Stars: ✭ 410 (-81.36%)
Mutual labels:  big-data
twitter-archive-reader
Full featured TypeScript Twitter archive reader and browser
Stars: ✭ 43 (-98.05%)
Mutual labels:  big-data
Sigmf
The Signal Metadata Format Specification
Stars: ✭ 120 (-94.55%)
Mutual labels:  big-data
accumulo-testing
Apache Accumulo Testing
Stars: ✭ 14 (-99.36%)
Mutual labels:  big-data
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (-81.82%)
Mutual labels:  big-data
bagri
XML/Document DB on top of distributed cache
Stars: ✭ 40 (-98.18%)
Mutual labels:  big-data
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-96.86%)
Mutual labels:  big-data
TT Tech Space
TT Tech Research Notes
Stars: ✭ 21 (-99.05%)
Mutual labels:  big-data
Ignite
Apache Ignite
Stars: ✭ 4,027 (+83.05%)
Mutual labels:  big-data
masc
Microsoft's contributions for Spark with Apache Accumulo
Stars: ✭ 20 (-99.09%)
Mutual labels:  big-data
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-93.05%)
Mutual labels:  big-data
Detecting-Malicious-URL-Machine-Learning
No description or website provided.
Stars: ✭ 47 (-97.86%)
Mutual labels:  big-data
Hive
Apache Hive
Stars: ✭ 4,031 (+83.23%)
Mutual labels:  big-data
Clickhouse
ClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+858.59%)
Mutual labels:  big-data
Hazelcast Cpp Client
Hazelcast IMDG C++ Client
Stars: ✭ 67 (-96.95%)
Mutual labels:  big-data
Vue Virtual Scroll List
⚡️A Vue component that supports large data lists with high rendering performance and efficiency.
Stars: ✭ 3,201 (+45.5%)
Mutual labels:  big-data
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-83.59%)
Mutual labels:  big-data
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-88.77%)
Mutual labels:  big-data
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-26.41%)
Mutual labels:  big-data
Sylph
Stream computing platform for bigdata
Stars: ✭ 362 (-83.55%)
Mutual labels:  big-data
Bigdata Playground
A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-91.95%)
Mutual labels:  big-data
Keyvi
Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (-92.23%)
Mutual labels:  big-data
Fluo
Apache Fluo
Stars: ✭ 159 (-92.77%)
Mutual labels:  big-data
100daysofmlcode
My journey to learn and grow in the domain of Machine Learning and Artificial Intelligence by performing the #100DaysofMLCode Challenge.
Stars: ✭ 146 (-93.36%)
Mutual labels:  big-data
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (-25.36%)
Mutual labels:  big-data
Orc
An ORC file format reader and writer for Go.
Stars: ✭ 97 (-95.59%)
Mutual labels:  big-data
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-98.91%)
Mutual labels:  big-data
web-click-flow
Offline log analysis of website clickstreams
Stars: ✭ 14 (-99.36%)
Mutual labels:  flume