apache / falcon

License: Apache-2.0
Mirror of Apache Falcon

Programming Languages

Java, JavaScript, HTML, CSS, Shell, Python

Apache Falcon

Falcon is a feed processing and feed management system that makes it easier for end consumers to onboard their feed processing and feed management on Hadoop clusters.

Why Apache Falcon?

  • Dependencies across data processing pipelines are not easy to establish. Gaps here typically lead to incorrect or partial processing, or to expensive reprocessing. Defining the same feed multiple times can lead to inconsistencies.

  • Input data may not always arrive on time, so processing must be able to start without waiting for all data to arrive and accommodate late data separately.

  • Feed management services such as feed retention, replication across clusters, and archival are burdensome for individual pipeline owners and are better offered as a service to all customers.

  • It should be easy to onboard new workflows/pipelines

  • Smoother integration with metastore/catalog

  • Provide notifications to end customers based on the availability of feed groups (logical groups of related feeds that are likely to be used together).

Online Documentation

You can find the documentation on the Apache Falcon website.

How to Contribute

Before opening a pull request, please go through the Contributing to Apache Falcon wiki. It lists the steps required before creating a PR and the conventions that we follow. If you are looking for issues to pick up, you can look at the starter tasks or open tasks.

Release Notes

You can download the release notes for previous releases from the following links.

0.8

0.7
