All Projects → CogStack → CogStack-NiFi

CogStack / CogStack-NiFi

Licence: other
Building data processing pipelines for documents processing with NLP using Apache NiFi and related services

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects
Jupyter Notebook
11667 projects
groovy
2714 projects
Dockerfile
14818 projects
Makefile
30231 projects

Projects that are alternatives of or similar to CogStack-NiFi

nifi
Deploy a secured, clustered, auto-scaling NiFi service in AWS.
Stars: ✭ 37 (+68.18%)
Mutual labels:  nifi, apache-nifi
nifi-prometheus-reporter
A reporting task in Nifi which is capable of sending monitoring statistics as prometheus metrics to a prometheus pushgateway.
Stars: ✭ 48 (+118.18%)
Mutual labels:  nifi, apache-nifi
NiFi-Rule-engine-processor
Drools processor for Apache NiFi
Stars: ✭ 34 (+54.55%)
Mutual labels:  nifi, apache-nifi
logging-operator
A golang based operator to create and manage EFK (Elasticsearch, Fluentd, and Kibana) stack on Kubernetes
Stars: ✭ 42 (+90.91%)
Mutual labels:  kibana
data-product-streaming
Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Stars: ✭ 32 (+45.45%)
Mutual labels:  data-integration
elk-dashboard-v5-docker
My production setup for the latest version of ELK stack running in a compose, displaying a basic -but powerfull- security and performance dashboard.
Stars: ✭ 25 (+13.64%)
Mutual labels:  kibana
puppet-kibana
Kibana Puppet module by Elastic.
Stars: ✭ 17 (-22.73%)
Mutual labels:  kibana
vagrant-vault-consul-docker-monitoring
No description or website provided.
Stars: ✭ 20 (-9.09%)
Mutual labels:  kibana
logstash filter f5
A Logstash filter for F5 apd, dcc, sshd and tmm syslog.
Stars: ✭ 19 (-13.64%)
Mutual labels:  kibana
docker elk stack
Docker images to run an ELK stack
Stars: ✭ 24 (+9.09%)
Mutual labels:  kibana
OpenOmics
A bioinformatics API and web-app to integrate multi-omics datasets & interface with public databases.
Stars: ✭ 22 (+0%)
Mutual labels:  data-integration
BadMedicine
Library and CLI for randomly generating medical data like you might get out of an Electronic Health Records (EHR) system
Stars: ✭ 18 (-18.18%)
Mutual labels:  electronic-health-records
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+250%)
Mutual labels:  data-integration
elastic-data-lake
Elastic Data Lake
Stars: ✭ 14 (-36.36%)
Mutual labels:  kibana
ELK-Hunting
Threat Hunting with ELK Workshop (InfoSecWorld 2017)
Stars: ✭ 58 (+163.64%)
Mutual labels:  kibana
nifi
Helm nifi chart
Stars: ✭ 29 (+31.82%)
Mutual labels:  nifi
nifi-sqllookup-services-bundle
NIFI controllers for SQL record and attributes lookups with built-in caching
Stars: ✭ 18 (-18.18%)
Mutual labels:  nifi
deflek
index and API RBAC for Elasticsearch and Kibana via reverse proxy. DEPRECATED
Stars: ✭ 13 (-40.91%)
Mutual labels:  kibana
Mapeathor
Translator of spreadsheet mappings into R2RML, RML or YARRRML
Stars: ✭ 27 (+22.73%)
Mutual labels:  data-integration
doctoral-thesis
📖 Generation and Applications of Knowledge Graphs in Systems and Networks Biology
Stars: ✭ 26 (+18.18%)
Mutual labels:  data-integration

Introduction

This repository proposes a possible next step for the free-text data processing capabilities implemented as CogStack-Pipeline, shaping the solution more towards Platform-as-a-Service.

CogStack-NiFi contains example recipes using Apache NiFi as the key data workflow engine with a set of services for documents processing with NLP. Each component implementing key functionality, such as Text Extraction or Natural Language Processing, runs as a service where the data routing between the components and data source/sink is handled by Apache NiFi. Moreover, NLP services are expected to implement an uniform RESTful API to enable easy plugging-in into existing document processing pipelines, making it possible to use any NLP application in the stack.

Important

Please note that the project is under constant improvement, brining new features or services that might impact current deployments, please be aware as this might affect you, the user, when making upgrades, so be sure to check the release notes and the documentation beforehand.

Feel free to ask questions on the github issue tracker or on our discourse website which is frequently used by our development team!

Project organisation

The project is organised in the following directories:

  • nifi - custom Docker image of Apache NiFi with configuration files, drivers, example workflows and custom user resources.
  • security - scripts to generate SSL keys and certificates for Apache NiFi and related services (when needed) with other security-related requirements.
  • services - available services with their corresponding configuration files and resources.
  • deploy - an example deployment of Apache NiFi with related services.
  • scripts - helper scripts such as the one ingesting samples into Elasticsearch.
  • data - any data that you wish to ingest should be placed here.

Official documentation now available here.

As a good starting point, deployment walks through an example deployment with some workflow examples.

All issues are tracked in README, check that section before opening a bug report ticket.

Important news and updates

Please check IMPORTANT_NEWS for any major changes that might affect your deployment and security problems that have been discovered.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].