awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 11,093 (+65152.94%)
Awesome Bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+61535.29%)
mediapipe plus
The purpose of this project is to apply mediapipe to more AI chips.
Stars: ✭ 38 (+123.53%)
realar
5 kB Advanced state manager for React
Stars: ✭ 41 (+141.18%)
dynamic.yaml
DEPRECATED: YAML-based data transformations
Stars: ✭ 14 (-17.65%)
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+129.41%)
the-apache-ignite-book
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+282.35%)
analyzing-reddit-sentiment-with-aws
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing Reddit comments in real time. 100-200 level tutorial.
Stars: ✭ 40 (+135.29%)
Geoweaver
A web system that lets users automatically record history and manage complicated scientific workflows in web browsers, involving online spatial data facilities, high-performance computation platforms, and open-source libraries.
Stars: ✭ 32 (+88.24%)
vent
Vent is a light-weight platform built to automate network collection and analysis pipelines using a flexible set of popular open source tools and technologies. Vent is Python-based, extensible, leverages Docker containers, and provides both an API and CLI.
Stars: ✭ 73 (+329.41%)
datajoint-python
Relational data pipelines for the science lab
Stars: ✭ 140 (+723.53%)
hellhound
A set of libraries to create asynchronous, high performance, scalable and simple applications.
Stars: ✭ 33 (+94.12%)
react-wrangler
A React component for simple declarative state management with "one way data flow" and side effects
Stars: ✭ 16 (-5.88%)
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+3752.94%)
pipen
pipen - A pipeline framework for Python
Stars: ✭ 82 (+382.35%)
cdc
A library for performing Content-Defined Chunking (CDC) on data streams.
Stars: ✭ 18 (+5.88%)
effepi
Fun functional programming with pipelinable functions
Stars: ✭ 13 (-23.53%)
ob bulkstash
Bulk Stash is a Docker rclone service to sync, or copy, files between different storage services. For example, you can copy files from a remote storage service such as Amazon S3 to Google Cloud Storage, or from your laptop to remote storage.
Stars: ✭ 113 (+564.71%)
cephgeorep
An efficient unidirectional remote backup daemon for CephFS.
Stars: ✭ 27 (+58.82%)
saisoku
Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs.
Stars: ✭ 40 (+135.29%)
cinje
A Pythonic and ultra fast template engine DSL.
Stars: ✭ 26 (+52.94%)
openPDC
Open Source Phasor Data Concentrator
Stars: ✭ 109 (+541.18%)
opentrials-airflow
Configuration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (+5.88%)
tornado
The Tornado 🌪️ framework, designed and implemented for adaptive online learning and data stream mining in Python.
Stars: ✭ 110 (+547.06%)
MERlin
MERlin is an extensible analysis pipeline applied to decoding MERFISH data
Stars: ✭ 19 (+11.76%)
rule-engine
A process-based, event-driven, extensible, reactive, lightweight rule engine.
Stars: ✭ 165 (+870.59%)
Direct-Messages-in-Django
A tutorial on how to create synchronous Slack-inspired message channels and private messages using Django. Enjoy!
Stars: ✭ 84 (+394.12%)
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+158.82%)
do
Simplest way to manage asynchronicity
Stars: ✭ 33 (+94.12%)
ATOM
Automated Tool for Optimized Modelling
Stars: ✭ 85 (+400%)
cq
Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Stars: ✭ 111 (+552.94%)
twitter-stream-api
🐤 Another Twitter stream PHP library to retrieve filtered tweets on hot.
Stars: ✭ 11 (-35.29%)
makinage
Stream Processing Made Easy
Stars: ✭ 31 (+82.35%)
mutable
State containers with dirty checking and more
Stars: ✭ 32 (+88.24%)
naas
⚙️ Schedule notebooks, run them like APIs, securely expose your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+1188.24%)
prax
Experimental rendering library geared towards hybrid SSR+SPA apps. Focus on radical simplicity and performance. Tiny and dependency-free.
Stars: ✭ 18 (+5.88%)
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning model development and production purposes.
Stars: ✭ 22 (+29.41%)
output-file-sync
Synchronously write a file and create its ancestor directories if needed
Stars: ✭ 14 (-17.65%)
mxfactorial
A payment application intended for deployment by the United States Treasury
Stars: ✭ 36 (+111.76%)
godsend
A simple and eloquent workflow for streaming messages to micro-services.
Stars: ✭ 15 (-11.76%)
roda
Röda: A stream-oriented scripting language
Stars: ✭ 43 (+152.94%)
surround
Surround is a framework for building AI driven microservices in Python, https://surround.readthedocs.io/en/latest/
Stars: ✭ 19 (+11.76%)
MegFlow
Efficient ML solution for long-tailed demands.
Stars: ✭ 372 (+2088.24%)
wrangler
Wrangler Transform: A DMD system for transforming Big Data
Stars: ✭ 63 (+270.59%)
fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Stars: ✭ 123 (+623.53%)
graflow
A graph stream library for JavaScript
Stars: ✭ 53 (+211.76%)
lightflow
A lightweight, distributed workflow system
Stars: ✭ 67 (+294.12%)
daany
Daany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLAS and LAPACK.
Stars: ✭ 49 (+188.24%)
snapos
Snapcast OS
Stars: ✭ 73 (+329.41%)
jobAnalytics and search
The JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+47.06%)
GVProf
GVProf: A Value Profiler for GPU-based Clusters
Stars: ✭ 25 (+47.06%)
tutorials
Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-17.65%)