All Projects → Scio → Similar Projects or Alternatives

1607 Open source projects that are alternatives of or similar to Scio

Beam
Apache Beam is a unified programming model for Batch and Streaming
Stars: ✭ 5,149 (+129.15%)
Mutual labels:  batch, streaming, beam
Gcp Variant Transforms
GCP Variant Transforms
Stars: ✭ 100 (-95.55%)
Mutual labels:  dataflow, bigquery, beam
bigflow
A Python framework for data processing on GCP.
Stars: ✭ 96 (-95.73%)
Mutual labels:  bigquery, beam, dataflow
bigquery-to-datastore
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Stars: ✭ 56 (-97.51%)
Mutual labels:  bigquery, beam, google-cloud
Onyx
Distributed, masterless, high performance, fault tolerant data processing
Stars: ✭ 2,019 (-10.15%)
Mutual labels:  batch, data, streaming
openmessaging.github.io
OpenMessaging homepage
Stars: ✭ 12 (-99.47%)
Mutual labels:  streaming, batch
Lexpredict Lexnlp
LexNLP by LexPredict
Stars: ✭ 439 (-80.46%)
Mutual labels:  data, ml
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-99.29%)
Mutual labels:  bigquery, google-cloud
managed ml systems and iot
Managed Machine Learning Systems and Internet of Things Live Lesson
Stars: ✭ 35 (-98.44%)
Mutual labels:  bigquery, ml
bigquery-data-lineage
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (-95.02%)
Mutual labels:  bigquery, dataflow
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-99.24%)
Mutual labels:  bigquery, google-cloud
dataflow-contact-center-speech-analysis
Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (-97.95%)
Mutual labels:  google-cloud, dataflow
DataflowTemplates
Convenient Dataflow pipelines for transforming data between cloud data sources
Stars: ✭ 22 (-99.02%)
Mutual labels:  bigquery, dataflow
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (-52.11%)
Mutual labels:  data, ml
kuromoji-for-bigquery
Tokenize Japanese text on BigQuery with Kuromoji in Apache Beam/Google Dataflow at scale
Stars: ✭ 11 (-99.51%)
Mutual labels:  bigquery, google-cloud
bqv
The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-99.02%)
Mutual labels:  bigquery, google-cloud
Attention Ocr
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Stars: ✭ 844 (-62.44%)
Mutual labels:  google-cloud, ml
Pandas Gbq
Pandas Google BigQuery
Stars: ✭ 243 (-89.19%)
Mutual labels:  bigquery, data
iris3
An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (-98.31%)
Mutual labels:  bigquery, google-cloud
ob google-bigquery
This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about installation, configuration or ongoing maintenance related to an SDK environment. This can be helpful to those who would prefer to not to be responsible for those activities.
Stars: ✭ 43 (-98.09%)
Mutual labels:  bigquery, google-cloud
end-to-end-machine-learning-with-google-cloud
End to End Machine Learning with Google Cloud Platform
Stars: ✭ 39 (-98.26%)
Mutual labels:  google-cloud, dataflow
firehose
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Stars: ✭ 213 (-90.52%)
Mutual labels:  bigquery, streaming
Streaming Readings
Streaming System 相关的论文读物
Stars: ✭ 554 (-75.34%)
Mutual labels:  dataflow, streaming
Featran
A Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (-81.31%)
Mutual labels:  data, ml
Openmessaging Java
OpenMessaging Runtime Interface for Java
Stars: ✭ 685 (-69.51%)
Mutual labels:  batch, streaming
Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (-63.02%)
Mutual labels:  data, ml
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (-57.45%)
Mutual labels:  google-cloud, bigquery
Athenax
SQL-based streaming analytics platform at scale
Stars: ✭ 1,178 (-47.57%)
Mutual labels:  data, streaming
Specification
OpenMessaging Specification
Stars: ✭ 242 (-89.23%)
Mutual labels:  batch, streaming
Magnolify
A collection of Magnolia add-on modules
Stars: ✭ 81 (-96.4%)
Mutual labels:  google-cloud, bigquery
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-38.67%)
Mutual labels:  data, ml
Tesseract
A set of libraries for rapidly developing Pipeline driven micro/macroservices.
Stars: ✭ 20 (-99.11%)
Mutual labels:  data, dataflow
Raftlib
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
Stars: ✭ 717 (-68.09%)
Mutual labels:  dataflow, streaming
Join Monster Graphql Tools Adapter
Use Join Monster to fetch your data with Apollo Server.
Stars: ✭ 130 (-94.21%)
Mutual labels:  batch, data
argon
Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (-98.62%)
Mutual labels:  bigquery, google-cloud
Pothosblocks
A collection of core processing blocks
Stars: ✭ 7 (-99.69%)
Mutual labels:  dataflow, streaming
Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (-94.39%)
Mutual labels:  google-cloud, bigquery
Audioowl
Fast and simple music and audio analysis using RNN in Python 🕵️‍♀️ 🥁
Stars: ✭ 151 (-93.28%)
Mutual labels:  data, ml
Lfai Landscape
🌄 Open Source AI Landscape - provides overview of top tier projects in the open source AI ecosystem, shows projects through GitHub data, funding or market cap, first and last commits, contributor count and much other information.
Stars: ✭ 172 (-92.35%)
Mutual labels:  data
Ml
Machine learning tools in JavaScript
Stars: ✭ 2,206 (-1.82%)
Mutual labels:  ml
Ee Outliers
Open-source framework to detect outliers in Elasticsearch events
Stars: ✭ 172 (-92.35%)
Mutual labels:  ml
General Store
Simple, flexible store implementation for Flux. #hubspot-open-source
Stars: ✭ 171 (-92.39%)
Mutual labels:  data
Googlecloudarchitectprofessional
Resources to prepare for Google Certified Cloud Architect Professional Exam - 2017
Stars: ✭ 177 (-92.12%)
Mutual labels:  google-cloud
Openintro
📦 R package for data and supplemental functions for OpenIntro resources
Stars: ✭ 176 (-92.17%)
Mutual labels:  data
Rekord
A javascript REST ORM that is offline and real-time capable
Stars: ✭ 171 (-92.39%)
Mutual labels:  batch
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-92.39%)
Mutual labels:  data
Haishinkit.swift
Camera and Microphone streaming library via RTMP, HLS for iOS, macOS, tvOS.
Stars: ✭ 2,237 (-0.45%)
Mutual labels:  streaming
Koel
🐦 A personal music streaming server that works.
Stars: ✭ 13,105 (+483.22%)
Mutual labels:  streaming
Logstash
Logstash - transport and process your logs, events, or other data
Stars: ✭ 12,543 (+458.21%)
Mutual labels:  streaming
Andrew Ng Notes
This is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-91.99%)
Mutual labels:  ml
Pygeoapi
pygeoapi is a Python server implementation of the OGC API suite of standards. The project emerged as part of the next generation OGC API efforts in 2018 and provides the capability for organizations to deploy a RESTful OGC API endpoint using OpenAPI, GeoJSON, and HTML. pygeoapi is open source and released under an MIT license.
Stars: ✭ 178 (-92.08%)
Mutual labels:  data
Ncov2019 data crawler
疫情数据爬虫,2019新型冠状病毒数据仓库,轨迹数据,同乘数据,报道
Stars: ✭ 175 (-92.21%)
Mutual labels:  data
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-7.25%)
Mutual labels:  ml
Databay
Databay is a Python interface for scheduled data transfer. It facilitates transfer of (any) data from A to B, on a scheduled interval.
Stars: ✭ 175 (-92.21%)
Mutual labels:  data
Exportsheetdata
Add-on for Google Sheets that allows sheets to be exported as JSON or XML.
Stars: ✭ 170 (-92.43%)
Mutual labels:  data
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+118.91%)
Mutual labels:  data
Alloy
Make usage of Metal API a pleasure
Stars: ✭ 178 (-92.08%)
Mutual labels:  ml
Mad
⚡ MAD: Manage Dependencies
Stars: ✭ 175 (-92.21%)
Mutual labels:  beam
Nipyapi
A convenient Python wrapper for Apache NiFi
Stars: ✭ 169 (-92.48%)
Mutual labels:  dataflow
Functions Framework Go
FaaS (Function as a service) framework for writing portable Go functions
Stars: ✭ 169 (-92.48%)
Mutual labels:  google-cloud
1-60 of 1607 similar projects