blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+21.28%)
Azure Cosmos Js@azure/cosmos has moved to a new repo https://github.com/Azure/azure-sdk-for-js
Stars: ✭ 201 (+327.66%)
Ftserver CsLightweight iBoxDB Full Text Search Server for C#
Stars: ✭ 81 (+72.34%)
skytableSkytable is an extremely fast, secure and reliable real-time NoSQL database with automated snapshots and TLS
Stars: ✭ 696 (+1380.85%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1246.81%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+68.09%)
databaseKey-Value/Document store database library with btree and ARTree indexing methods, SSN-MVCC concurrency
Stars: ✭ 67 (+42.55%)
BenthosFancy stream processing made operationally mundane
Stars: ✭ 3,705 (+7782.98%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+12.77%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-57.45%)
RavendbACID Document Database
Stars: ✭ 2,870 (+6006.38%)
TiedotA rudimentary implementation of a basic document (NoSQL) database in Go
Stars: ✭ 2,643 (+5523.4%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+168.09%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+4974.47%)
OrientdbOrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries. OrientDB Community Edition is Open Source using a liberal Apache 2 license.
Stars: ✭ 4,394 (+9248.94%)
Arangodb🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
Stars: ✭ 11,880 (+25176.6%)
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+206.38%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (-6.38%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+63.83%)
beneathBeneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+38.3%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+10365.96%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+493.62%)
DataformDataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+627.66%)
Rethinkdb.driver🎧 A NoSQL C#/.NET RethinkDB database driver with 100% ReQL API coverage.
Stars: ✭ 350 (+644.68%)
Mongodb Quickstart CourseCourse demos and handout material for Talk Python's MongoDB Quickstart course
Stars: ✭ 220 (+368.09%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1202.13%)
etl managerA python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-70.21%)
FtserverLightweight Embeddable iBoxDB Full Text Search Server for Java
Stars: ✭ 219 (+365.96%)
SaynData processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (+68.09%)
docsSource code of the ArangoDB online documentation
Stars: ✭ 18 (-61.7%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+365.96%)
deordie-meetupsDE or DIE meetup made by data engineers for data engineers. Currently in Russian only.
Stars: ✭ 48 (+2.13%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-17.02%)
google-sheets-etlLive import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-68.09%)
proc-thatproc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-46.81%)
nedb-replThe command-line tool for NeDB
Stars: ✭ 19 (-59.57%)
mikiWiki system in PHP+NoDB in just one file. 10s setup + auto-installed. Full Markdown support. Super fast and lightweight (-0.01MB gzip). Multi-User support. Minimal and beautiful.
Stars: ✭ 25 (-46.81%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (+27.66%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+1293.62%)
get smartiesDummy variable generation with fit/transform capabilities
Stars: ✭ 23 (-51.06%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-63.83%)
grandnode2Free, Open source, Fast, Headless, Multi-tenant eCommerce platform built with .NET Core, MongoDB, AWS DocumentDB, Azure CosmosDB, LiteDB, Vue.js.
Stars: ✭ 626 (+1231.91%)
contessaEasy way to define, execute and store quality rules for your data.
Stars: ✭ 17 (-63.83%)
acebaseA fast, low memory, transactional, index & query enabled NoSQL database engine and server for node.js and browser with realtime data change notifications
Stars: ✭ 288 (+512.77%)
starlakeStarlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-65.96%)
datartDatart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+2117.02%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+53.19%)
Everything-TechA collection of online resources to help you on your Tech journey.
Stars: ✭ 396 (+742.55%)
papiloDEPRECATED: Stream data processing micro-framework
Stars: ✭ 24 (-48.94%)
zdh server数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (+12.77%)