RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (-2.9%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+13.78%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+204.02%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (-75.83%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (-92.03%)
GrafterLinked Data & RDF Manufacturing Tools in Clojure
Stars: ✭ 174 (-89.25%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-95.49%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-14.83%)
MhworlddataGenerate a SQLite file from MHW data
Stars: ✭ 110 (-93.2%)
OdCzech open data
Stars: ✭ 99 (-93.88%)
RlpRecursive Length Prefix Encoding in JavaScript
Stars: ✭ 93 (-94.25%)
PoochA friend to fetch your data files.
Stars: ✭ 101 (-93.76%)
DatxDatX is an opinionated JS/TS data store. It features support for simple property definition, references to other models and first-class TypeScript support.
Stars: ✭ 111 (-93.14%)
Aurelia SlickgridAurelia-Slickgrid, a wrapper of the lightning fast & customizable SlickGrid datagrid with a few styling themes
Stars: ✭ 100 (-93.82%)
DataxDataX is an open source universal ETL tool that supports Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-92.83%)
Glom☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
Stars: ✭ 1,341 (-17.12%)
Caryon🔖 A C++-based helper tool for setting and solving OI/ACM contest problems ⭐
Stars: ✭ 109 (-93.26%)
Hearthstone DbA JSON collection of all Hearthstone cards. Hearthstone database.
Stars: ✭ 117 (-92.77%)
Ml PyxisTool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-94.25%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-92.89%)
TopokanjiTopologically ordered lists of kanji for effective learning
Stars: ✭ 108 (-93.33%)
PydapA Python library implementing the Data Access Protocol (DAP, aka OPeNDAP or DODS).
Stars: ✭ 90 (-94.44%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-93.33%)
PasterPasting text data from a clipboard directly to Sketch text layers [Sketch plugin]
Stars: ✭ 88 (-94.56%)
Kafka Connectequivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (-93.7%)
RepurrrsiveRecursive lists to use in teaching and examples, because there is no iris data for lists.
Stars: ✭ 112 (-93.08%)
Csv2dbThe CSV to database command line loader
Stars: ✭ 102 (-93.7%)
Datastore🐹 Bloat free and flexible interface for data store and database access.
Stars: ✭ 99 (-93.88%)
Sentinel CrawlerXenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler (Node / Go / Java / Rust) for Web, RDB, OS; can also act as a Monitor (with Prometheus) or ETL for Infrastructure 💫 Multi-language executor, distributed crawler
Stars: ✭ 118 (-92.71%)
Covid19 scenariosModels of COVID-19 outbreak trajectories and hospital demand
Stars: ✭ 1,355 (-16.25%)
NycdbDatabase of NYC Housing Data
Stars: ✭ 94 (-94.19%)
Geo Data Viewer🗺️ Geo Data Viewer w/0 Py 🐍 || pyWidgets ⚙️ || pandas 🐼 || @reactjs ⚛️ required to gen. some snazzy maps 🗺️ with keplerGL ...
Stars: ✭ 115 (-92.89%)
Open Data Etl Utility KitUse Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en/stable
Stars: ✭ 93 (-94.25%)
BabynamesAn R package containing US baby names from the SSA
Stars: ✭ 108 (-93.33%)
Binarykit💾🔍🧮 BinaryKit helps you to break down binary data into bits and bytes, easily access specific parts and write data to binary.
Stars: ✭ 92 (-94.31%)
Awesome Opendata RusOpen data resources in Russian
Stars: ✭ 121 (-92.52%)
Nitric[ABANDONED] General-purpose data processing library. Mirror of https://gitlab.com/nitric/nitric
Stars: ✭ 90 (-94.44%)
Aws Ecs AirflowRun Airflow in AWS ECS (Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-93.39%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-6.61%)
EtlLinkedPipes ETL is an RDF based, lightweight ETL tool
Stars: ✭ 88 (-94.56%)
Awesome BigdataA curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+547.59%)
Js Adler32☑️ ADLER-32 checksum
Stars: ✭ 116 (-92.83%)
D3vueA D3 Plugin for VueJS
Stars: ✭ 87 (-94.62%)
Vue Table Dynamic🎉 A dynamic table with sorting, filtering, editing, pagination, multiple select, etc.
Stars: ✭ 106 (-93.45%)
Rest HooksDelightful data fetching for React.
Stars: ✭ 1,276 (-21.14%)
OpenfintechOpensource FinTech standards & payment provider data
Stars: ✭ 87 (-94.62%)
TksheetPython 3.6+ tkinter table widget for displaying tabular data
Stars: ✭ 86 (-94.68%)
CoreOpen source Dota 2 data platform
Stars: ✭ 1,266 (-21.76%)
RgbifInterface to the Global Biodiversity Information Facility API
Stars: ✭ 113 (-93.02%)
PlatformCode Climate Engineering Data Platform
Stars: ✭ 104 (-93.57%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-94.81%)