versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+800%)
beneathBeneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+306.25%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+25518.75%)
wikirepoPython based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (+106.25%)
dbddbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (+87.5%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+30643.75%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+393.75%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (+25%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+143.75%)
quickstepQuickstep project
Stars: ✭ 22 (+37.5%)
mydataharbor🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (+75%)
sqlite-guiLightweight SQLite editor for Windows
Stars: ✭ 151 (+843.75%)
dogETLA lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-6.25%)
persistityA persistence framework for game developers
Stars: ✭ 34 (+112.5%)
Musical-WorldDBMS Mini Project that basically designed for online music player
Stars: ✭ 59 (+268.75%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+3725%)
Clinic-Management-System-ASP.NET👨⚕️ A fully featured Clinic Management System based on three tier architecture made using ASP.NET, C# with a well documented README.md file.
Stars: ✭ 82 (+412.5%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (+0%)
magedbm💾 Magento 1.x Database Backup Manager
Stars: ✭ 38 (+137.5%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (+175%)
cobrixA COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+581.25%)
OpenKettleWebUI一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+762.5%)
singer-runnerA CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+106.25%)
docker-omnidbOmniDB installed into a Docker container
Stars: ✭ 30 (+87.5%)
iridium💎 Growing collection of VS Code extensions with a fancy name
Stars: ✭ 39 (+143.75%)
sql-to-redis🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (+12.5%)
maxwell-sinkconsume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (+0%)
DQCS数据质量控制系统
Stars: ✭ 34 (+112.5%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+350%)
maricutodbPHP Flat File Database Manager
Stars: ✭ 23 (+43.75%)
dswarman open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+256.25%)
opentrials-airflowConfiguration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (+12.5%)
go-bqloaderbqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (+0%)
covid-19Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-12.5%)
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+56.25%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (+31.25%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+318.75%)
python mozetlETL jobs for Firefox Telemetry
Stars: ✭ 25 (+56.25%)
heuristCore development repository. gitHub: Vsn 6 (2020 - ), Vsn 5 (2018 - 2020), Vsn 4 (2014-2017). Sourceforge: Vsn 3 (2009-2013), Vsn 1 & 2 (2005-2009)
Stars: ✭ 39 (+143.75%)
shieldShield is a role-based cloud-native user management system, identity & access proxy, and authorization server for your applications and API endpoints.
Stars: ✭ 158 (+887.5%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (+50%)
datajoint-pythonRelational data pipelines for the science lab
Stars: ✭ 140 (+775%)
dbuiUniversal Database CLI for MySQL, PostgreSQL, and SQLite. Terminal User Interface Application.
Stars: ✭ 110 (+587.5%)
neon-workshopA Pachyderm deep learning tutorial for conference workshops
Stars: ✭ 19 (+18.75%)
CVparserCVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (+75%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+193.75%)
wrangleA data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-6.25%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+281.25%)
flockFlock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+1350%)
mikThe Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (+100%)
FIFA-18-Management-SystemThis repository contains the whole project. This project was intended to exhibit as a DBMS project but it can also act as a web development project as it includes complete front end and back end.
Stars: ✭ 42 (+162.5%)