Etl with pythonETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-46.03%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (+55.56%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-54.76%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-62.7%)
DataEngineeringThis repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (-62.7%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-87.3%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-86.51%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-33.33%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-38.89%)
DataformDataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+171.43%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+20.63%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-49.21%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+585.71%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+14.29%)
Pyetlpython ETL framework
Stars: ✭ 33 (-73.81%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+61.11%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-86.51%)
Learn Something Every Day📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General. Read the rendered results here ->
Stars: ✭ 362 (+187.3%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+961.9%)
Applied Ml📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+14046.03%)
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (-0.79%)
HeadlesschromeA Go package for working with headless Chrome. Run interactive JavaScript commands on web pages with Go and Chrome.
Stars: ✭ 112 (-11.11%)
Laravel Natural LanguageThis package makes using the Google Natural API in your laravel app a breeze with minimum to no configuration, clean syntax and a consistent package API.
Stars: ✭ 119 (-5.56%)
DatepickertimelineflutterFlutter Date Picker Library that provides a calendar as a horizontal timeline
Stars: ✭ 112 (-11.11%)
Labeled Tweet GeneratorSearch for tweets and download the data labeled with its polarity in CSV format
Stars: ✭ 111 (-11.9%)
GracefulGraceful shutdown of Go 1.8+ servers using Server.Shutdown
Stars: ✭ 123 (-2.38%)
AutomungeArtificial Learning, Intelligent Machines
Stars: ✭ 119 (-5.56%)
BlazingsqlBlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+1211.11%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1373.02%)
Responsible Ai WidgetsThis project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.
Stars: ✭ 107 (-15.08%)
MtcnnMTCNN face detection implementation for TensorFlow, as a PIP package.
Stars: ✭ 1,689 (+1240.48%)
Dive Into Machine LearningDive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+8479.37%)
Gun Violence DataA comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.
Stars: ✭ 123 (-2.38%)
Sentinel CrawlerXenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure 💫 多语言执行器,分布式爬虫
Stars: ✭ 118 (-6.35%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-13.49%)
Docusign Node ClientThe Official DocuSign Node.js Client Library used to interact with the eSign REST API. Send, sign, and approve documents using this client.
Stars: ✭ 108 (-14.29%)
Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-2.38%)
Chain.jlA Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
Stars: ✭ 118 (-6.35%)
Logger jsonJSON console backend for Elixir Logger.
Stars: ✭ 108 (-14.29%)
Pie chartFlutter Pie chart with animation
Stars: ✭ 117 (-7.14%)
Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-15.08%)
KibaData processing & ETL framework for Ruby
Stars: ✭ 1,618 (+1184.13%)
OpenDiffusionKinetics open-source monorepo
Stars: ✭ 116 (-7.94%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-14.29%)
Scikit Learnscikit-learn: machine learning in Python
Stars: ✭ 48,322 (+38250.79%)
AllennlpAn open-source NLP research library, built on PyTorch.
Stars: ✭ 10,699 (+8391.27%)
PharbuilderCreate Phar of Composer based PHP application
Stars: ✭ 122 (-3.17%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+8215.87%)
Ai Expert RoadmapRoadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+12154.76%)
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-7.94%)
TflearnDeep learning library featuring a higher-level API for TensorFlow.
Stars: ✭ 9,573 (+7497.62%)