1594 open source projects that are alternatives to or similar to Butterfree

Etl with python
ETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-46.03%)
Mutual labels:  data-science, etl
hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-70.63%)
Mutual labels:  etl, data-engineering
Cql
Categorical Query Language IDE
Stars: ✭ 196 (+55.56%)
Mutual labels:  data-science, etl
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-54.76%)
Mutual labels:  etl, data-engineering
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-62.7%)
Mutual labels:  etl, data-engineering
DataEngineering
This repo contains commands that data engineers use in day-to-day work.
Stars: ✭ 47 (-62.7%)
Mutual labels:  pyspark, data-engineering
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-87.3%)
Mutual labels:  etl, pyspark
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-86.51%)
Mutual labels:  etl, pyspark
Awesome Business Intelligence
Actively curated list of awesome BI tools. PRs welcome!
Stars: ✭ 1,157 (+818.25%)
Mutual labels:  data-science, etl
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-33.33%)
Mutual labels:  etl, etl-framework
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-38.89%)
Mutual labels:  etl, data-engineering
Dataform
Dataform is a framework for managing SQL-based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+171.43%)
Mutual labels:  etl, data-engineering
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+20.63%)
Mutual labels:  data-science, data-engineering
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-49.21%)
Mutual labels:  etl, etl-framework
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+585.71%)
Mutual labels:  data-science, data-engineering
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+14.29%)
Mutual labels:  etl, etl-framework
Pyetl
Python ETL framework
Stars: ✭ 33 (-73.81%)
Mutual labels:  etl, etl-framework
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+61.11%)
Mutual labels:  etl, etl-framework
BETL-old
BETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-86.51%)
Mutual labels:  etl, etl-framework
Learn Something Every Day
📝 A compilation of everything that I learn: Computer Science, Software Development, Engineering, Math, and Coding in General.
Stars: ✭ 362 (+187.3%)
Mutual labels:  data-science, data-engineering
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+961.9%)
Mutual labels:  data-science, pyspark
Applied Ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+14046.03%)
Mutual labels:  data-science, data-engineering
Openkettlewebui
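A Kettle-based web scheduling and control platform for data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.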
Stars: ✭ 125 (-0.79%)
Mutual labels:  etl, etl-framework
Headlesschrome
A Go package for working with headless Chrome. Run interactive JavaScript commands on web pages with Go and Chrome.
Stars: ✭ 112 (-11.11%)
Mutual labels:  package
Laravel Natural Language
This package makes using the Google Natural API in your laravel app a breeze with minimum to no configuration, clean syntax and a consistent package API.
Stars: ✭ 119 (-5.56%)
Mutual labels:  package
Datepickertimelineflutter
Flutter Date Picker Library that provides a calendar as a horizontal timeline
Stars: ✭ 112 (-11.11%)
Mutual labels:  package
Labeled Tweet Generator
Search for tweets and download the data labeled with its polarity in CSV format
Stars: ✭ 111 (-11.9%)
Mutual labels:  data-science
Graceful
Graceful shutdown of Go 1.8+ servers using Server.Shutdown
Stars: ✭ 123 (-2.38%)
Mutual labels:  package
Automunge
Artificial Learning, Intelligent Machines
Stars: ✭ 119 (-5.56%)
Mutual labels:  data-science
Blazingsql
BlazingSQL is a lightweight, GPU-accelerated SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+1211.11%)
Mutual labels:  data-science
Waterdrop
Production Ready Data Integration Product
Stars: ✭ 1,856 (+1373.02%)
Mutual labels:  etl-framework
Quantified Self
Self-knowledge through numbers
Stars: ✭ 118 (-6.35%)
Mutual labels:  data-science
Responsible Ai Widgets
This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.
Stars: ✭ 107 (-15.08%)
Mutual labels:  data-science
Mtcnn
MTCNN face detection implementation for TensorFlow, as a PIP package.
Stars: ✭ 1,689 (+1240.48%)
Mutual labels:  package
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+8479.37%)
Mutual labels:  data-science
Gun Violence Data
A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.
Stars: ✭ 123 (-2.38%)
Mutual labels:  data-science
Sentinel Crawler
Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler (Node / Go / Java / Rust) for Web, RDB, OS; it can also act as a Monitor (with Prometheus) or ETL for Infrastructure 💫 Multi-language executor, distributed crawler
Stars: ✭ 118 (-6.35%)
Mutual labels:  etl
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-13.49%)
Mutual labels:  data-science
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Stars: ✭ 108 (-14.29%)
Mutual labels:  data-science
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-13.49%)
Mutual labels:  data-science
Docusign Node Client
The Official DocuSign Node.js Client Library used to interact with the eSign REST API. Send, sign, and approve documents using this client.
Stars: ✭ 108 (-14.29%)
Mutual labels:  package
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-2.38%)
Mutual labels:  data-science
Chain.jl
A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
Stars: ✭ 118 (-6.35%)
Mutual labels:  data-science
Logger json
JSON console backend for Elixir Logger.
Stars: ✭ 108 (-14.29%)
Mutual labels:  package
Dash Stock Tickers Demo App
Dash Demo App - Stock Tickers
Stars: ✭ 108 (-14.29%)
Mutual labels:  data-science
Pie chart
Flutter Pie chart with animation
Stars: ✭ 117 (-7.14%)
Mutual labels:  package
Aws Ecs Airflow
Run Airflow in AWS ECS (Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-15.08%)
Mutual labels:  etl
Package Skeleton Php
A skeleton repository for Spatie's PHP Packages
Stars: ✭ 126 (+0%)
Mutual labels:  package
Kiba
Data processing & ETL framework for Ruby
Stars: ✭ 1,618 (+1184.13%)
Mutual labels:  etl
Open
DiffusionKinetics open-source monorepo
Stars: ✭ 116 (-7.94%)
Mutual labels:  data-science
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-14.29%)
Mutual labels:  pyspark
Ml Email Clustering
Email clustering with machine learning
Stars: ✭ 116 (-7.94%)
Mutual labels:  data-science
Scikit Learn
scikit-learn: machine learning in Python
Stars: ✭ 48,322 (+38250.79%)
Mutual labels:  data-science
Allennlp
An open-source NLP research library, built on PyTorch.
Stars: ✭ 10,699 (+8391.27%)
Mutual labels:  data-science
Pharbuilder
Create a Phar of a Composer-based PHP application
Stars: ✭ 122 (-3.17%)
Mutual labels:  package
Learn Machine Learning
Learn to Build a Machine Learning Application from Top Articles
Stars: ✭ 116 (-7.94%)
Mutual labels:  data-science
Awesome Bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+8215.87%)
Mutual labels:  data-science
Ai Expert Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+12154.76%)
Mutual labels:  data-science
Datax
DataX is an open source universal ETL tool that supports Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto (Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-7.94%)
Mutual labels:  etl
Tflearn
Deep learning library featuring a higher-level API for TensorFlow.
Stars: ✭ 9,573 (+7497.62%)
Mutual labels:  data-science