All Projects → Airbyte → Similar Projects or Alternatives

3066 Open source projects that are alternatives of or similar to Airbyte

Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-98.39%)
Mara Pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (-62.57%)
Mutual labels:  pipeline, etl, data, data-integration
Datacleaner
The premier open source Data Quality solution
Stars: ✭ 391 (-92.05%)
Mutual labels:  data-science, data-analysis, etl, data
Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (-87.5%)
Mutual labels:  pipeline, etl, pipelines
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+766.72%)
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-98.43%)
Awesome Business Intelligence
Actively curated list of awesome BI tools. PRs welcome!
Stars: ✭ 1,157 (-76.48%)
Mutual labels:  data-science, data-analysis, etl
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (-78.13%)
Mutual labels:  data-science, data-analysis, data
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (-95.55%)
Mutual labels:  integration, pipeline, etl
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (-97.07%)
Mutual labels:  etl, data-engineering, elt
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+18.07%)
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-82.44%)
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-96.52%)
Mutual labels:  data-science, data-analysis, data
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (-11.89%)
Mutual labels:  data-science, data-analysis, data
Skdata
Python tools for data analysis
Stars: ✭ 16 (-99.67%)
Mutual labels:  data-science, data-analysis, data
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (-51.51%)
Mutual labels:  data-science, etl, data-engineering
Chain.jl
A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
Stars: ✭ 118 (-97.6%)
Mutual labels:  data-science, data-analysis, pipeline
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+73.43%)
Mutual labels:  data-science, data-analysis, data
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-95.41%)
Mutual labels:  data-science, data, data-engineering
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-97.44%)
Mutual labels:  data-science, etl, data-engineering
Datascience course
Curso de Data Science em Português
Stars: ✭ 294 (-94.02%)
Mutual labels:  data-science, data-analysis, data
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-87.13%)
Mutual labels:  data-science, etl, data-engineering
Knowledge Repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+0.75%)
Mutual labels:  data-science, data-analysis, data
Graphia
A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-98.64%)
Mutual labels:  data-science, data-analysis, data
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-98.82%)
Mutual labels:  data-science, data-analysis, data
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-98.39%)
Mutual labels:  data-science, etl, data-engineering
Steppy
Lightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (-97.58%)
Mutual labels:  data-science, pipeline, open-source
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-94.45%)
Mutual labels:  data-science, data-analysis, data
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (-80.04%)
Mutual labels:  data-science, pipeline, pipelines
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-69.28%)
Mutual labels:  data-science, data, data-engineering
Pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-97.42%)
Pdpipe
Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (-88.01%)
Mutual labels:  data-science, pipeline, data
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-99.55%)
Mutual labels:  etl, data-engineering, elt
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-99.57%)
Mutual labels:  data-science, pipeline, open-source
Gopup
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (-75.02%)
Mutual labels:  data-science, data-analysis, data
Flyte
Accelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (-74.75%)
Mutual labels:  data-science, data-analysis, data
Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+113.01%)
Mutual labels:  data-science, data
Ai Expert Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+213.91%)
Mutual labels:  data-science, data-analysis
Pandas Datareader
Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (-55.62%)
Mutual labels:  data-analysis, data
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-97.82%)
Mutual labels:  data-science, data-analysis
Scikit Learn
scikit-learn: machine learning in Python
Stars: ✭ 48,322 (+882.35%)
Mutual labels:  data-science, data-analysis
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-97.8%)
Mutual labels:  data-science, data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-97.78%)
Mutual labels:  data-science, data-analysis
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Stars: ✭ 108 (-97.8%)
Mutual labels:  data-science, data-analysis
Xda
R package for exploratory data analysis
Stars: ✭ 112 (-97.72%)
Mutual labels:  data-science, data-analysis
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (-62.37%)
Mutual labels:  data-science, data-analysis
Cubes
Light-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (-71.68%)
Mutual labels:  data-analysis, data
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-97.78%)
Mutual labels:  data-science, data
Algocode
Welcome everyone!🌟 Here you can solve problems, build scrappers and much more💻
Stars: ✭ 113 (-97.7%)
Mutual labels:  data-science, open-source
Loandefault Prediction
Lending Club Loan data analysis
Stars: ✭ 113 (-97.7%)
Mutual labels:  data-science, data-analysis
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (-69.18%)
Mutual labels:  data-science, data-analysis
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-97.68%)
Mutual labels:  data-science, data-analysis
D6t Python
Accelerate data science
Stars: ✭ 118 (-97.6%)
Mutual labels:  data-science, data-engineering
Auptimizer
An automatic ML model optimization tool.
Stars: ✭ 166 (-96.63%)
Mutual labels:  data-science, data-engineering
Kiba
Data processing & ETL framework for Ruby
Stars: ✭ 1,618 (-67.11%)
Mutual labels:  etl, data
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-97.52%)
Mutual labels:  data-science, data-engineering
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-97.5%)
Mutual labels:  data-science, data-analysis
Open Solution Salt Identification
Open solution to the TGS Salt Identification Challenge
Stars: ✭ 124 (-97.48%)
Mutual labels:  data-science, pipeline
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-71.99%)
Mutual labels:  data-science, data
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-97.7%)
Mutual labels:  data-science, data-analysis
1-60 of 3066 similar projects