All Projects → redata-team → Redata

redata-team / Redata

Licence: mit
Monitoring system for data teams. Computing health checks on data, visualizing and alerting on them.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Redata

Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-5.52%)
Mutual labels:  data-analysis, data-visualization
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1191.16%)
Mutual labels:  data-analysis, data-visualization
Dtale Desktop
Build a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (-29.28%)
Mutual labels:  data-analysis, data-visualization
Awesome Scientific Python
A curated list of awesome scientific Python resources
Stars: ✭ 127 (-29.83%)
Mutual labels:  data-analysis, data-visualization
Countly Sdk Web
Countly Product Analytics SDK for websites and web applications
Stars: ✭ 165 (-8.84%)
Mutual labels:  data-analysis, data-visualization
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+1482.32%)
Mutual labels:  data-analysis, data-visualization
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-17.68%)
Mutual labels:  data-analysis, data-visualization
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-37.02%)
Mutual labels:  data-analysis, data-visualization
Report Designer
🚀 打印设计、可视化、大屏、编辑器、设计器、数据分析、报表设计、组件化、表单设计、h5页面、调查问卷、pdf生成、流程图、试卷、SVG、图形元素、物联网
Stars: ✭ 160 (-11.6%)
Mutual labels:  data-analysis, data-visualization
Etl unicorn
数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (-13.81%)
Mutual labels:  data-analysis, data-visualization
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-32.04%)
Mutual labels:  data-analysis, data-visualization
Matplotlib Doc Zh
📖 [译] Matplotlib 用户指南
Stars: ✭ 178 (-1.66%)
Mutual labels:  data-analysis, data-visualization
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+852.49%)
Mutual labels:  data-analysis, data-visualization
Eegrunt
A Collection Python EEG (+ ECG) Analysis Utilities for OpenBCI and Muse
Stars: ✭ 171 (-5.52%)
Mutual labels:  data-analysis, data-visualization
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+737.57%)
Mutual labels:  data-analysis, data-visualization
Aachartkit Swift
📈📊📱💻🖥️An elegant modern declarative data visualization chart framework for iOS, iPadOS and macOS. Extremely powerful, supports line, spline, area, areaspline, column, bar, pie, scatter, angular gauges, arearange, areasplinerange, columnrange, bubble, box plot, error bars, funnel, waterfall and polar chart types. 极其精美而又强大的跨平台数据可视化图表框架,支持柱状图、条形图、…
Stars: ✭ 1,962 (+983.98%)
Mutual labels:  data-analysis, data-visualization
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+922.65%)
Mutual labels:  data-analysis, data-visualization
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-37.57%)
Mutual labels:  data-analysis, data-visualization
Sourced Ce
source{d} Community Edition (CE)
Stars: ✭ 153 (-15.47%)
Mutual labels:  data-analysis, data-visualization
Dabestr
Data Analysis with Bootstrap Estimation in R
Stars: ✭ 169 (-6.63%)
Mutual labels:  data-analysis, data-visualization

Slack Docker pulls License Last commit

Redata

Do you know this feeling? - Something in your data broke, you patiently added monitoring to detect that in the future... only so, that the next time different not expected thing went wrong :)

Redata is monitoring system for data teams. Automatically computing health checks on all your tables, visualizing them over time, and alerting on them.

Key features

Metrics layer

Redata computes health metrics for your data, containing information like this:

  • time since last record was added
  • number of records added in last (hour/day/week/month)
  • schema changes that recently happened
  • number of missing values in columns over time
  • min/max/avg of values and lenghts of strings in colums
  • other user defined metrics

UI with alerts & tables

Redata UI enables you to view all your tables, their health and alerts of unexpected situations. You can also adjust checks generated by Redata here.

Automatic dashboards

Having metrics in one common format makes it possible to create table health dashboards automatically Here are some examples of how Grafana dashboards look like:

Smart alerts

Redata compares metrics computed in the past to current metrics and alerts if anomalies are found. This means that situations like this:

  • sudden drops or increases in the volume of new records added to your tables
  • longer than expected break between data arrivals
  • significantly different maximal/minimal/avg numbers in any of table columns
  • and more

Would be detected, and you will be alerted. Redata supports Slack (with others tools possible to integerate for you via Grafana) so you can also set up alerts to your favorite support channel.

Benefits over doing monitoring yourself

What are benefits of using Redata instead of implementing data monitoring yourself? Here is a our list :)

  • UI showing health of your tables - See your tables and their health easily

  • Automatic and up to date health dashboards - It's normally quite cumbersome to setup proper monitoring for all tables and keeping it up to date is hard - redata can do that for you, detecting new tables and columns and automatically creating dashboards/panels for them.

  • Smart alerts - Once tables are detected redata automatically tracks their health and looks for anomalies there. Alerts are designed specifically for data quality checks and separete from Grafana alerts (no limits on what to alert on, etc.)

  • Visualizing new, previously impossible things - Things like schema changes, cannot be queried from DB, redata compares snapshots of your schemas and alert if this change

  • Big set of predefined and effectively computed metrics - Redata comes with large set of predefined metrics, computed out of box for your tables. We also optimize queries computing them, so that it's effective and fast.

Getting started (local machine setup)

git clone https://github.com/redata-team/redata.git
cd redata

docker-compose up

Now visit http://localhost:5000, add your database and starting monitoring your data. Default password/user for Redata/Grafana app is redata :)

Deploying on production

Redata uses docker and docker-compose for deployment, this makes it easy to deploy in the cloud, or in your on premise enviroment.

Look at sample setup instructions for specfic cloud providers:

Community

Join Slack for general questions about using redata, problems, and discussions with people making it :)

Integrations

Here are integrations we support or work on now. Let us know if you'd really like to pritize something or your DB is not included on the list.

Integration Status
PostgreSQL Supported
MySQL Supported
Exasol Supported
BigQuery Supported
Apache Airflow Supported, view all your checks in Airflow
Grafana Supported, view metrics here
Slack Supported, get alerts on Slack
Other SQL DBs Experimental support via using SQLAlchemy
AWS Redshift Supported
AWS S3 In development
Excel Planned
Snowflake Supported

License

Redata is licensed under the MIT license. See the LICENSE file for licensing information.

Docs

Want to learn a bit more on how Redata works (work in progress docs are here :) We recommend starting with data source which explains how to configure you DB. Later on table, scan, alert are views you most likely will be checking first when using Redata. Checks and editing them enables configuring Redata to your specific needs.

Contributing

We love all contributions, bigger and smaller.

Checkout out current list of issues here and see if you like anything from there. Also feel welcome to join our Slack and suggest ideas, or setup no pressure session with Redata here.

More details on how to tests your changes under: CONTRIBUTING

If you got this far and like what we are building, support us! Star https://github.com/redata-team/redata on Github :)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].