All Projects → re-data → Similar Projects or Alternatives

453 Open source projects that are alternatives of or similar to re-data

datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (-56.13%)
Django-Data-quality-system
数据治理、数据质量检核/监控平台(Django+jQuery+MySQL)
Stars: ✭ 143 (-85.03%)
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (-93.93%)
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+508.17%)
Mutual labels:  data-quality, dataquality
NBi
NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-89.32%)
Data-Quality-Analysis
The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)
Stars: ✭ 19 (-98.01%)
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+772.15%)
Mutual labels:  data-analysis, data-quality
hooqu
hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to Python
Stars: ✭ 17 (-98.22%)
osm-data-classification
Migrated to: https://gitlab.com/Oslandia/osm-data-classification
Stars: ✭ 23 (-97.59%)
Mutual labels:  data-analysis, data-quality
penguin-datalayer-collect
A data layer quality monitoring and validation module, this solution is part of the Raft Suite ecosystem.
Stars: ✭ 19 (-98.01%)
dbt ad reporting
Fivetran's ad reporting dbt package. Combine your Facebook, Google, Pinterest, Linkedin, Twitter, Snapchat and Microsoft advertising spend using this package.
Stars: ✭ 68 (-92.88%)
Mutual labels:  dbt, dbt-packages
pyglotaran
A Python library for Global and Target Analysis of time-resolved spectroscopy data
Stars: ✭ 33 (-96.54%)
Mutual labels:  data-analysis
tieba-zhuaqu
百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (-94.14%)
Mutual labels:  data-analysis
crazy-awesome-crypto
A list of awesome crypto and blockchain projects
Stars: ✭ 35 (-96.34%)
Mutual labels:  data-analysis
dflib
In-memory Java DataFrame library
Stars: ✭ 50 (-94.76%)
Mutual labels:  data-analysis
data-disasters
data-disasters.netlify.app/
Stars: ✭ 34 (-96.44%)
Mutual labels:  data-analysis
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (-50.37%)
Mutual labels:  dbt
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (-84.92%)
Mutual labels:  data-quality
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+439.79%)
Mutual labels:  data-analysis
uetai
Custom ML tracking experiment and debugging tools.
Stars: ✭ 17 (-98.22%)
Mutual labels:  data-analysis
LeTourDataSet
Every cyclist and stage of the Tour de France in two CSV files.
Stars: ✭ 61 (-93.61%)
Mutual labels:  data-analysis
tianchi-diabetes
天池精准医疗大赛——人工智能辅助糖尿病遗传风险预测 第一赛季
Stars: ✭ 20 (-97.91%)
Mutual labels:  data-analysis
genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-97.8%)
Mutual labels:  data-analysis
TracIn
Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020)
Stars: ✭ 165 (-82.72%)
Mutual labels:  data-quality
taller SparkR
Taller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-98.74%)
Mutual labels:  data-analysis
Fraud-Detection-in-Online-Transactions
Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (-95.71%)
Mutual labels:  data-analysis
covidviz
Professional visualizations of COVID-19, emulating NYT, The Guardian, Washington Post, The Economist & others, using only Python & Altair.
Stars: ✭ 24 (-97.49%)
Mutual labels:  data-analysis
ipython-notebooks
A collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (-95.5%)
Mutual labels:  data-analysis
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-96.96%)
Mutual labels:  data-quality
ria-jit
Lightweight and performant dynamic binary translation for RISC–V code on x86–64
Stars: ✭ 38 (-96.02%)
Mutual labels:  dbt
meta-csv
A Clojure smart reader for CSV files
Stars: ✭ 20 (-97.91%)
Mutual labels:  data-analysis
PythonTipsDS
Python Tips for Data Scientist
Stars: ✭ 23 (-97.59%)
Mutual labels:  data-analysis
dbt-superset-lineage
Make dbt docs and Apache Superset talk to one another
Stars: ✭ 60 (-93.72%)
Mutual labels:  dbt
golearn
🔥 Golang basics and actual-combat (including: crawler, distributed-systems, data-analysis, redis, etcd, raft, crontab-task)
Stars: ✭ 36 (-96.23%)
Mutual labels:  data-analysis
online-course-recommendation-system
Built on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-96.75%)
Mutual labels:  data-analysis
FDBeye
R tools for eyetracker workflows.
Stars: ✭ 101 (-89.42%)
Mutual labels:  data-analysis
vinum
Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (-94.03%)
Mutual labels:  data-analysis
metrics
📈 What to measure, how to measure it.
Stars: ✭ 14 (-98.53%)
Mutual labels:  data-analysis
python-for-data-and-media-communication-gitbook
An open source book on Python tailed for communication students with zero background
Stars: ✭ 99 (-89.63%)
Mutual labels:  data-analysis
dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (-86.6%)
Mutual labels:  dbt
stats
📈 Useful notes and personal collections on statistics.
Stars: ✭ 16 (-98.32%)
Mutual labels:  data-analysis
software-testing-resource-pack
Various files useful for manual testing and test automation etc.
Stars: ✭ 38 (-96.02%)
Mutual labels:  data-testing
hotmap
WebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-98.64%)
Mutual labels:  data-analysis
computational-neuroscience
Short undergraduate course taught at University of Pennsylvania on computational and theoretical neuroscience. Provides an introduction to programming in MATLAB, single-neuron models, ion channel models, basic neural networks, and neural decoding.
Stars: ✭ 36 (-96.23%)
Mutual labels:  data-analysis
ggshakeR
An analysis and visualization R package that works with publicly available soccer data
Stars: ✭ 69 (-92.77%)
Mutual labels:  data-analysis
RepSeP
Reproducible Self-Publishing - Demo Publications in the Most Common Formats
Stars: ✭ 14 (-98.53%)
Mutual labels:  data-analysis
r-resources-for-data-science
A biggest collection of free books and other resources for R programming
Stars: ✭ 24 (-97.49%)
Mutual labels:  data-analysis
dataquest-guided-projects-solutions
My dataquest project solutions
Stars: ✭ 35 (-96.34%)
Mutual labels:  data-analysis
elucidate
convenience functions to help researchers elucidate patterns in their data
Stars: ✭ 26 (-97.28%)
Mutual labels:  data-analysis
advanced-pandas
Pandas is a powerful tool for data exploration and analysis (including timeseries).
Stars: ✭ 22 (-97.7%)
Mutual labels:  data-analysis
dataViz CADi
Materials for the "Data Visualization" CADi workshop @ "Tecnológico de Monterrey"
Stars: ✭ 14 (-98.53%)
Mutual labels:  data-analysis
Naive-Resume-Matching
Text Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-97.17%)
Mutual labels:  data-analysis
dbt-clickhouse
The Clickhouse plugin for dbt (data build tool)
Stars: ✭ 77 (-91.94%)
Mutual labels:  dbt
iMOKA
interactive Multi Objective K-mer Analysis
Stars: ✭ 19 (-98.01%)
Mutual labels:  data-analysis
mixedvines
Python package for canonical vine copula trees with mixed continuous and discrete marginals
Stars: ✭ 36 (-96.23%)
Mutual labels:  data-analysis
8-Week-SQL-Challenge
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
Stars: ✭ 43 (-95.5%)
Mutual labels:  data-analysis
Moose
MOOSE - Platform for software and data analysis.
Stars: ✭ 110 (-88.48%)
Mutual labels:  data-analysis
ospi
Open Source Presence Infographic of Indian Startups
Stars: ✭ 25 (-97.38%)
Mutual labels:  data-analysis
lightdash
An open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+13.3%)
Mutual labels:  dbt
open-digger
Open source analysis tools
Stars: ✭ 193 (-79.79%)
Mutual labels:  data-analysis
1-60 of 453 similar projects