Applied Ml📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+10702.42%)
Mutual labels: data-quality
penguin-datalayer-collectA data layer quality monitoring and validation module, this solution is part of the Raft Suite ecosystem.
Stars: ✭ 19 (-88.48%)
Mutual labels: data-quality
osm-data-classificationMigrated to: https://gitlab.com/Oslandia/osm-data-classification
Stars: ✭ 23 (-86.06%)
Mutual labels: data-quality
Real-Time-Abnormal-Events-Detection-and-Tracking-in-Surveillance-SystemThe main abnormal behaviors that this project can detect are: Violence, covering camera, Choking, lying down, Running, Motion in restricted areas. It provides much flexibility by allowing users to choose the abnormal behaviors they want to be detected and keeps track of every abnormal event to be reviewed. We used three methods to detect abnorma…
Stars: ✭ 35 (-78.79%)
Mutual labels: influence
soda-sparkSoda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (-64.85%)
Mutual labels: data-quality
great expectations actionA GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
Stars: ✭ 66 (-60%)
Mutual labels: data-quality
Great expectationsAlways know what to expect from your data.
Stars: ✭ 5,808 (+3420%)
Mutual labels: data-quality
Data-Quality-AnalysisThe PEDSnet Data Quality Assessment Toolkit (OMOP CDM)
Stars: ✭ 19 (-88.48%)
Mutual labels: data-quality
contessaEasy way to define, execute and store quality rules for your data.
Stars: ✭ 17 (-89.7%)
Mutual labels: data-quality
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (-12.73%)
Mutual labels: data-quality
hive compared bqhive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Stars: ✭ 27 (-83.64%)
Mutual labels: data-quality
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-38.18%)
Mutual labels: data-quality
popular-github-template📗 Repo Template: Make Your GitHub Repos More Popular
Stars: ✭ 16 (-90.3%)
Mutual labels: influence
roguelike-universeUnderstanding game design inspiration of roguelike games via web scraping and network analysis.
Stars: ✭ 17 (-89.7%)
Mutual labels: influence
check-engineData validation library for PySpark 3.0.0
Stars: ✭ 29 (-82.42%)
Mutual labels: data-quality
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+4947.88%)
Mutual labels: data-quality
dqlab-career-trackA collection of scripts written to complete DQLab Data Analyst Career Track 📊
Stars: ✭ 53 (-67.88%)
Mutual labels: data-quality
hooquhooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to Python
Stars: ✭ 17 (-89.7%)
Mutual labels: data-quality
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+153.94%)
Mutual labels: data-quality
leilaLibrería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (-66.06%)
Mutual labels: data-quality