getmetamapper / metamapper

License: BSD-2-Clause
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Programming Languages

  • Python
  • JavaScript
  • TSQL
  • SCSS
  • Shell
  • HTML

Projects that are alternatives of or similar to metamapper

Datahub
The Metadata Platform for the Modern Data Stack
Stars: ✭ 4,232 (+6953.33%)
Mutual labels:  metadata, data-catalog, data-discovery
Amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Stars: ✭ 2,901 (+4735%)
Mutual labels:  metadata, data-catalog, data-discovery
whale
🐳 The stupidly simple CLI workspace for your data warehouse.
Stars: ✭ 696 (+1060%)
Mutual labels:  data-catalog, data-discovery
sqllineage
SQL Lineage Analysis Tool powered by Python
Stars: ✭ 348 (+480%)
Mutual labels:  metadata, data-discovery
react-native-xaml
A React Native Windows library to use XAML / WinUI controls
Stars: ✭ 55 (-8.33%)
Mutual labels:  metadata
icc
JavaScript module to parse International Color Consortium (ICC) profiles
Stars: ✭ 37 (-38.33%)
Mutual labels:  metadata
dirdf
R package: dirdf - Extracts Metadata from Directory and File Names
Stars: ✭ 57 (-5%)
Mutual labels:  metadata
MetaCPP
C++ Reflection & Serialization using Clang's LibTooling
Stars: ✭ 44 (-26.67%)
Mutual labels:  metadata
intake-esm
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Stars: ✭ 78 (+30%)
Mutual labels:  data-catalog
Islandora-Metadata-Interest-Group
The purpose of the Islandora Metadata Interest Group (IMIG) is to investigate and provide metadata solutions that help improve metadata creation, maintenance and enhancement in Islandora.
Stars: ✭ 29 (-51.67%)
Mutual labels:  metadata
Crema
Meta data server & client tools for game development
Stars: ✭ 61 (+1.67%)
Mutual labels:  metadata
nts
NTS Radio downloader and metadata parser
Stars: ✭ 58 (-3.33%)
Mutual labels:  metadata
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Stars: ✭ 52 (-13.33%)
Mutual labels:  data-warehouse
metadata-xml-tool
CLI tool for processing Salesforce Metadata XML files
Stars: ✭ 14 (-76.67%)
Mutual labels:  metadata
oblivion
The Oblivion masterlist.
Stars: ✭ 16 (-73.33%)
Mutual labels:  metadata
pyexiv2
Read/Write metadata(including EXIF, IPTC, XMP), comment and ICC Profile embedded in digital images.
Stars: ✭ 120 (+100%)
Mutual labels:  metadata
oge
Page metadata as a service
Stars: ✭ 22 (-63.33%)
Mutual labels:  metadata
herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
Stars: ✭ 11 (-81.67%)
Mutual labels:  data-catalog
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+18388.33%)
Mutual labels:  data-warehouse
graphql-ts
Graphql implementation in Typescript using decorator
Stars: ✭ 63 (+5%)
Mutual labels:  metadata

Metamapper

Metamapper is an open-source metadata management platform that aims to make it easier to share data and its context across your organization. It's a self-updating data catalog complete with full-text search, an integrated commenting system, and much more.

What we're trying to accomplish

Growing organizations rely on data and analytics to drive decisions. With the emergence of tools like Airflow and companies like Segment and Fivetran, it's never been easier to get data into your warehouse.

But with all of this data comes a lot of noise. It becomes difficult to keep track of things like the business purpose or timeliness of your data. Plus, writing and maintaining that sort of documentation is just plain boring.

Metamapper aims to automate those boring documentation tasks and reduce the time that data engineers spend answering redundant questions. Just connect your data warehouse and Metamapper will periodically scan the datastore and maintain a commentable data catalog that can be viewed by your team via the UI.

Think of it as Google for your data warehouse – perform a search and it'll find the data that best fits your needs.

Here are a few features of Metamapper:

  • Browser-based: Everything in your browser, with a shareable URL you can give to your team.
  • Schema inspection: Metamapper crawls your database schema(s) every few hours and maintains a comprehensive data catalog.
  • Change detection: Detects when data definitions change between schema inspection runs. Useful for alerting on uncommunicated changes.
  • Annotations: Supports comments on almost every object so your team can crowdsource knowledge about data assets.
  • Custom Properties: Easily attach custom metadata to databases and tables, such as data steward or ETL process references.
  • Search: Everything is indexed and searchable. Self-service data analytics, here we come!
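
The schema inspection and change detection features above can be illustrated with a small sketch. This is not Metamapper's own code: it uses an in-memory SQLite database as a stand-in for a warehouse (SQLite is not in the supported list), and the names `inspect_schema` and `diff_catalogs` are hypothetical helpers invented for this example. It only shows the general idea of scanning a schema twice and comparing the results.

```python
import sqlite3


def inspect_schema(conn):
    """Return {table_name: {column_name: declared_type}} for a SQLite connection."""
    catalog = {}
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
        columns = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
        catalog[table] = {col[1]: col[2] for col in columns}
    return catalog


def diff_catalogs(before, after):
    """List (change, table, column) tuples describing what differs between two scans."""
    changes = []
    for table in sorted(set(before) | set(after)):
        if table not in before:
            changes.append(("table_added", table, None))
            continue
        if table not in after:
            changes.append(("table_dropped", table, None))
            continue
        old_cols, new_cols = before[table], after[table]
        for column in sorted(set(old_cols) | set(new_cols)):
            if column not in old_cols:
                changes.append(("column_added", table, column))
            elif column not in new_cols:
                changes.append(("column_dropped", table, column))
            elif old_cols[column] != new_cols[column]:
                changes.append(("type_changed", table, column))
    return changes


def demo():
    """Scan a toy database, change it, scan again, and return the detected changes."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (id INTEGER, name TEXT)")
    first_scan = inspect_schema(conn)

    # Simulate an uncommunicated change between two inspection runs.
    conn.execute("ALTER TABLE events ADD COLUMN created_at TEXT")
    conn.execute("CREATE TABLE users (id INTEGER)")
    second_scan = inspect_schema(conn)
    conn.close()
    return diff_catalogs(first_scan, second_scan)


if __name__ == "__main__":
    for change in demo():
        print(change)
```

In this sketch each scan is reduced to a plain dictionary, so comparing two runs is a set difference over table and column names plus a type comparison. A real crawler would likely track more attributes (nullability, defaults, indexes), which are omitted here.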

Quickstart

You can try out a default version of Metamapper with sample data using Docker and Docker Compose.

Clone the repository:

git clone git@github.com:getmetamapper/metamapper.git

From the repository root:

docker-compose -f docker-quickstart.yml up

Head to http://localhost:5555 to view the Metamapper UI. Try searching for "clickstream events" and see what happens!
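
The containers can take a little while to start, so it may help to poll the UI address before opening it. The following is a minimal sketch using only the Python standard library. The function names `http_probe` and `wait_for_ui`, the retry count, and the delay are assumptions made for this example and are not part of Metamapper.

```python
import time
import urllib.error
import urllib.request


def http_probe(url):
    """Return True if the URL answers with an HTTP status below 500."""
    try:
        with urllib.request.urlopen(url, timeout=5) as response:
            return response.status < 500
    except urllib.error.HTTPError as error:
        # The server responded, but with an error status (4xx or 5xx).
        return error.code < 500
    except (urllib.error.URLError, OSError):
        # Connection refused, timed out, or reset: the service is not up yet.
        return False


def wait_for_ui(url, attempts=30, delay=2.0, probe=http_probe, sleep=time.sleep):
    """Poll `url` until `probe` succeeds.

    Returns the 1-based attempt number that succeeded, or None if every
    attempt failed.
    """
    for attempt in range(1, attempts + 1):
        if probe(url):
            return attempt
        sleep(delay)
    return None
```

Calling `wait_for_ui("http://localhost:5555")` after starting the quickstart returns once the UI responds, or `None` if it never does within the configured attempts. The `probe` and `sleep` parameters exist only so the loop can be exercised without a running server.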

Installation

Use our pre-baked Docker images. Detailed setup instructions can be found here: https://github.com/getmetamapper/metamapper-setup

Documentation

Supported datastores

Metamapper currently supports automatic crawling and indexing of the following database management systems, with plans to add more in the near future.

  • Amazon Redshift
  • AWS Athena
  • AWS Glue
  • Azure SQL Database
  • Azure Synapse (formerly Azure DW)
  • Google BigQuery
  • Hive Metastore
  • Microsoft SQL Server
  • MySQL
  • Oracle
  • PostgreSQL
  • Snowflake
