
linkedin / Datahub

Licence: apache-2.0
The Metadata Platform for the Modern Data Stack

Programming Languages

typescript
java
python
shell
javascript
haskell

Projects that are alternatives of or similar to Datahub

metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Stars: ✭ 60 (-98.58%)
Mutual labels:  metadata, data-catalog, data-discovery
Amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Stars: ✭ 2,901 (-31.45%)
Mutual labels:  metadata, data-catalog, data-discovery
sqllineage
SQL Lineage Analysis Tool powered by Python
Stars: ✭ 348 (-91.78%)
Mutual labels:  metadata, data-discovery
Sigmf
The Signal Metadata Format Specification
Stars: ✭ 120 (-97.16%)
Mutual labels:  big-data, metadata
whale
🐳 The stupidly simple CLI workspace for your data warehouse.
Stars: ✭ 696 (-83.55%)
Mutual labels:  data-catalog, data-discovery
feast-java
Feast Java Components
Stars: ✭ 12 (-99.72%)
Mutual labels:  metadata
bigstatsr
R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (-96.72%)
Mutual labels:  big-data
isogeo-plugin-qgis
Isogeo plugin for QGIS
Stars: ✭ 13 (-99.69%)
Mutual labels:  metadata
WG3-MetadataSpecifications
WG3 Metadata Specification
Stars: ✭ 25 (-99.41%)
Mutual labels:  data-discovery
Shrine
File Attachment toolkit for Ruby applications
Stars: ✭ 2,903 (-31.4%)
Mutual labels:  metadata
arm-server
📃 A service for mapping Anime ID's between AniList, AniDB, MAL, and Kitsu (using https://github.com/manami-project/anime-offline-database)
Stars: ✭ 46 (-98.91%)
Mutual labels:  metadata
pdftag
A simple metadata editor for PDFs for Linux and Windows
Stars: ✭ 48 (-98.87%)
Mutual labels:  metadata
mmtf-workshop-2018
Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (-98.82%)
Mutual labels:  big-data
fmf
Flexible Metadata Format
Stars: ✭ 16 (-99.62%)
Mutual labels:  metadata
oauth
Allow users to log in with GitHub, Twitter, Facebook, and more!
Stars: ✭ 21 (-99.5%)
Mutual labels:  linkedin
Bioformats
Bio-Formats is a Java library for reading and writing data in life sciences image file formats. It is developed by the Open Microscopy Environment. Bio-Formats is released under the GNU General Public License (GPL); commercial licenses are available from Glencoe Software.
Stars: ✭ 256 (-93.95%)
Mutual labels:  metadata
QueryArrow
A semantically unified SQL and NoSQL query and update system
Stars: ✭ 17 (-99.6%)
Mutual labels:  metadata
Data-mining-python-script
Contains various scripts for web crawling and data mining of the social web (RSS, Facebook, Twitter, LinkedIn)
Stars: ✭ 24 (-99.43%)
Mutual labels:  linkedin
KASocialLogins
A social login library that lets you log in through Facebook, LinkedIn, and Google
Stars: ✭ 15 (-99.65%)
Mutual labels:  linkedin
aboutmeinfo-telegram-bot
ℹ️ About Me Info Bot: Share your social media and links on Telegram
Stars: ✭ 20 (-99.53%)
Mutual labels:  linkedin

DataHub

DataHub: The Metadata Platform for the Modern Data Stack

Built with ❤️ by Acryl Data and LinkedIn


🏠 Project Homepage: datahubproject.io


Quickstart | Documentation | Features | Roadmap | Adoption | Demo | Town Hall


📣 Next DataHub town hall meeting on Dec 17th, 9am-10am PDT (convert to your local time)


Introduction

DataHub is an open-source metadata platform for the modern data stack. For background, read about the architectures of different metadata systems and why DataHub excels, along with our LinkedIn Engineering blog post, our Strata presentation, and our Crunch Conference talk. To understand how DataHub is implemented, see DataHub Architecture; to learn how to extend DataHub for your own use cases, see the DataHub Onboarding Guide.

Quickstart

Follow the DataHub Quickstart Guide to get a copy of DataHub up and running locally using Docker. The guide assumes some basic knowledge of Docker, so if Docker is completely new to you, we recommend first working through the "Hello World" example in A Docker Tutorial for Beginners.

Demo and Screenshots

There's a hosted demo environment where you can play around with DataHub before installing.


Source Code and Repositories

  • linkedin/datahub: This repository contains the complete source code for both DataHub's frontend & backend services.
  • linkedin/datahub-gma: This repository contains the source code for DataHub's metadata infrastructure libraries (Generalized Metadata Architecture, or GMA).

Documentation

We have documentation available at https://datahubproject.io/docs/.

Releases

See the Releases page for more details. We follow the SemVer Specification when versioning releases and use the Keep a Changelog convention for the changelog format.
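As a rough illustration of what SemVer ordering means when choosing between release tags, here is a minimal Python sketch. The helper names (`parse_version`, `latest`) and the sample tags are hypothetical and not part of DataHub; the sketch handles only plain MAJOR.MINOR.PATCH versions with an optional leading "v" and deliberately ignores pre-release and build metadata.

```python
import re
from typing import Iterable, Tuple

# Plain MAJOR.MINOR.PATCH with an optional leading "v" (assumption: no
# pre-release or build-metadata suffixes, which full SemVer also allows).
_SEMVER = re.compile(r"v?(\d+)\.(\d+)\.(\d+)")


def parse_version(tag: str) -> Tuple[int, int, int]:
    """Turn a tag such as 'v1.2.3' into a tuple of integers (1, 2, 3)."""
    match = _SEMVER.fullmatch(tag)
    if match is None:
        raise ValueError(f"not a plain MAJOR.MINOR.PATCH version: {tag!r}")
    major, minor, patch = (int(part) for part in match.groups())
    return major, minor, patch


def latest(tags: Iterable[str]) -> str:
    """Return the highest tag, comparing components numerically."""
    return max(tags, key=parse_version)


if __name__ == "__main__":
    # Hypothetical tags: numeric comparison puts 0.10.0 above 0.8.17,
    # whereas plain string sorting would get this wrong.
    print(latest(["v0.8.9", "v0.8.17", "v0.10.0"]))  # prints v0.10.0
```

Comparing the components as integers rather than as strings is the point of the sketch: under SemVer, 0.10.0 is newer than 0.8.17 even though it sorts earlier alphabetically.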

Features & Roadmap

Check out DataHub's Features & Roadmap.

Contributing

We welcome contributions from the community. Please refer to our Contributing Guidelines for more details. We also have a contrib directory for incubating experimental features.

Community

Join our Slack workspace for discussions and important announcements. You can also find out more about our upcoming town hall meetings and view past recordings.

Adoption

Here are the companies that have officially adopted DataHub. Please feel free to add yours to the list if we missed it.

Select Articles & Talks

See the full list here.

License

Apache License 2.0.
