All Projects → hive-tools → documentr

hive-tools / documentr

Licence: Apache-2.0 License
A naive solution to document schemas

Programming Languages

python
139335 projects - #7 most used programming language
HTML
75241 projects
Makefile
30231 projects
shell
77523 projects

Projects that are alternatives of or similar to documentr

hive-jdbc-driver
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (+29.17%)
Mutual labels:  hive
hiveberg
Demonstration of a Hive Input Format for Iceberg
Stars: ✭ 22 (-8.33%)
Mutual labels:  hive
EngineeringTeam
와이빅타 엔지니어링팀의 자료를 정리해두는 곳입니다.
Stars: ✭ 41 (+70.83%)
Mutual labels:  hive
aaocp
一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (+120.83%)
Mutual labels:  hive
BigDataTools
tools for bigData
Stars: ✭ 36 (+50%)
Mutual labels:  hive
hivemind
Hive API server (offloads most API calls from hived) implemented using Python+SQL
Stars: ✭ 46 (+91.67%)
Mutual labels:  hive
liquibase-impala
Liquibase extension to add Impala Database support
Stars: ✭ 23 (-4.17%)
Mutual labels:  hive
dlux open token
DLUX distributed deterministic finite state automata. Built for HIVE to take advantage of free transactions using multi-sig and escrow for security.
Stars: ✭ 16 (-33.33%)
Mutual labels:  hive
web-click-flow
网站点击流离线日志分析
Stars: ✭ 14 (-41.67%)
Mutual labels:  hive
cloud
云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Stars: ✭ 48 (+100%)
Mutual labels:  hive
apiary
Apiary provides modules which can be combined to create a federated cloud data lake
Stars: ✭ 30 (+25%)
Mutual labels:  hive
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Stars: ✭ 194 (+708.33%)
Mutual labels:  hive
ETL-Starter-Kit
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (-12.5%)
Mutual labels:  hive
HiveJdbcStorageHandler
No description or website provided.
Stars: ✭ 21 (-12.5%)
Mutual labels:  hive
TitanDataOperationSystem
最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;
Stars: ✭ 62 (+158.33%)
Mutual labels:  hive
hive-cube
Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org
Stars: ✭ 34 (+41.67%)
Mutual labels:  hive
cobra-policytool
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
Stars: ✭ 16 (-33.33%)
Mutual labels:  hive
TIL
Today I Learned
Stars: ✭ 43 (+79.17%)
Mutual labels:  hive
GooglePlay-Web-Crawler
Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Stars: ✭ 18 (-25%)
Mutual labels:  hive
spark-waimai
基于spark的外卖大数据平台分析系统
Stars: ✭ 24 (+0%)
Mutual labels:  hive

documentr Build Status

This package helps you to automate the documentation of your database migrations for creating tables.

img

And now it supports the table graph representation:

img

TO-DO

  • Differenciate between a table creation and creating a view
  • Create a basic graph representing each table relationships
  • Improve error handling, if something cannot be done, why?
  • Improve Documentr default template, right now it's kind of horrible
  • Write tests, and then... more tests
  • Configure Travis CI to run tests after each commit
  • Improve documentation :party:

Supports

  • Hive
  • MySQL
  • PostgreSQL

Example

/**
 * @author("Sergio Sola")
 * @description("This table creates a fact table with active customers")
 * @version("1.0.0")
 */
CREATE EXTERNAL TABLE IF NOT EXISTS fact_tables.active_customers (
    customer_id  BIGINT COMMENT "@reference(dimensions.customers.customer_sk) Reference to customer in time",
    product STRING COMMENT "@reference(other_tables.product.sku) Stores the product SKU"
 )
 PARTITIONED BY (country string)
STORED AS PARQUET
LOCATION '/YourCompany/fact_tables/active_customers';

Generates a JSON file like:

{
	"table": "active_customers",
	"metadata": {
		"version": "1.0.0",
		"description": "This table creates a fact table with active customers",
		"author": "Sergio Sola"
	},
	"fields": [{
		"comment": " Reference to customer in time",
		"type": "BIGINT",
		"name": "customer_id",
		"metadata": {
			"reference": "dimensions.customers.customer_sk"
		}
	}, {
		"comment": " Stores the product SKU",
		"type": "STRING",
		"name": "product",
		"metadata": {
			"reference": "other_tables.product.sku"
		}
	}, {
		"comment": "",
		"type": "STRING",
		"name": "country",
		"metadata": null
	}],
	"database": "fact_tables"
}

Now with this JSON files we can build a website displaying all these information in a beatiful way.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].