All Projects → stripe-archive → Herringbone

stripe-archive / Herringbone

Licence: mit
Tools for working with parquet, impala, and hive

Labels

Projects that are alternatives of or similar to Herringbone

Node Server Project
基于 node.js + express 技术栈,采用MVC结构设计、JWT + RESTful API、PM2服务监控的Node服务器端项目框架.
Stars: ✭ 47 (-64.93%)
Mutual labels:  thrift
Hs2client
C++ native client for Impala and Hive, with Python / pandas bindings
Stars: ✭ 69 (-48.51%)
Mutual labels:  thrift
Line Instant Messenger Protocol
It is work of Matti Virkkunen, link to http://altrepo.eu/git/line-protocol.git for latest update.
Stars: ✭ 96 (-28.36%)
Mutual labels:  thrift
Tns
tns provides distributed solutions for thrift, support service discovery, high availability, load balancing, the gray release, horizontal scaling, and so on.
Stars: ✭ 53 (-60.45%)
Mutual labels:  thrift
Tchannel
network multiplexing and framing protocol for RPC
Stars: ✭ 1,122 (+737.31%)
Mutual labels:  thrift
Sails
Create a Thrift Server use like Rails
Stars: ✭ 72 (-46.27%)
Mutual labels:  thrift
Spring Thrift Api Gateway
Gateway for Apache Thrift requests processing that is built on Spring Cloud stack
Stars: ✭ 38 (-71.64%)
Mutual labels:  thrift
Frugal
Thrift improved
Stars: ✭ 113 (-15.67%)
Mutual labels:  thrift
Thriftpy
Thriftpy has been deprecated, please migrate to https://github.com/Thriftpy/thriftpy2
Stars: ✭ 1,156 (+762.69%)
Mutual labels:  thrift
Evernote Thrift
Thrift IDL files for the Evernote Cloud API
Stars: ✭ 94 (-29.85%)
Mutual labels:  thrift
Thrift
Apache Thrift
Stars: ✭ 8,821 (+6482.84%)
Mutual labels:  thrift
Nettythrift
Thrift on Netty, support TCP/HTTP/WebSocket at same port. support multiple Protocols at same time. multil Simple Clients with Connection Pool.
Stars: ✭ 60 (-55.22%)
Mutual labels:  thrift
Thrift connector
Clients of thrift, utilizing connection pools
Stars: ✭ 74 (-44.78%)
Mutual labels:  thrift
Finagle
A fault tolerant, protocol-agnostic RPC system
Stars: ✭ 8,126 (+5964.18%)
Mutual labels:  thrift
Dapeng Soa
A lightweight, high performance micro-service framework
Stars: ✭ 101 (-24.63%)
Mutual labels:  thrift
Thrift2flow
Converts Thrift specs into Flow JavaScript type definitions
Stars: ✭ 39 (-70.9%)
Mutual labels:  thrift
Andl
Andl is A New Database Language
Stars: ✭ 71 (-47.01%)
Mutual labels:  thrift
Aeraki
Manage any layer 7 traffic in Istio Service Mesh.
Stars: ✭ 119 (-11.19%)
Mutual labels:  thrift
Php Thrift Sql
A PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-20.15%)
Mutual labels:  thrift
Springboot Thrift Etcd Ribbon
基于springboot的thrift的rpc, 服务发现基于etcd,路由基于ribbon
Stars: ✭ 75 (-44.03%)
Mutual labels:  thrift

Herringbone

Herringbone is deprecated and is no longer being actively maintained.

Herringbone is a suite of tools for working with parquet files on hdfs, and with impala and hive.

The available commands are:

flatten: transform a directory of parquet files with a nested structure into a directory of parquet files with a flat schema that can be loaded into impala or hive (neither of which support nested schemas). Default output directory is /path/to/input/directory-flat.

$ herringbone flatten -i /path/to/input/directory [-o /path/to/non/default/output/directory]

load: load a directory of parquet files (which must have a flat schema) into impala or hive (defaulting to impala). Use the --nocompute-stats option for faster loading into impala (but probably slower querying later on!)

$ herringbone load [--hive] [-u] [--nocompute-stats] -d db_name -t table -p /path/to/parquet/directory

tsv: transform a directory of parquet files into a directory of tsv files (which you can concat properly later with hadoop fs -getmerge /path/to/tsvs). Default output directory is /path/to/input/directory-tsv.

$ herringbone tsv -i /path/to/input/directory [-o /path/to/non/default/output/directory]

compact: transform a directory of parquet files into a directory of fewer larger parquet files. Default output directory is /path/to/input/directory-compact.

$ herringbone compact -i /path/to/input/directory [-o /path/to/non/default/output/directory]

See herringbone COMMAND --help for more information on a specific command.

Building

You'll need thrift 0.9.1 on your path.

$ git clone github.com/stripe/herringbone
$ cd herringbone
$ mvn package

Authors

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].