All Projects → scritchley → Orc

scritchley / Orc

Licence: mit
An ORC file format reader and writer for Go.

Programming Languages

go
31211 projects - #10 most used programming language
golang
3204 projects

Labels

Projects that are alternatives of or similar to Orc

Bookkeeper
Apache Bookkeeper
Stars: ✭ 1,178 (+1114.43%)
Mutual labels:  big-data
Panoptes
A Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-17.53%)
Mutual labels:  big-data
Reef
Mirror of Apache REEF
Stars: ✭ 92 (-5.15%)
Mutual labels:  big-data
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+10032.99%)
Mutual labels:  big-data
Iotdb
Apache IoTDB
Stars: ✭ 1,221 (+1158.76%)
Mutual labels:  big-data
Parquet Mr
Apache Parquet
Stars: ✭ 1,278 (+1217.53%)
Mutual labels:  big-data
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-26.8%)
Mutual labels:  big-data
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-1.03%)
Mutual labels:  big-data
Uproot4
ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-17.53%)
Mutual labels:  big-data
Hazelcast Python Client
Hazelcast IMDG Python Client
Stars: ✭ 92 (-5.15%)
Mutual labels:  big-data
Spark Website
Apache Spark Website
Stars: ✭ 75 (-22.68%)
Mutual labels:  big-data
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-18.56%)
Mutual labels:  big-data
Smart Array To Tree
Convert large amounts of data array to tree fastly
Stars: ✭ 91 (-6.19%)
Mutual labels:  big-data
Labs
Research on distributed system
Stars: ✭ 73 (-24.74%)
Mutual labels:  big-data
Treeviz
Tree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (-2.06%)
Mutual labels:  big-data
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+1111.34%)
Mutual labels:  big-data
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-15.46%)
Mutual labels:  big-data
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+0%)
Mutual labels:  big-data
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1279.38%)
Mutual labels:  big-data
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-6.19%)
Mutual labels:  big-data

orc

Build Status code-coverage go-doc

Project Status

This project is still a work in progress.

Current Support

Column Encoding Read Write Go Type
SmallInt, Int, BigInt int64
Float, Double float32, float64
String, Char, and VarChar string
Boolean bool
TinyInt byte
Binary []byte
Decimal orc.Decimal
Date orc.Date (time.Time)
Timestamp time.Time
Struct orc.Struct (map[string]interface{})
List []interface{}
Map []orc.MapEntry
Union interface{}
  • The writer support is in its late stages, however, I do not recommend using it yet.

Example

r, err := Open("./examples/demo-12-zlib.orc")
if err != nil {
    log.Fatal(err)
}
defer r.Close()

// Create a new Cursor reading the provided columns.
c := r.Select("_col0", "_col1", "_col2")

// Iterate over each stripe in the file.
for c.Stripes() {
    
    // Iterate over each row in the stripe.
    for c.Next() {
          
        // Retrieve a slice of interface values for the current row.
        log.Println(c.Row())
        
    }
   
}

if err := c.Err(); err != nil {
    log.Fatal(err)
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].