All Projects → spotify → Snakebite

spotify / Snakebite

Licence: apache-2.0
A pure python HDFS client

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Snakebite

BigDataTools
tools for bigData
Stars: ✭ 36 (-95.65%)
Mutual labels:  hdfs
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-98.31%)
Mutual labels:  hdfs
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (-50.97%)
Mutual labels:  hdfs
ros hadoop
Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Stars: ✭ 92 (-88.89%)
Mutual labels:  hdfs
fluent-plugin-webhdfs
Hadoop WebHDFS output plugin for Fluentd
Stars: ✭ 57 (-93.12%)
Mutual labels:  hdfs
Divolte Collector
Divolte Collector
Stars: ✭ 264 (-68.12%)
Mutual labels:  hdfs
datasqueeze
Hadoop utility to compact small files
Stars: ✭ 18 (-97.83%)
Mutual labels:  hdfs
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (-38.04%)
Mutual labels:  hdfs
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-98.43%)
Mutual labels:  hdfs
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (-51.69%)
Mutual labels:  hdfs
taller SparkR
Taller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-98.55%)
Mutual labels:  hdfs
fastdata-cluster
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-97.58%)
Mutual labels:  hdfs
Juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Stars: ✭ 4,262 (+414.73%)
Mutual labels:  hdfs
fsbrowser
Fast desktop client for Hadoop Distributed File System
Stars: ✭ 27 (-96.74%)
Mutual labels:  hdfs
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+625.6%)
Mutual labels:  hdfs
aaocp
一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (-93.6%)
Mutual labels:  hdfs
bigkube
Minikube for big data with Scala and Spark
Stars: ✭ 16 (-98.07%)
Mutual labels:  hdfs
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-99.4%)
Mutual labels:  hdfs
Bigdata
💎🔥大数据学习笔记
Stars: ✭ 488 (-41.06%)
Mutual labels:  hdfs
Kafka Connect Ui
Web tool for Kafka Connect |
Stars: ✭ 388 (-53.14%)
Mutual labels:  hdfs

Snakebite mini logo

Snakebite is a python library that provides a pure python HDFS client and a wrapper around Hadoops minicluster. The client uses protobuf for communicating with the NameNode and comes in the form of a library and a command line interface. Currently, the snakebite client supports most actions that involve the Namenode and reading data from DataNodes.

Note: all methods that read data from a data node are able to check the CRC during transfer, but this is disabled by default because of performance reasons. This is the opposite behaviour from the stock Hadoop client.

Snakebite requires python2 (python3 is not supported yet) and python-protobuf 2.4.1 or higher.

Snakebite 1.3.x has been tested mainly against Cloudera CDH4.1.3 (hadoop 2.0.0) in production. Tests pass on HortonWorks HDP 2.0.3.22-alpha (protocol versions 7 and 8)

Snakebite 2.x has been tested on Hortonworks HDP2.0 and CDH5 Beta and ONLY supports Hadoop 2.2.0 and up (protocol version 9)!

Installing

Snakebite releases are available through pypi at https://pypi.python.org/pypi/snakebite/

To install snakebite run:

pip install snakebite

To install snakebite 2.x with Kerberos/SASL support, make sure you can install python-krbV (https://fedorahosted.org/python-krbV/) and then run:

pip install "snakebite[kerberos]"

Since the older version of snakebite (1.3.x) supports Hadoop 1.0 (instead of Hadoop 2), you might want to install an older version by running:

pip install -I snakebite==1.3.x

Note that the 1.3 branch is unmaintained and doesn't include any of the fixes in the 2.x branch.

Documentation

More information and documentation can be found at https://snakebite.readthedocs.io/en/latest/

Development

Make sure to read about development here and about testing over here, hack and come back with a pull requests <3

Travis CI status: Travis Join the chat at https://gitter.im/spotify/snakebite

Copyright 2013-2016 Spotify AB

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].