All Git Users → cloudera

37 open source projects by cloudera

1. Impala
Real-time Query for Hadoop; mirror of Apache Impala
2. Sqoop
Sqoop has moved to Apache!
✭ 174
java
3. Cloudera Playbook
Cloudera deployment automation with Ansible
✭ 168
html
4. Cm ext
Cloudera Manager Extensibility Tools and Documentation.
✭ 146
java
5. Impala Tpcds Kit
TPC-DS Kit for Impala
✭ 142
6. Kitten
The fast and fun way to write YARN applications.
✭ 132
java
10. Hs2client
C++ native client for Impala and Hive, with Python / pandas bindings
✭ 69
thrift
12. Sentry
Access Server
✭ 45
java
14. Livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
✭ 942
scala
16. Flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
✭ 941
java
17. Lucene Solr
Mirror of Apache Lucene + Solr https://github.com/apache/lucene-solr
✭ 16
java
18. Kudu
Apache Kudu. Mirrored from https://github.com/apache/kudu
✭ 829
19. Impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
✭ 625
python
21. Crunch
Crunch is an Apache TLP now, and lives at http://crunch.apache.org/
✭ 312
java
22. Cdh Twitter Example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
✭ 285
java
23. Cm api
Cloudera Manager API Client
✭ 278
java
24. whirr-cm
No description, website, or topics provided.
✭ 28
javashell
25. dist test
No description, website, or topics provided.
✭ 28
HTMLpython
26. clusterdock
No description, website, or topics provided.
27. cdsw-training
Example Python and R code for Cloudera Data Science Workbench training
28. earthquake
No description, website, or topics provided.
29. native-toolchain
No description, website, or topics provided.
30. sqoop2
No description, website, or topics provided.
31. thrift sasl
Thrift SASL module that implements TSaslClientTransport
32. seismichadoop
System for performing seismic data processing on a Hadoop cluster.
✭ 32
javashell
33. cm csds
A collection of Custom Service Descriptors
✭ 50
shelljava
34. kudu-examples
Example code for Kudu
✭ 79
35. cloudera-scripts-for-log4j
Scripts for addressing log4j zero day security issue
✭ 82
shell
36. director-sdk
Cloudera Director API clients
✭ 19
javapython
37. impala-udf-samples
Sample UDF and UDAs for Impala.
1-37 of 37 user projects