1. Impala: Real-time Query for Hadoop; mirror of Apache Impala
4. Cm ext: Cloudera Manager Extensibility Tools and Documentation.
6. Kitten: The fast and fun way to write YARN applications.
10. Hs2client: C++ native client for Impala and Hive, with Python / pandas bindings
14. Livy: Livy is an open source REST interface for interacting with Apache Spark from anywhere
16. Flume: WE HAVE MOVED to Apache Incubator (https://cwiki.apache.org/FLUME/). Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
17. Lucene Solr: Mirror of Apache Lucene + Solr https://github.com/apache/lucene-solr
18. Kudu: Apache Kudu. Mirrored from https://github.com/apache/kudu
19. Impyla: Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
20. Hue: Open source SQL Query Assistant service for Databases/Warehouses
21. Crunch: Crunch is an Apache TLP now, and lives at http://crunch.apache.org/
27. cdsw-training: Example Python and R code for Cloudera Data Science Workbench training
30. sqoop2: No description, website, or topics provided.