Apache Kudu

Columnar storage technology for tables of structured data, supporting low-latency reads, updates and deletes by primary key, as well as analytical column and table scans. Provides Java, C++ and Python APIs, is queryable via Impala and Spark SQL, and provides Spark, Flume and MapReduce connectors. Supports cluster deployments (including co-existence with Hadoop): tables are partitioned into tablets (configurable on a per-table basis), and the tablets are then replicated and distributed across the cluster, using the Raft consensus algorithm for consistency. Also supports per-column encoding (including bit shuffle, run length, dictionary and prefix encoding) and compression. Includes a web UI for reporting operational information, with metrics available from the command line, via HTTP or via a log file. Development started in November 2012, with an initial beta release in September 2015. Donated to the Apache Software Foundation in December 2015, graduating in July 2016, with a 1.0 release in September 2016. Implemented in C++.
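To make the tablet model above more concrete, here is a minimal Python sketch of hash-partitioning rows into tablets by primary key and placing each tablet's replicas on several servers. It is a toy illustration only, not Kudu's implementation or client API: the CRC32 hash, the round-robin placement, the replication factor of 3, the server names and the function names are all assumptions made for the example, and Raft leader election is not modelled.

```python
import zlib


def tablet_for_key(key: str, num_tablets: int) -> int:
    """Map a primary key to a tablet index with a stable hash.

    CRC32 is an assumption for illustration; it is not the hash Kudu uses.
    """
    return zlib.crc32(key.encode("utf-8")) % num_tablets


def place_replicas(num_tablets: int, servers: list, replication_factor: int = 3) -> dict:
    """Assign each tablet to `replication_factor` distinct servers.

    Round-robin placement is a simplification chosen for this sketch.
    """
    if replication_factor > len(servers):
        raise ValueError("need at least as many servers as replicas")
    placement = {}
    for tablet in range(num_tablets):
        placement[tablet] = [
            servers[(tablet + i) % len(servers)]
            for i in range(replication_factor)
        ]
    return placement


def servers_for_key(key: str, num_tablets: int, placement: dict) -> list:
    """Return the servers holding replicas of the tablet that owns `key`."""
    return placement[tablet_for_key(key, num_tablets)]


if __name__ == "__main__":
    # Hypothetical tablet server names.
    servers = ["ts-1", "ts-2", "ts-3", "ts-4"]
    placement = place_replicas(num_tablets=4, servers=servers)
    for key in ("user-1", "user-2", "user-3"):
        print(key, "->", servers_for_key(key, 4, placement))
```

A lookup by primary key only needs to contact the replicas of one tablet, which is what keeps reads, updates and deletes by key cheap, while a scan can fan out across all tablets in parallel.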

Technology Information

Other Names	Kudu
Vendors	The Apache Software Foundation
Type	Commercial Open Source
Last Updated	July 2019 - v1.10

Related Technologies

Is packaged by

Cloudera CDH

Release History

version	release date	release links	release comment
1.4	2017-06-13	summary
1.5	2017-09-08	summary; blog post
1.6	2017-12-07	summary; blog post
1.7	2018-03-23	summary; blog post
1.8	2018-10-26	summary; blog post
1.9	2019-03-12	summary; blog post
1.10	2019-07-09	summary; blog post

News

https://kudu.apache.org/releases/ - details of new releases
https://kudu.apache.org/blog/ - Apache Kudu blog
http://blog.cloudera.com/blog/category/kudu/ - Cloudera blog posts on Kudu