Apache Impala edit

An MPP query engine that supports the execution of SQL queries over in HDFS, HBase, Kudu and S3 based on tables defined in the Hive Metastore. Focus is on analytical (OLAP) use cases, and more specifically on low latency interactive queries (rather than long running batch queries), with some support for batch inserts of data. Supports DDL statements for updating the Hive Metastore, uses (broadly) the same SQL syntax as Hive (including UDFs and a range of aggregate and analytical functions), as well as the same JDBC / ODBC drivers, and is therefore compatible with any Hive query tool (such as Beeline). Supports querying over data in Parquet, Text, Avro, RCFile and SequenceFile formats, with the ability to write Parquet and Text data. Support Kerberos and LDAP authentication, and integration with Apache Sentry for authorisation. Includes a shell (Impala Shell) that supports some shell only commands for tuning performance and diagnosing problems. Created by Cloudera, started in May 2011 and first announced in October 2012, with a 1.0 GA release in May 2013. Donated to the Apache Foundation in December 2015, graduating in November 2017, and is still under active development.

Technology Information

Other Names	Impala
Vendors	The Apache Software Foundation
Type	Commercial Open Source
Last Updated	April 2019 - v3.2

Related Technologies

Is packaged by

Cloudera CDH, MapR Expansion Pack, Cloudera Altus Data Warehouse

Release History

version	release date	release links	release comment
2.9	2017-06-17	changelog
2.10	2017-09-15	changelog
2.11	2018-01-18	changelog
2.12	2018-05-01	changelog
3.0	2018-05-09	changelog
3.1	2018-12-06	changelog
3.2	2019-03-29	changelog

Blog Posts

The Week That Was - 24/03/2017 2017-03-24 Tech Vendors Technologies CDH Llama Whirr Impala Peter

Technology Information

Related Technologies

Release History

Links

Blog Posts