MapR Expansion Pack

A package of open source Hadoop projects certified to work together against one or more versions of the MapR Converged Data Platform. New major releases appear roughly once a quarter, with most components kept reasonably up to date with the open source version and any patching done publicly on GitHub. Available as RPMs, and installable via the MapR Installer. These components were originally bundled as part of the MapR Converged Data Platform, but were broken out as the MapR Ecosystem Pack in September 2016 to allow them to be released independently. Renamed to the MapR Expansion Pack as of version 4.0.

Technology Information

Other Names: MEP, MapR Ecosystem Pack
Vendors: MapR
Type: Commercial Open Source
Last Updated: October 2018 - v6.0

Related Technologies

Packages: Apache Drill, Apache Flume, Apache Hive, Apache Impala, Kafka Connect, Kafka Streams, Apache Mahout, Apache Myriad, Apache Oozie, Apache Pig, Apache Sentry, Apache Spark, Apache Sqoop, Apache Storm, Apache Tez, Hue, MapR Object Store, MapR-DB HBase Clients and Tools, MapR-ES Kafka Clients and Tools
Is packaged by: MapR Converged Data Platform

Further Information

The documentation homepage provides a list of the components included; however, this omits a number of components, including Mahout, Storm and Tez, and includes Cascading, which isn’t actually bundled. The release notes have a more complete list, along with detailed release notes and package details for each of the components.

Some key components to call out include:

  • Hive - includes Hive on Tez, but not Hive on Spark or LLAP. HCatalog is also included.
  • Hue - includes Livy
  • Spark - includes Spark 2.x only
  • Sqoop - includes both Sqoop1 and Sqoop2.
  • MapR-DB HBase Clients and Tools - this includes the standard client, REST gateway and Thrift1 gateway from HBase, and the asynchronous client from https://github.com/OpenTSDB/asynchbase, all of which can be used to manipulate data in MapR-DB binary tables. Prior to MEP 4.0 (MapR 6.0) the entirety of HBase was bundled, however in MEP 4.0 the HBase Master and HBase Regionserver are no longer distributed.
  • MapR-ES Kafka Clients and Tools - this includes the librdkafka C client, the Confluent Python client, Kafka Connect, Kafka Streams, KSQL and the Kafka REST proxy, all of which can be used to manipulate data in MapR-ES. Introduced in MEP 2.0 - see blog post
  • MapR Object Store provides an S3 compatible API over MapR-FS
  • The open source components for MapR Monitoring (collectd, OpenTSDB, Grafana, FluentD, Elasticsearch and Kibana) are also distributed as part of the MEP.
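
As an illustration of what the S3 compatible API of MapR Object Store means in practice, a generic S3 client can usually be pointed at such a service by overriding the endpoint URL. The following is a minimal sketch only, assuming the third-party boto3 library is installed. The endpoint URL and credentials are hypothetical placeholders, and the helper names s3_client_kwargs and list_bucket_names are introduced here for illustration rather than taken from MapR documentation.

```python
def s3_client_kwargs(endpoint_url, access_key, secret_key):
    """Collect the keyword arguments passed to boto3.client("s3", ...).

    endpoint_url is the address of the S3 compatible service; the value
    used in the example below is a hypothetical placeholder.
    """
    return {
        "endpoint_url": endpoint_url,
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }


def list_bucket_names(endpoint_url, access_key, secret_key):
    """Return the bucket names visible to the given credentials."""
    # Imported lazily so the helper above can be used without boto3 installed.
    import boto3  # third-party; assumed to be available

    client = boto3.client(
        "s3", **s3_client_kwargs(endpoint_url, access_key, secret_key)
    )
    response = client.list_buckets()
    return [bucket["Name"] for bucket in response.get("Buckets", [])]


# Example call (hypothetical endpoint and credentials, needs a running service):
# list_bucket_names("https://objectstore.example.com:9000", "ACCESS", "SECRET")
```

Whether a particular MapR Object Store deployment accepts these exact settings (port, TLS, addressing style) is an assumption to check against the MEP release notes for the version in use.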

Release History

version | release date | release links | release comment
4.0 | 2017-11-21 | announcement |
4.1 | 2018-02-02 | announcement |
5.0 | 2018-04-06 | announcement; release notes | Drill 1.13; Flume 1.8; Impala 2.10; Spark 2.2.1
6.0 | 2018-10-03 | release notes | Kafka Streams; KSQL; Spark 2.3.1; Hive 2.3; Hue 4.2

News

  • Announcements via the MapR product announcements blog

Blog Posts