Core Hadoop Technologies (pt1)
Technologies The Apache Software Foundation Apache Flume Apache HBase Apache Hive HCatalog Hive Metastore Hive Server Peter
And we’re back - Happy New Year!
Having started with the core Apache Hadoop project, we’re now going to look at the “core” technologies within the Hadoop space, based on those included in multiple distributions (many thanks to Merv Adrian from Gartner for his useful tracker)
Apache Hadoop
And so we begin our journey through the jungle of Data Engineering technologies by looking at the technology du jour - Apache Hadoop.
The Technology Catalogue
The first step in this journey is going to be creation of a catalogue of the technologies that are going to be of interest to us as we explore the world of data engineering.
The Plan
One more post before we get started.
The following are my current thoughts for some of the topics I’d like to cover on this site, both as a reference for my future self to look back at my naive optimism, but also if anyone wants to start contributing to any of these now, or to start a discussion on any the later topics to start framing and exploring them.