Core Hadoop Technologies (pt1)

And we’re back - Happy New Year!

Having started with the core Apache Hadoop project, we’re now going to look at the “core” technologies within the Hadoop space, based on those included in multiple distributions (many thanks to Merv Adrian from Gartner for his useful tracker)

Read More

Apache Hadoop

And so we begin our journey through the jungle of Data Engineering technologies by looking at the technology du jour - Apache Hadoop.

Read More

The Technology Catalogue

Site

The first step in this journey is going to be creation of a catalogue of the technologies that are going to be of interest to us as we explore the world of data engineering.

Read More

The Plan

Site

One more post before we get started.

The following are my current thoughts for some of the topics I’d like to cover on this site, both as a reference for my future self to look back at my naive optimism, but also if anyone wants to start contributing to any of these now, or to start a discussion on any the later topics to start framing and exploring them.

Read More

Big Data

Site

Before we get stuck in, a short digression to talk about Big Data.

Read More