Thoughts on Hadoop Data Formats

This week we looked at two file formats for Hadoop; ORC and CarbonData and a new in-memory data structure specification; Arrow. Earlier on this year we looked at data serialisation frameworks - Avro and Parquet. File formats, data serialisation frameworks, specifications… Ahh, what a minefield! Today I’d like to try and make sense of it all by looking at the evolution of these various data formats to see how we got here.

Read More

The Mid Week News - 04/10/2017

Time for the news again, with all our updates on new technology releases and interesting things to read…

Read More

The Plan For This Week - 02/10/2017

It’s a guest publication week this week - our old friend Jeff Moszuti has a bunch of technology summaries on record format libraries (specifically Apache Arrow, CarbonData and ORCFile), and some thoughts to share with us at the end of the week.

We’ll start today with Apache Arrow…

Thoughts on Graph Technologies

So, I said four technology categories and three technology summaries this week. One day you’ll learn.

But let’s talk about what we did manage to achieve this week, specifically technology category pages on RDF Databases, Graph Databases and Graph Analytics

Read More

The Mid Week News - 27/09/2017

It’s news time again, and there are big announcements from Cloudera and Hortonworks this week…

Read More