Thoughts on Hadoop Data Formats
This week we looked at two file formats for Hadoop; ORC and CarbonData and a new in-memory data structure specification; Arrow. Earlier on this year we looked at data serialisation frameworks - Avro and Parquet. File formats, data serialisation frameworks, specifications… Ahh, what a minefield! Today I’d like to try and make sense of it all by looking at the evolution of these various data formats to see how we got here.
The Mid Week News - 04/10/2017
Time for the news again, with all our updates on new technology releases and interesting things to read…
The Plan For This Week - 02/10/2017
It’s a guest publication week this week - our old friend Jeff Moszuti has a bunch of technology summaries on record format libraries (specifically Apache Arrow, CarbonData and ORCFile), and some thoughts to share with us at the end of the week.
We’ll start today with Apache Arrow…
Thoughts on Graph Technologies
So, I said four technology categories and three technology summaries this week. One day you’ll learn.
But let’s talk about what we did manage to achieve this week, specifically technology category pages on RDF Databases, Graph Databases and Graph Analytics…
The Mid Week News - 27/09/2017
It’s news time again, and there are big announcements from Cloudera and Hortonworks this week…