The Mid Week News - 21/06/2017 edit
Time for some news, and only a week since we last did it!
Technology updates (details are on the relevant technology pages):
- Apache Pig has seen a 0.17 release, with support for using Spark as an execution engine introduced to complement the existing support for Tez and MapReduce
- Apache Kudu has seen a 1.4 release
- Cloudbreak has seen a 1.16 release, adding support for Hortonworks Flex Support Subscription
- Apache Impala has seen a 2.9 release
- Hortonworks Data Cloud for AWS has a tech preview of it’s 2.0 release
Technology news:
- Interesting post on the history of Kafka from ZDNet
- Yahoo have open sourced Bullet, a “forward looking query engine” for streaming data
- A view from Datanami on the latest HDF
- Part 4 of Hortonworks intro to HDF 3.0 looking at stream builder (the GUI for building streaming flows)
- A little old, but a still useful view on Data Artisans and Flink from Curt Monash
- A more recent post from Curt, with his views on Cloudera Altus
- A view on Spark Streaming vs Kafka Streams
- More views on the IBM-Hortonworks HDP deal from The Register and Gartner
- Hortonworks are blowing their trumpet (no pun intended) on their HDF 3.0 release and their IBM HDP deal
- Cloudera have published part 2 of their Solr memory tuning guide
- And finally, Databricks’ view on object storage (specifically S3) vs HDFS