Cloudera Data Science Workbench edit
A web based notebook for interactive data analytics on Hadoop (with both CDH and HDP supported) that uses docker to provide custom execution environments for each notebook. Supports Python, R and Scala interpreters, plus remote execution of Spark with out of the box support for Hadoop security. Notebook code is run within a docker container in a managed Kubernetes instance, allowing different libraries to be installed and used by different notebooks, and other dependancies to be installed via terminal access to the container or via custom Docker images. Also includes support for version control (via git), tracking of model tests (Experiments), automatic deployment of models and all dependancies behind a REST endpoint (Models), collaboration via shared projects, sharing of notebooks via HTTP URLs, publishing of notebooks as HTML and scheduled execution of notebooks via workflows (including dependancies on other jobs). Originally created by Sense.io, which was acquired by Cloudera in March 2016. Initial GA release was 1.0 in April 2017, with support for HDP added in January 2019 Technology Information
Vendors Cloudera Type Commercial Last Updated July 2019 - v1.6 Release History
version release date release links release comment 1.0 2017-04-26 announcement; release notes Initial release 1.1 2017-07-18 announcement; release notes; blog post 1.2 2017-10-20 release notes; blog post new usage monitoring 1.3 2018-01-26 announcement; release notes 1.4 2018-06-15 announcement; blog post; release notes 1.5 2019-01-29 announcement; release notes; CDH 6.1 + HDP 2.6/3.1 support 1.6 2019-07-24 announcement; using your own editor blog Links
News
Blog Posts