Published on May 16th, 2020

Simplifying Hadoop: RecordService, a secure and unified data access path for compute frameworks


One of the key strengths of the Hadoop ecosystem is its flexibility. A myriad of components make up this ecosystem, allowing Hadoop to tackle otherwise intractable problems. However, having so many components imposes a significant integration, implementation, and usability burden. Features that ought to work in every component often require sizable per-component effort to ensure correctness across the stack.

In this talk, we introduce RecordService, a new solution to this problem. The service provides an API that reads data from Hadoop storage managers and returns it as canonical records. This eliminates the need for each component to support individual file formats, handle security, perform auditing, and implement sophisticated IO scheduling and the other common processing that sits at the bottom of any computation. (From Strata + Hadoop World NYC 2015)
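To make the idea of a single access path concrete, here is a toy sketch in Python. It is not the RecordService API: the class `ToyRecordService`, its `scan` method, the per-user column allow-list, and the in-memory audit log are all hypothetical names invented for illustration. It only shows the shape of the idea, in which format parsing, a permission check, and auditing live in one place and callers receive uniform records regardless of the underlying format.

```python
import csv
import io
import json
from typing import Callable, Dict, Iterator, List, Set

# A "canonical record" in this sketch is just a dict of column name -> string.
Record = Dict[str, str]


def _read_csv(raw: str) -> Iterator[Record]:
    """Parse CSV text with a header row into records."""
    yield from csv.DictReader(io.StringIO(raw))


def _read_jsonl(raw: str) -> Iterator[Record]:
    """Parse JSON-lines text into records, normalising values to strings."""
    for line in raw.splitlines():
        if line.strip():
            yield {key: str(value) for key, value in json.loads(line).items()}


class ToyRecordService:
    """Hypothetical stand-in for a unified data access layer (illustration only)."""

    def __init__(self, allowed_columns: Dict[str, Set[str]]) -> None:
        # Format support lives here once, instead of in every compute framework.
        self._readers: Dict[str, Callable[[str], Iterator[Record]]] = {
            "csv": _read_csv,
            "jsonl": _read_jsonl,
        }
        # Assumed security model: a per-user allow-list of column names.
        self._allowed = allowed_columns
        self.audit_log: List[str] = []

    def scan(self, user: str, fmt: str, raw: str, columns: List[str]) -> List[Record]:
        """Check access, record the request, and return canonical records."""
        permitted = self._allowed.get(user, set())
        denied = [c for c in columns if c not in permitted]
        # Every request is audited, whether or not it is allowed.
        self.audit_log.append(f"user={user} format={fmt} columns={columns} denied={denied}")
        if denied:
            raise PermissionError(f"{user} may not read columns {denied}")
        # Project each parsed record down to the requested columns.
        return [{c: rec[c] for c in columns} for rec in self._readers[fmt](raw)]


if __name__ == "__main__":
    service = ToyRecordService({"alice": {"id", "name"}})
    csv_data = "id,name,ssn\n1,Ann,123\n"
    jsonl_data = '{"id": 1, "name": "Ann", "ssn": 123}\n'
    # The caller sees the same records regardless of the storage format.
    print(service.scan("alice", "csv", csv_data, ["id", "name"]))
    print(service.scan("alice", "jsonl", jsonl_data, ["id", "name"]))
```

A caller in this sketch never touches a parser or a permission rule directly, which is the burden the abstract says is lifted from individual components. IO scheduling is left out entirely.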