Processing Big Data with SQL on Hadoop

Jens Albrecht, Technische Hochschule Nürnberg

Traditional relational databases have never been designed for Big Data processing. They do not scale well beyond certain limits, are often expensive to operate, and require data to be loaded into well-structured tables. The Apache Hadoop framework, developed to scale on low-cost infrastructure, addresses these issues, but offers at its core only rudimentary methods to work with data. The talk gives an overview of the different technologies and systems, which are currently evolving to enable fast and scalable SQL access on top of Hadoop.