In this module we dive into cloud technologies that allow organizations to tap into potentially thousands of computers at the click of a button at little upfront cost. We also explain the software that is used to do this and also to program such compute clusters, in order to use them for addressing Big Data problems.
The below books give background information on the hardware, resp. software aspects of Big Data Infrastructures and Technologies:
More detailed information is available in the course manual (mostly in Dutch).
The origins of this material are in the Large Scale Data Engineering MSc course (LSDE).
The Big Data Infrastructure & Technologies module of the VU University post-graduate course Big Data & Data Science (BADS) was developed by Peter Boncz and Hannes Mühleisen from the Database Architectures research group of CWI, specifically for the Amsterdam Data Science initiative.
The lecture slides for this course are adapted from those used in the Extreme Computing course, which were graciously provided by dr. Stratis Viglas, of University of Edinburgh.