Home
-
Data Science Master
Big Data technologies for Data Science
Lecture: Transactional Databases at Cluster Scale
PDF slides:
noSQL Systems
[an error occurred while processing this directive]
Technical Literature
For technical background material, there are some papers:
The Dangers of Replication and a Solution
Cassandra: a distributed storage system
Megastore: Providing Scalable, Highly Available Storage for Interactive Services
Spanner: Google's globally distributed database
General Background Information
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
from
Cloudera, Inc.
Big Data & Cloud Introduction
The MapReduce Framework
The Spark Framework
Scalable Machine Learning
SQL on Big Data
noSQL systems
Data Streams