Slides
Course Syllabus, Grading, Policies
Overview
slides
Parallel & Distributed Databases
Textbook 50 pages
Intro to processes & threads
documentation
Processes & threads in C
web
DAS, NAS, SAN, RAID
articles
Parallel QEP, optimization, join, sorting …
The Query Compiler
Textbook 80 pages
Distributed Transactions, serializability theory, concurrency control algorithms
Concurrency Control
Time, Clocks, and the Ordering of Events in a Distributed System
article 8 pages
Reliability, Atomicity, & Durability
Coping with System Failures
Textbook 40 pages
Replication protocols
Introduction to Advanced Replication
Documentation
P2P, Overlay networks, DHT
Non-relational, NoSQL
web article
Replication in Cassandra
BitTorrent
video 3 min
Bitcoin & Blockchain
article 9 pages
NoSQL data models, query processing & transactions
Data Warehousing Overview
Batch Processing using Hadoop
Queries on Batch Views
Queries on Realtime Views
Ethical questions in Big Data
5 approaches to thinking about ethical questions
where to get some data …
data