This course introduces graduate students to advanced topics in database systems that are relevant to current research and industrial development in the field. The first part of the course focuses on normalization and functional dependencies (FDs), transaction management, concurrency control (serializability, two-phase locking, and timestamp-based methods), and recovery techniques. The course also brings together several key information technologies used in manipulating, storing, and analyzing big data. We review MapReduce techniques for parallel processing and Hadoop, an open-source framework that implements MapReduce. We touch on related tools that provide SQL-like access to unstructured data. We analyze so-called NoSQL storage solutions. We work with the public cloud as a practically unlimited resource for big data analytics.