Distributed Systems Engineering (WS 2023)

[WS23/24] Scalable Data Management

Lecturer:

Prof. Dr.-Ing. habil. Dirk Habich

 

Time and Place:

Lecture: Monday, 13:00 - 14:30 (4. DS), SCH/A101/H

Exercise #1: Monday, 09:20 - 10:50 (2. DS), APB/E007/U

Exercise #2: Tuesday 14:50 - 16:20 (5. DS), ASB/114

 

Description

"Data is the new Oil" - with this sentence, the relevance of structured data and thus, implicitly of course the relevance of scalable database systems as a fundamental technique of analytical and transactional processing of usually large data sets becomes visible. In the context of this course, we will discuss concepts and methods that enable distributed data processing with respect to two essential properties: on the one hand, the aspect of "performance" will be addressed and thus, questions of scalability in the case of scale-out architectures will be discussed using systems such as Apache Spark. On the other hand, the aspect of "consistency" will be discussed, where different methods for synchronizing concurrent read and write activities on the same dataset will be presented.

In general, the goal of this course is to give an insight into scalable techniques and methods of database technology. The course requires a basic knowledge of databases. Attendance of another advanced courses is not necessary, but helpful in some topics. The course exercises consist of tasks that are integrated into the lecture and practical exercises in dealing with "real" systems.

Information

  • Please register in OPAL, because all materials and all communication is restricted to registered course members only. 
Institut für Systemarchitektur | Wintersemester 2023 / 2024 [WS23/24] Scalable Data Management

News

Lecturer:

Prof. Dr.-Ing. habil. Dirk Habich

 

Time and Place:

Lecture

Exercise #1:

Exercise #2:

Description

"Data is the new Oil" - with this sentence, the relevance of structured data and thus, implicitly of course the relevance of scalable database systems as a fundamental technique of analytical and transactional processing of usually large data sets becomes visible. In the context of this course, we will discuss concepts and methods that enable distributed data processing with respect to two essential properties: on the one hand, the aspect of "performance" will be addressed and thus, questions of scalability in the case of scale-out architectures will be discussed using systems such as Apache Spark. On the other hand, the aspect of "consistency" will be discussed, where different methods for synchronizing concurrent read and write activities on the same dataset will be presented.

In general, the goal of this course is to give an insight into scalable techniques and methods of database technology. The course requires a basic knowledge of databases. Attendance of another advanced courses is not necessary, but helpful in some topics. The course exercises consist of tasks that are integrated into the lecture and practical exercises in dealing with "real" systems.

Information

  • Please register in OPAL, because all materials and all communication is restricted to registered course members only. 
Lade Bewertungsübersicht