Applied Data Analysis
Course Description
Welcome to the course “Applied Data Analysis”. In summer semester 2025, the course takes place in a hybrid setting, consisting of on-site lectures at TU Dresden as well as online lectures and e-learning exercises via Zoom.
This practice-oriented module introduces the fundamentals of applied data analysis and provides an overview of relevant concepts, methods, and technologies. The focus here is particularly on the subfield of predictive analytics and the approaches of (supervised) machine learning for creating predictive models. Using a systematic procedure model, the basic steps and principles of predictive modeling are illustrated and underpinned with example approaches.
Learning objectives
- After actively participating in the module, students can describe the fields of application of applied data analysis and business analytics and classify basic technologies, methods and concepts.
- They are able to classify and differentiate between basic concepts, functionalities and principles of predictive modeling and, in particular, machine (supervised) learning.
- Building on this, they have a comprehensive understanding of the structure and functioning of predictive models based on machine learning methods.
- Students are able to explain the basic steps for developing an comprehensive domain and data understanding, for the exploration and preprocessing of data, and for the development and evaluation of machine learning models using a systematic approach.
- Furthermore, they have mastered the fundamental methods and principles of a range of machine learning models and can apply these to various practical examples, as well as evaluate, interpret and critically question the results.
- In addition, they are able to implement data analysis approaches and machine learning models using the Python programming language to address real-world problems.
Course structure
The practice-oriented module consists of lectures to convey conceptual content and an accompanying e-learning exercise in which selected aspects are explored in more depth and implemented using the programming language Python based on real data sets and demonstration examples.
Contents of the lecture units
- Classification of business analytics concepts and related approaches
- (descriptive analysis methods and tools, esp. business intelligence, online analytical processing, data warehousing, reporting, dashboards)
- Predictive models and differentiation from explanatory models
- Different types of machine learning
- Supervised learning (classification and regression tasks)
- Procedure models in the area of business analytics (e.g. CRISP-DM)
- Domain and data understanding
- Data preprocessing and feature engineering
- Model training with selected machine learning methods (e.g. linear/logistic regression, decision trees, ensemble methods, artificial neural networks)
- Model evaluation
Contents of the e-learning units
- Data exploration and visualization
- Data preprocessing and feature engineering
- Model development and training with selected machine learning models
- Model evaluation and diagnosis
Course information
Module duration
|
1 Semester
|
Module frequency
|
The module is offered at irregular intervals. |
Credits/workload
|
5 ECTS = 150 hours (workload)
|
Forms of teaching
|
|
Lecturers
|
|
Time and Location
|
Lecture (Prof. Zschech)
E-learning exercise (Lasse Bohlen)
For further information see schedule below. |
Recommended prerequisites
|
English language skills at the basic course level of the Abitur are required. The number of participants is limited to a maximum of 30 students. Places are allocated to students at random. There will be a waiting list.
|
Method of examination
|
Written exam (90 min)
|
Teaching and examination language
|
English
|
Schedule (lecture + exercise)
CW
|
Lecture
|
Date / Location
|
E-Learning Exercise
|
Date / Location
|
---|---|---|---|---|
15
|
0) Opening Session | 07.04. (TUD) | Intro Python Environment | 09.04. (online) |
16 | 1) Intro Predictive Analytics | 14.04. (online, Leipzig) | Python Basics for Data Science | 16.04. (online) |
17 | Easter Monday | 21.04. | - | - |
18 | 2) Pipeline ML + 3) Business Understanding | 28.04. (TUD) | (Catch-up exercise) | 30.04. (online) |
19 | 4) Data Understanding | 05.05. (TUD) | Pandas | 07.05. (online) |
20 | 5) Data Preparation I | 12.05. (online, Leipzig) | Pandas / Visualization / Categorical Features | 14.05. (online) |
21 | 5) Data Preparation II | 19.05. (TUD) | (Catch-up exercise) | 21.05. (online) |
22 | 5) Data Preparation III | 26.05. (TUD) | Linear regression / Decision trees | 28.05. (online) |
23 | 6) Regression Models I | 02.06. (online, Leipzig) | Scaling / Encoding / Cleaning | 04.06. (online) |
24 | Pentecost TUD | 09.06. | (Catch-up exercise) | 11.06. (online) |
25 | ECIS Conference | 16.06. | Advanced Models | 18.06. (online) |
26 | 6) Regression Models II + 7) Decision Trees I | 23.06. (TUD) | (Catch-up exercise) | 25.06. (online) |
27 | 7) Decision Trees II + Tree Ensembles | 30.06. (TUD) | Model Validation / Evaluation | 02.07. (online) |
28 | 8) Evaluation | 07.07. (online, Leipzig) | (Catch-up exercise) | 09.07. (online) |
29 | 9) ANNs + 10) CNNs | 14.07. (TUD) | - | - |