#
Data Science

*Prof. Dr. rer. nat. Marius Lindauer*

Übungsbetreuung:

## Background

In the Era of Big Data, one of the emerging requirements for any scientist is the ability to effectively and critically work with data, i.e., collect and extract data, create surveys, transform the data, apply mathematical models on the data, and visualize the important aspects. In fact, the Society of Computer Science (Gesellschaft der Informatik) has coined the term data literacy to describe various competencies in this regard. In the same spirit, the goal of this course is to teach non-computer scientists the foundational concepts of data science. Students will learn to analyze data for the purpose of understanding and describing real-world phenomena. The students will obtain skills in data-centric programming and statistical inference. Furthermore, the students will gain hands-on experience on daily challenges of a data scientist with best practice approaches for data collection and preparation. Finally, we will discuss ethical and social aspects of data science.
The course consists of a standard lecture and lab work. During the lecture the important concepts are introduced. In the lab sessions, students will be guided in practical programming exercises. In addition, the students receive bi-weekly assignments that follow-up on the lab exercises. The successful participation in the assignments is a pre-requisite to take part in the final written exam.

## Target audience

Computer scientists

## Requirements

We strongly recommend that you know the foundations of
necessities:

- Probability theory
- Lecture on datastructures and algorithms

recommended:

- Lecture on foundations of data bank systems

in order to attend the course.

## Topics

The lectures are partioned in several parts, including:

- Data Sampling and Probability
- Data Preparation
- Visualizations
- Intro to Modeling
- Simple Linear Regression + OLS
- Feature Engineering
- Bias and Variance
- Cross-Validation, Regularization and AutoML
- Classification
- Inference for Modelling
- Conclusion and Ethics

## Literature

## Dynamics

**The course will be in English.**
- Lecture: Monday 14:00 (s.t.) - 15:30, Room F102, building 1101 Mainbuilding Welfengarten 1
- Excercise: Thursday 11:00 - 12:30, Raum 031, e-Classroom eNIFE, building 3702: Informationstechnologie

## Enlist

**Enlisting is done exclusively via studip on a first-come-first-serve basis. There is a limit of 100 Students.**