Classification and Analysis of Categorical Data

In this course you learn methods for analysing discrete, or categorical, data (for example frequencies and proportions). These methods are particularly important in biostatistics.

The course covers models for categorical data, two-way and multi-way contingency tables, homogeneity and independence. In particular, generalized linear models for discrete data (including logistic regression and loglinear models) are studied with respect to regression, classification and diagnostics.

As of autumn 2025, this course replaces Categorical Data Analysis (MT5019).

The course consists of two modules, theory and computer assignments.


Teaching Format

Instruction is given in the form of lectures, exercise sessions, computer assignments and seminars. Some instruction is mandatory; see Assessment below.


Assessment

The course is assessed through written examination and written reports on computer assignments.

To pass the course, you must attend the mandatory seminars.

Examiner

The schedule will be available no later than one month before the start of the course. We do not recommend print-outs as changes can occur. At the start of the course, your department will advise where you can find your schedule during the course.


Note that the course literature can be changed up to two months before the start of the course.

Agresti: Categorical Data Analysis. Wiley.




Course web

You can find our course webpages on kurser.math.su.se.