The Applied Social Data Science programme offers a wide range of modules that aim to introduce students to state-of-the-art quantitative methods and social science research skills. Lecture and seminar based modules are offered and assessment is based on assignments and exams. The number of modules on offer and topics covered, and whether there is any choice of module topics, varies from year to year depending on student numbers. For 2023/24 students will also need to choose 3 out of 4 elective modules. Students are expected to bring their own laptop (Mac/Windows/Linux) for use in seminars and tutorials throughout the course; note - tablets are not suitable. For minimum laptop specs, please see this helpful guide
- Computer Programming for Social Scientists (10 ECTS/CORE)
Students will be introduced to Python and R, two fundamental data science programming languages. This will be a core module that later modules will build upon. Basic and intermediate programming skills will be introduced. Familiarity with core programming concepts will be covered: e.g., functions, variables, conditions, loops, data structures, working with libraries, interacting with APIs. Students will engage with the material in a hands-on environment, which will include coding homework/exercise throughout the module. Knowledge will also be assessed with a comprehensive exam at the end of the module.
- Applied Statistical Analysis I (10 ECTS/CORE)
This module introduces the core concepts of quantitative social science research. The module begins with a thorough review of probability theory. It then takes a step by step approach, including description of simple and more complex data, to issues of random samples, to the types and requirements of statistical inference, and finally to linear statistical models. The module provides a solid foundation for further training in statistical modeling, in particular “Applied Statistical Analysis II”. The module will be based on the software package R. Further, the module will introduce core methods in data visualization using R for a range of data: univariate, bivariate, time-series, network, etc. Best practices in visualizing data such as tailoring to the target audience and accessibility concerns will also be introduced.
- Research Design for the Social Sciences (10 ECTS/CORE)
This module will cover crucial aspects of the social scientific research process, with a particular focus on causal inference. Students will become familiar with core concepts of the scientific method: conceptualization, theory development, falsifiability, causality, hypothesis testing, and operationalization. Experimental and quasi-experimental design will also be covered. Students will be assessed on their ability to critically consume and produce social scientific research designs. Specifically, students will rely on research design and scientific method concepts to evaluate a paper presented at a recent international research conference via a peer-review report. Further, a research proposal will be required which covers core research components: identification of a research puzzle, concise review of literature, theoretical development, conceptualization and operational definition of concepts, as well as a description and justification of methodological approach.
- Applied Statistical Analysis II (10 ECTS/CORE)
This module will cover important methods from statistics for the applied data analyst. Students will learn to apply core methods in R such as linear regression modeling and limited dependent variable methods to answer social science questions. Students will also learn how to statistically model quasi-experiment designs such as difference-in-differences, regression discontinuity and panel regression. This will be a project-based module that allows the student to become familiar with the quantitative social science workflow.
- Introduction to Machine Learning (5 ECTS/CORE)
Introduction to Machine Learning is designed to offer an introduction to the basics of ML, specifically with a hands-on curriculum aimed at developing knowledge and skills in establishing ML pipelines with state of the art languages and toolkits. This module is designed for students with limited prior experience of programming. It will introduce the fundamentals of programming, with a focus on setting up an effective pipeline for processing datasets to execute common ML techniques such as Scalable Vector Machines and Linear Regression.
- Social Forecasting (5 ECTS/ELECTIVE)
This module will cover the fundamental techniques and approaches to forecasting. Students will in particular learn the main building blocks of a forecasting model, how to build their own forecasting models, and how to evaluate their performance. Techniques ranging from ARIMA models to neural networks will be applied to both time series and panel data. Students will be able to apply the approaches covered in the module to forecast political, economic, and/or social outcomes in a research report.
- Quantitative Text Analysis for Social Scientists (5 ECTS/ELECTIVE)
This module focuses on a range of computational tools—stemming from the fields of machine learning and natural language processing (NLP)—that are essential for large-scale analyses of text information. The aim is to provide students with a hands-on introduction to collecting, processing, and analyzing “text-as-data” for the purpose of answering important social science research questions. The module will also cover corpus acquisition methods as well as social media research applications. Students will apply these skills to produce a state-of-the-art research report based on a novel collection of text documents and meta-data.
- Experimental Methods for Social Scientists (5 ECTS/ELECTIVE)
The purpose of this module is to develop students’ ability to critically analyze the use of experiments in social, political, economic domains. Students will learn about the design of and practical and ethical issues concerning the design and implementation of survey, lab, and field experiments, including sample selection, effective randomization, attrition, spillover effects and internal and external validity. Students will also gain hands-on experience on experimental design by developing their own projects at the end of the term.
- Spatial Data Analysis (5 ECTS/ELECTIVE)
The use of spatial data has become increasingly popular in social sciences research. Micro-surveys now routinely collect GPS coordinates of households and communities, satellites provide real-time measure-ments of night-time luminosity, and geo-referenced historic maps are linked to outcomes both across long time spans and space. Spatial data serve in general two main purposes. First, they allow meas-uring outcomes that are otherwise hard to measure. Second, they aid identification by, for example, controlling for covariates, enabling the construction of instruments, or exploiting boundaries. In the first part of the course, we will discuss how recent papers are us-ing geo-referenced data, focusing on the role of spatial data in answer-ing research questions. The second part of the course will be hands on: we will cover basic spatial tools in ArcGIS, such as creating datasets on our own, merging spatial datasets, computing distances and the basics of map algebra.
- Dissertation (MSc Students Only) (30 ECTS/CORE)
All students on the Dissertation module are required to complete a dissertation for qualification for the course. Each student will be supervised by a member of staff. Students will receive written feedback from the supervisors on their research proposals. Students will complete and have approved a research proposal by last Friday in March. The final dissertation is due by the first Friday in September.