Digital Trace Data

Logo

Introduction to digital trace data`:` Quality, ethics, and analysis. Utrecht University

Updated at: 28 August 2024 (Javier Garcia-Bernardo and Laura Boeschoten)

Course manual

Introducton to digital trace data provides students with foundational knowledge in digital behavioral data, focusing on collecting, analyzing, and interpretating such data. The course emphasizes various data types and methodologies, and the implications that data and algorithmic biases play on reinforcing inequalities.

Over eight weeks, students will delve into the critical analysis of digital behavioral data. Weekly lectures cover theoretical and methodological background, while practical sessions involve hands-on data collection and analysis. Topics include social media data, web scraping, APIs, data donation, survey data, data quality frameworks, and ethical considerations in data collection and analysis.

Pre-requisites: Participants should be familiar with data analysis software (preferably R or another programming language/statistical analysis software e.g. Python, JASP, Stata, SPSS, SAS).

General course information

The course (7.5EC) is structured through eight weeks as following:

Attendance is mandatory. If you are unable to attend a lecture or practical session, please inform the course coordinator in advance.

If you miss a session (e.g., due to sickness), you should catch up in the regular way: Read the readings, go through the lecture slides, do the practicals, ask your peers if you have questions, and (after the above) ask the lab teacher for further explanation.

To pass the course you need:

Resit: If the final grade is between 4.0 and 5.4, students may resit the exam. The resit will replace the grade from th exam. To have the right to the resit you need to attend over 80% of the practicals (i.e., at least 6) and hand-in your answers to the practical (in a PDF file) the Monday after each practical before 17:00 through this link. Make sure to name your file “labX_lastname.pdf”.

Fraud, plagiarism and use of generative AI

Plagiarism and fraud are serious academic offenses. Plagiarism is defined as the use of another person’s work without proper acknowledgment. This includes copying and pasting text from the internet, from books, or from other students. If you use text from another source, you must put it in quotation marks and provide a citation. If you do not, you are committing plagiarism. Fraud is defined as the use of dishonest methods to gain an unfair advantage. This includes copying another student’s work, submitting work that is not your own, or submitting the same work for two different courses. If you commit fraud or plagiarism, you will fail the course. If you are not sure what constitutes plagiarism or fraud, please see the (UU Fraud and plagiarism policy)[https://students.uu.nl/en/practical-information/policies-and-procedures/fraud-and-plagiarism].

The use of generative AI (e.g., chatGPT) in the group assignment is allowed only for the following cases:

The use of generative AI must be clearly indicated in the assignment, clearly identifying the specific model used and for what purpose. The use of generative AI for other purposes is not allowed.

Who to ask what

Course objectives and learning outcomes:

The course aims to provide students with foundational knowledge in digital behavioral data, focusing on collecting, analyzing, and interpretating such data.

At the end of the course:

Required readings (not complete)

During the course, we will use the following readings:

Books:

Articles: