B9145: Topics in Trustworthy AI

Hongseok Namkoong, Columbia University, Spring 2025

Course Schedule

Date Topics Lecture Notes Readings
Jan 30 Course Overview Part 1: Course Overview,
Part 2: Data-centric View of AI
CLIP
Feb 6 Language for Distribution Shifts SlidesDRO DISDE
Feb 13 Domain Adaptation UnderspecificationTatsu Hashimoto's slides A theory of learning from different domains
Feb 20 Invariance InvarianceIRMDomain Generalization In Search of Lost Domain Generalization
Feb 27 LLM pre-training, SFT, RLHF OverviewFinetuningRLHF GPT3
Mar 6 LLM reasoning, inference-time operations Slides
Mar 27 Pre-training data and scaling laws Pre-training Data, Scaling Laws



Email the instructor if you want access to lecture recordings.