CS 4973 / CS 6983: Trustworthy Generative AI Fall
2024 |
Instructors: § Instructor:
Alina Oprea (alinao) § TA: Pravind
Anand Pawar Class Schedule: § Tuesday
11:45am-1:25pm and Thursday 2:50-4:30pm ET §
Location: Ryder Hall 161 Office Hours: § Alina:
Thursday 4:30-5:30pm ET and by appointment § Pravind: Monday 5-6pm ET on Zoom Class forum: Canvas with links to Piazza and Gradescope Class policies: Academic integrity policy is strictly enforced.
Class
Description: Recently, generative AI has been
increasingly deployed in critical domains such as medicine, biology, finance,
and cyber security. Foundation models such as large language models (LLMs)
have been trained on massive datasets crawled from the web and are
subsequently finetuned to new tasks including summarization, translation,
code generation, and conversational agents. This trend raises many concerns
about the security of AI models in critical applications, as well as the
privacy of the data used to train these models. In this course, we will study a variety of
adversarial attacks on generative AI that impact the
security and privacy of these systems. We will cover multiple deployment
models for generative AI, including
fine-tuning and Retrieval Augmented Generation. We will also discuss existing
mitigations against security and privacy vulnerabilities, and the challenges
in making AI trustworthy. We will read
and debate papers published in top-tier conferences in ML and cyber security.
Students will have an opportunity to work on a semester-long research project
in trustworthy AI. Disclaimer: This
course is not meant to be the first course taken by a student in ML/AI. This course focuses on recent research in security and
privacy of ML and AI. Prior knowledge in machine learning is essential for
following this course. If you have any questions about the course content,
please email the instructor.
Pre-requisites:
§
Calculus and linear algebra §
Basic knowledge of machine learning Grading
The grade will be based on:
§ Assignments – 20% § Paper summaries – 10% § Final project report – 40% § Final project presentation – 10% § Paper presentation and class
participation – 20%
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Review
materials § Probability
review notes from Stanford's machine learning class § Sam Roweis's probability
review § Linear
algebra review notes from Stanford's
machine learning class
Other resources
Books: § Trevor Hastie,
Rob Tibshirani, and Jerry Friedman. Elements
of Statistical Learning. Second Edition, Springer, 2009. § Christopher
Bishop. Pattern Recognition and Machine Learning. Springer,
2006. § A. Zhang, Z.
Lipton, and A. Smola. Dive into Deep Learning § C. Dwork and A. Roth. The Algorithmic Foundations of Differential Privacy § Shai
Ben-David and Shai Shalev-Shwartz. Understanding Machine Learning: From Theory to Algorithms |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|