CSC696J: Advanced Topics in Data Systems
The goal of this graduate seminar is to learn about research in the general field of data systems. In this course, we will read and review research papers on data systems. We will also learn how to do research in computer science by reading, evaluating, and presenting papers, and by conducting a research project in data systems. Topics include big data systems, cloud databases, AI systems, natural language-based querying systems, machine learning for systems, high-dimensional data management, data preparation, and serverless computing. The course will host a number of guest lectures given by researchers from industry and academia.
Logistics
Time and venue: Tuesday/Thursday 8:00am-9:15am, Gould-Simpson 701
Syllabus
Piazza link; Access Code: csc696j
D2L
Gradescope; Access Code: EED2WJ
Instructor
Lei Cao
Gould-Simpson 712
Office Hours: Wed 2:00PM – 3:00PM in my office or on Zoom
Guest Lectures (Tentative)
09/09: Dr. Chuan Lei, Amazon Science; text2SQL
09/16: Prof. Chengliang Chai, BIT; Unstructured Data Analysis
09/18: Dr. Bobbie Yogatama, NVIDIA; GPU Databases
09/23: Ferdi Kossmann, PhD, MIT; Agentic Workflow Optimization
09/25: Prof. Chunwei Liu, Purdue; Declarative AI Systems
10/07: Prof. Jin Wang, ASU; Data Discovery
10/09: Dr. Gerardo Vitagliano, MIT; Multi-modal Data Analytics
10/14: Dr. Ji Sun, Tsinghua University; Data + AI
10/16: Prof. Yuzhang Shang, UCF; Efficient Models
10/28: Prof. Yuyu Luo, HKUST; Data Agent
10/30: Prof. Immanuel Trummer, Cornell; LLMs + Databases
11/06: Dr. Tianyu Li, Incoming Assistant Professor, WISC; Cloud Programming
11/13: Prof. Roee Shraga, Assistant Professor, WPI; Humans in the Data Discovery and Integration Loop
11/25: Alexander Lee, PhD, Brown; Semantic Data Processing Systems
12/09: Sylvia Zhang, PhD, MIT; Vector Databases
Schedule
Please see syllabus.
