Projects

Transfer Learning via Online System Identification.

PDF Code

In this project, we implemented the Soft Actor-Critic method on OpenAI Gym classic control environments, together with Proximal Policy Optimization and the GAE(λ) advantage function. We trained a universal policy with dynamic system identification, aiming to reduce the sim-to-real gap.

Talks

Language Models are Few-Shot Learners: MLRG 2022 Fall.

Slides

Summary

Teaching

July 2017 - Present: CPSC 532M/340: Machine Learning and Data Mining

Jan. 2018 - Apr. 2018: Math 152: Linear Systems

Jan. 2017 - Apr. 2018: CPSC 121: Models of Computation

July 2017 - Aug. 2017: Math 102: Integral Calculus