
Tianyue (Helen) Zhang 张天玥

Ph.D. student @ Mila

I am a second-year Ph.D. student in computer science at Mila/Université de Montréal, DIRO, under the supervision of Professor Simon Lacoste-Julien. Here is our group. Here is my CV.

My research interest is understanding optimization algorithms in modern machine learning. By developing theoretical frameworks that capture problem-specific structures, I aim to improve optimizer efficiency and robustness, enabling fast and reliable predictions and decision-making.

I was born in Gansu and grew up in Guangzhou, China. I obtained my Bachelor's (combined honors in computer science and mathematics) and Master's degrees at the University of British Columbia, Vancouver, where I worked in lab ml-568 under the supervision of Dr. Mark Schmidt.

I am an organizer for Women@Mila and MTL MLOpt. Here is a picture of my cat Cashew. In my free time, I enjoy going up, down, and around.

Publications

Understanding Adam Requires Better Rotation Dependent Assumptions

Opt Workshop @ NeurIPS 2024, full paper in submission Paper

L. Maes*, T. H. Zhang*, A. Jolicoeur-Martineau, I. Mitliagkas, D. Scieur, S. Lacoste-Julien, C. Guille-Escuret

On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization

ICML 2024 Paper

M. Sohrabi*, J. Ramirez*, T. H. Zhang, S. Lacoste-Julien, J. Gallego-Posada

Efficient and Adaptive Posterior Sampling Algorithms for Bandits

Paper

B. Hu, Z. Huang, T. H. Zhang, M. Lécuyer, N. Hegde

From 6235149080811616882909238708 to 29: Vanilla Thompson Sampling Revisited

Opt Workshop @ NeurIPS 2023 Paper Poster

B. Hu, T. H. Zhang

Optimistic Thompson Sampling for Episodic Reinforcement Learning

UAI 2023 Paper Code Poster (Upperbound) Poster (UAI)

B. Hu, T. H. Zhang, N. Hegde, M. Schmidt

Thesis

Optimistic Thompson Sampling: Strategic Exploration in Bandits and Reinforcement Learning

Master's Thesis

Supervisory Committee: Mark Schmidt, Mathias Lécuyer

PDF Slides

Talks

Understanding Adam Requires Better Rotation Dependent Assumptions: Montreal MLOpt Abstract

Safe Reinforcement Learning from Human Feedback: RLHF reading group @Mila Slides

Deep exploration via randomized value function: RLRG 2023 Spring Slides

Transformers, large language models, and the magic behind ChatGPT: Guest Lecture for CPSC 340 Recording Slides

Language models are few-shot learners: MLRG 2022 Fall Slides

Active Learning for semantic segmentation: MLRG 2022 Summer Slides

Probabilistic topic modelling Slides