Master Thesis on “Egocentric Gaze Prediction via Self-Supervised Feature Forecasting”

Predicting human gaze—especially in egocentric or driving scenarios—is fundamentally about modeling where people will attend next, not just where they are looking now. This thesis focus on designing and implementing a pipeline that integrates feature forecasting with a gaze prediction module, conduct experiments on egocentric datasets, and systematically evaluate the benefits of future-aware representations.

Master Thesis on “Learning Latent Action Policies for Autonomous Driving”

Learning action-conditioned driving dynamics from raw pixels is challenging due to high dimensionality and weak temporal cues. This work combines Latent Action Pretraining (LAPA) with a conditional Diffusion Model to learn discrete latent actions and predict their future evolution. The framework captures multi-modal driving behaviors in latent space, enabling interpretable and data-efficient policy learning for autonomous driving.