Scene Understanding (Scene Graph Generation), Vision Language, Graph Neural Networks
Out-of-Distribution Detection and Generalization, Uncertainty estimation, Learning from noisy labels
Robotics, Multimodality, Visual-and-language navigation, Embodied learning