[Topic] Visual Relationship and Vision-Language Representation


Reasoning about the relationships between objects is crucial for holistic scene understanding. Beyond object recognition and detection, the relationships between objects carry rich semantic information about a scene. We have developed a series of methods for visual relation detection, visual relation grounding, and object relation reasoning. In particular, we are interested in cross-modal representation learning across the visual and language domains, and in using commonsense knowledge to facilitate visual relation understanding.

