Abstract: To establish semantic associations between images and texts, existing Image-Text Retrieval (ITR) methods primarily focus on fixed-scale fragments, which only identify explicit semantic ...
Abstract: Text-to-image person retrieval aims to match target pedestrian images based on a text query. Existing methods mainly learn feature alignment between texts and pedestrian images from global ...