Visual object tracking comprises the problem of locating and following the trajectory of a specified target across an image sequence, given only an initial annotation. Techniques have evolved from ...
To address these limitations, a variety of approaches have been proposed to enhance cross-modal interaction and reinforce temporal coherence 7. Some methods employ spatio-temporal attention mechanisms ...