资讯

Abstract: Cross-modal remote sensing image-text retrieval (CMRSITR) involves retrieving relevant samples in one modality based on a query from another modality. Previous dense retrieval methods ...
Abstract: Cross-lingual voice conversion (XVC) is a technology that modifies speaker identity while preserving linguistic content in scenarios where the source and target speakers use different ...