Abstract :
With the explosive growth of web data, effective and efficient technologies are in urgent needs for retrieving semantically relevant contents of heterogeneous modalities. Previous studies construct global transformations to project the heterogeneous data into a measurable subspace. However, global projections cannot appropriately adapt to diverse contents, and the naturally existing multi-level semantic relation in web data is ignored. We study the problem of semantic coherent retrieval, where documents from different modalities should be ranked by the semantic relevance to the queries. Accordingly, we propose TINA, a correlation learning method by Adaptive Hierarchical Semantic Aggregation. First, by joint modeling of content and ontology similarities, we build a semantic hierarchy to measure multi-level semantic relevance. Second, with a set of local linear projections aggregated by gating functions, we optimize the structure risk objective function that involves semantic coherence measurement, local projection consistency and the complexity penalty of local projections. Therefore, semantic coherence and a better bias-variance trade-off can be achieved by TINA. Extensive experiments on widely used NUS-WIDE and ICML-Challenge datasets demonstrate that TINA outperforms state-of-the-art, and achieves better adaptation to the multi-level semantic relation and content divergence.
Keywords :
Internet; content-based retrieval; learning (artificial intelligence); relevance feedback; ICML-challenge dataset; NUS-WIDE dataset; TINA; Web data; adaptive hierarchical semantic aggregation; bias-variance trade-off; complexity penalty; content divergence; content similarity; correlation learning method; cross-modal correlation learning; gating function; global transformation; heterogeneous data; heterogeneous modality; joint modeling; local linear projection; local projection consistency; multilevel semantic relation; multilevel semantic relevance; ontology similarity; semantic coherence measurement; semantic coherent retrieval; semantic hierarchy; semantically relevant content retrieval; structure risk objective function; Adaptation models; Coherence; Correlation; Ontologies; Semantics; Training; Visualization; Cross-modal retrieval; Local correlation learning; Semantic hierarchy;