Compositional Retrieval

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks

A versatile and efficient multi-task model for fashion-focused vision-and-language (V+L) tasks.

FashionViL: Fashion-Focused Vision-and-Language Representation Learning

A versatile and flexible framework for fashion-focused V+L representation learning.

UIGR: Unified Interactive Garment Retrieval

A unified framework and benchmark for two interactive garment retrieval tasks.

Copyright © Xiao (Brandon) Han · Last updated June 2023