V+L

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks

A versatile and efficient multi-task model for fashion-focused V+L tasks.

Large-Scale Product Retrieval with Weakly Supervised Representation Learning

The second place solution for 2nd eBay eProduct Visual Search Challenge (FGVC9-CVPR2022).

FashionViL: Fashion-Focused Vision-and-Language Representation Learning

A versatile and flexible framework for fashion-focused V+L representation learning.


Copyright © Xiao (Brandon) Han · Last update on June 2023 · Powered by the Academic theme for Hugo.