[1] Facial age editing: Disentanglement of labeled age and uncorrelated unlabeled attributes (FFHQ).
[2] Disentanglement of labeled shape (edge map) and unlabeled texture (Edges2Shoes).
[1] Disentanglement of domain label (cat, dog or wild), correlated appearance and uncorrelated pose. FUNIT and StarGAN-v2 rely on architectural biases that tightly preserve the spatial structure, leading to unreliable facial shapes which are unique to the source domain. We disentangle the pose and capture the appearance of the target breed faithfully.
[2] Male-to-Female translation in two settings: (i) When the gender is assumed to be approximately uncorrelated with all the unlabeled attributes. (ii) When we model the hairstyle as localized correlation and utilize a reference image specifying its target.
@inproceedings{gabbay2021overlord,
author = {Aviv Gabbay and Yedid Hoshen},
title = {Scaling-up Disentanglement for Image Translation},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2021}
}