Mask-CNN: Localizing parts and selecting descriptors for fine-grain...