Abstract:
According to embodiments of the present disclosure, methods of and computer program products for operating a plurality of classifiers are provided. A plurality of input entities are read, each input entity having an associated target label. The input entities are provided to a first classifier, and a category of each input entity is obtained therefrom. A feature map is determined for each input entity. Each feature map is provided to each of a set of classifiers, and an assigned label is obtained for each feature map from each of the set of classifiers. Each classifier is associated with one of the categories. For each classifier, the assigned label for each feature map is compared to the target labels to determine a plurality of gradients. The plurality of gradients are masked according to each category, yielding a masked set of gradients for each category. Each classifier is trained according its associated masked gradients.