‘Hide-and-Seek’ Random Masking Transform #6796

Open
@faberno

🚀 The feature

Source
Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization (Scholar, arXiv)
Number of citations: 542

Method
The image is divided into a grid of square patches, and each patch is masked independently with probability p. The inputs are therefore patch_size, p and fill_value.
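
To make the method concrete, here is a minimal NumPy sketch of the masking step. It is not the implementation mentioned in this issue and does not follow the torchvision transform API. The function name `hide_and_seek`, the `rng` argument, the default values, and the handling of images whose size is not a multiple of `patch_size` (edge patches are simply smaller) are all assumptions made for illustration.

```python
import numpy as np


def hide_and_seek(image, patch_size, p=0.5, fill_value=0, rng=None):
    """Mask each grid patch of an (H, W) or (H, W, C) array with probability p.

    Illustrative sketch only: name, defaults, and edge handling are assumptions.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be in [0, 1]")
    if patch_size <= 0:
        raise ValueError("patch_size must be positive")
    rng = np.random.default_rng() if rng is None else rng

    height, width = image.shape[:2]
    # Number of grid cells per axis; cells at the border may be smaller.
    rows = -(-height // patch_size)  # ceiling division
    cols = -(-width // patch_size)

    # One independent Bernoulli(p) draw per grid cell: True means "hide".
    hide = rng.random((rows, cols)) < p

    # Expand the cell-level decisions to pixel resolution, then crop to the image.
    pixel_mask = np.repeat(np.repeat(hide, patch_size, axis=0), patch_size, axis=1)
    pixel_mask = pixel_mask[:height, :width]

    out = image.copy()  # leave the input untouched
    out[pixel_mask] = fill_value
    return out
```

A call such as `hide_and_seek(img, patch_size=16, p=0.5, fill_value=0)` returns a copy of `img` in which roughly half of the 16x16 patches are replaced by zeros. Passing a seeded `np.random.default_rng(...)` as `rng` makes the result reproducible.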

Motivation, pitch

As described in the paper, this augmentation can improve weakly-supervised object localization, because the network learns to find all relevant parts of an object rather than only the most discriminative ones.

I have already implemented this method, so I could open a PR if you think this feature would be a useful addition.

Alternatives

No response

Additional context

Here is an example from the paper:
Screenshot from 2022-10-19 22-56-54

cc @vfdev-5 @datumbox
