Skip to content
Discussion options

You must be logged in to vote

Hi @cbe135, the input channel of the mask autoencoder is 8 (see

).
This is mainly because we use binary representation to encode the input mask, which saves memory. 8 channels can represent 2**8 (0~255) labels. Each channel represents a bit (see the following function).
def binarize_labels(x: Tensor, bits: int = 8) -> Tensor:
"""
Convert input tensor to binary representation.
This function takes an input tensor and converts it to a binary representation
using the s…

Replies: 4 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by cbe135
Comment options

You must be logged in to vote
1 reply
@guopengf
Comment options

guopengf Sep 9, 2025
Collaborator

Comment options

You must be logged in to vote
2 replies
@guopengf
Comment options

@mar-cry
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants