Hi, im trying to reimplement RACNN, and your src code helps me a lot.
But here are some questions confuse me.
-
How the grad flow back to APN while training APN? I mean that APN have 2 inputs: last image and coordinates[tx, ty, tl] from FC layer, and the output is cropped finer image. But while backprop, how to compute grad of [tx, ty, tl]?
-
Rank loss indeed takes as inputs two probabilities in paper, how to optimize 3 APN networks from 2 loss?
Thanks a lot.