Estimating 6D poses and reconstructing 3D shapes of objects in open-worl...
The raw depth image captured by indoor depth sensors usually has an exte...
In this paper, we propose a novel representation for grasping using cont...
Channel pruning can effectively reduce both computational cost and memor...
Detecting 3D objects from point clouds is a practical yet challenging ta...
The raw depth image captured by the indoor depth sensor usually has an
e...
The vision transformer splits each image into a sequence of tokens with ...
This work investigates the usage of batch normalization in neural
archit...