PubCard - Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

(Preprint) Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Zhihao Liang ¹ ², Zhihao Li ³, Songcen Xu 许松岑 ³, Mingkui Tan 谭明奎 ¹, Kui Jia 贾奎 ¹ ⁴ ⁵

¹ South China University of Technology
华南理工大学
² DexForce Technology Co., Ltd.
跨维（广州）智能科技有限公司
³ Noah’s Ark Lab, Huawei Technologies
华为诺亚方舟实验室
⁴ Pazhou Laboratory
琶洲实验室（人工智能与数字经济广东省实验室）
⁵ Peng Cheng Laboratory
鹏城实验室

arXiv, 2021-08-17

https://arxiv.org/abs/2108.07478

Abstract

Instance segmentation in 3D scenes is fundamental to many applications of scene understanding. It remains challenging, however, because of the compound factors of data irregularity and uncertainty in the number of instances. State-of-the-art methods largely rely on a general pipeline that first learns point-wise features that are discriminative at the semantic and instance levels, and then groups points in a separate step to propose object instances. While promising, these methods have two shortcomings: (1) the second step is not supervised by the main objective of instance segmentation, and (2) their point-wise feature learning and grouping are less effective at dealing with data irregularities, possibly resulting in fragmented segmentations.

To address these issues, we propose an end-to-end solution, the Semantic Superpoint Tree Network (SSTNet), for proposing object instances from scene points. The key component of SSTNet is an intermediate semantic superpoint tree (SST), which is constructed from the learned semantic features of superpoints and is then traversed and split at intermediate tree nodes to produce object instance proposals. SSTNet also includes a refinement module, termed CliqueNet, that prunes superpoints that may have been wrongly grouped into instance proposals.
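The tree idea described above can be illustrated with a small sketch. This is not the authors' implementation. It assumes each superpoint already has a semantic feature vector, builds a binary tree by greedily merging the closest pair of nodes (plain agglomerative clustering, chosen here for illustration), and then traverses the tree top-down, splitting a node whenever its two children were merged at a distance above a fixed threshold. The names `Node`, `build_tree`, `propose_instances` and `split_threshold` are hypothetical, and the fixed threshold is an assumed stand-in for whatever split decision SSTNet actually makes at intermediate nodes. CliqueNet refinement is not modeled.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Node:
    """A tree node covering a set of superpoint indices (hypothetical structure)."""
    members: List[int]
    feature: List[float]            # mean semantic feature of the members
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    merge_dist: float = 0.0         # distance between the children when merged


def _dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_tree(features: Sequence[Sequence[float]]) -> Node:
    """Bottom-up construction: repeatedly merge the two closest nodes."""
    nodes = [Node([i], list(f)) for i, f in enumerate(features)]
    while len(nodes) > 1:
        best = None
        for i in range(len(nodes)):
            for j in range(i + 1, len(nodes)):
                d = _dist(nodes[i].feature, nodes[j].feature)
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        a, b = nodes[i], nodes[j]
        na, nb = len(a.members), len(b.members)
        # The parent feature is the size-weighted mean of the child features.
        merged = [(x * na + y * nb) / (na + nb) for x, y in zip(a.feature, b.feature)]
        parent = Node(a.members + b.members, merged, a, b, d)
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)] + [parent]
    return nodes[0]


def propose_instances(root: Node, split_threshold: float) -> List[List[int]]:
    """Top-down traversal: split dissimilar nodes, emit the rest as proposals.

    The fixed threshold is an assumption made for this sketch only.
    """
    proposals = []
    stack = [root]
    while stack:
        node = stack.pop()
        if node.left is not None and node.merge_dist > split_threshold:
            stack.extend([node.right, node.left])
        else:
            proposals.append(sorted(node.members))
    return proposals


# Toy usage: five superpoints with made-up 2D "semantic" features.
toy_features = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0], [10.0, 0.0]]
toy_root = build_tree(toy_features)
toy_proposals = propose_instances(toy_root, split_threshold=1.0)
```

In this toy setting the nearby superpoints end up in the same proposal and the isolated one forms its own. The quadratic pair search keeps the sketch short and is only suitable for small inputs.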

Experiments on the ScanNet and S3DIS benchmarks show the efficacy of the proposed method. At the time of submission, SSTNet ranks first on the ScanNet (V2) leaderboard, with an mAP 2% higher than that of the second-best method.
