Login Paper Search My Schedule Paper Index Help

My ICIP 2021 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.
  1. Create a login based on your email (takes less than one minute)
  2. Perform 'Paper Search'
  3. Select papers that you desire to save in your personalized schedule
  4. Click on 'My Schedule' to see the current list of selected papers
  5. Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Paper Detail

Paper IDMLR-APPL-IP-3.2
Paper Title DEEP SENSOR FUSION BASED ON FRUSTUM POINT SINGLE SHOT MULTIBOX DETECTOR FOR 3D OBJECT DETECTION
Authors Yu Wang, Ye Zhang, Shaohua Zhai, Hao Chen, Shaoqi Shi, Harbin Institute of Technology, China; Gang Wang, Alibaba Group, China
SessionMLR-APPL-IP-3: Machine learning for image processing 3
LocationArea F
Session Time:Tuesday, 21 September, 08:00 - 09:30
Presentation Time:Tuesday, 21 September, 08:00 - 09:30
Presentation Poster
Topic Applications of Machine Learning: Machine learning for image processing
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Abstract We present a deep sensor fusion method based on frustum point single shot multibox detector (PointSSD) for autonomous driving scenarios. The proposed method solves the problem of precision degradation in frustum PointNets (F-PointNet) caused by relying heavily on 2D detection and making insufficient use of RGB information. The method mainly consists of two subnetworks: pyramid segmentation network (PSNet) and PointSSD. The proposed PSNet uses a novel architecture capable of performing semantic segmentation on RGB information to generate high quality image semantic information. Using these image semantic information, point cloud semantic information is obtained through projection and is then fused with raw 3D spatial features by deep fusion. The fusion results are processed by PointSSD, which is proposed for classification and bounding box regression. Evaluated on the KITTI dataset, our method is superior to other methods in 3D classification and 3D localization. In addition, our method guarantees robustness to 2D false detections.