Publications

Detailed Information

Fast Point Transformer

DC Field Value Language
dc.contributor.authorPark, Chunghyun-
dc.contributor.authorJeong, Yoonwoo-
dc.contributor.authorCho, Minsu-
dc.contributor.authorPark, Jaesik-
dc.date.accessioned2024-05-09T04:12:35Z-
dc.date.available2024-05-09T04:12:35Z-
dc.date.created2024-05-08-
dc.date.created2024-05-08-
dc.date.issued2022-
dc.identifier.citationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol.2022-June, pp.16928-16937-
dc.identifier.issn1063-6919-
dc.identifier.urihttps://hdl.handle.net/10371/201294-
dc.description.abstractThe recent success of neural networks enables a better interpretation of 3D point clouds, but processing a large-scale 3D scene remains a challenging problem. Most current approaches divide a large-scale scene into small regions and combine the local predictions together. However, this scheme inevitably involves additional stages for pre- and post-processing and may also degrade the final output due to predictions in a local perspective. This paper introduces Fast Point Transformer that consists of a new lightweight self-attention layer. Our approach encodes continuous 3D coordinates, and the voxel hashing-based architecture boosts computational efficiency. The proposed method is demonstrated with 3D semantic segmentation and 3D detection. The accuracy of our approach is competitive to the best voxel-based method, and our network achieves 129 times faster inference time than the state-of-the-art, Point Transformer, with a reasonable accuracy trade-off in 3D semantic segmentation on S3DIS dataset.-
dc.language영어-
dc.publisherIEEE-
dc.titleFast Point Transformer-
dc.typeArticle-
dc.identifier.doi10.1109/CVPR52688.2022.01644-
dc.citation.journaltitleProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition-
dc.identifier.wosid000870783002072-
dc.identifier.scopusid2-s2.0-85132778978-
dc.citation.endpage16937-
dc.citation.startpage16928-
dc.citation.volume2022-June-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorPark, Jaesik-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
dc.subject.keywordAuthor3D from multi-view and sensors-
dc.subject.keywordAuthorgrouping and shape analysis-
dc.subject.keywordAuthorScene analysis and understanding-
dc.subject.keywordAuthorSegmentation-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Dept. of Computer Science and Engineering
Research Area Computer Graphics, Computer Vision, Machine Learning, Robotics

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share