Publications

Detailed Information

PointMixer: MLP-Mixer for Point Cloud Understanding

DC Field Value Language
dc.contributor.authorChoe, Jaesung-
dc.contributor.authorPark, Chunghyun-
dc.contributor.authorRameau, Francois-
dc.contributor.authorPark, Jaesik-
dc.contributor.authorKweon, In So-
dc.date.accessioned2024-05-09T04:12:23Z-
dc.date.available2024-05-09T04:12:23Z-
dc.date.created2024-05-08-
dc.date.created2024-05-08-
dc.date.issued2022-
dc.identifier.citationLecture Notes in Computer Science, Vol.13687, pp.620-640-
dc.identifier.issn0302-9743-
dc.identifier.urihttps://hdl.handle.net/10371/201290-
dc.description.abstractMLP-Mixer has newly appeared as a new challenger against the realm of CNNs and Transformer. Despite its simplicity compared to Transformer, the concept of channel-mixing MLPs and token-mixing MLPs achieves noticeable performance in image recognition tasks. Unlike images, point clouds are inherently sparse, unordered and irregular, which limits the direct use of MLP-Mixer for point cloud understanding. To overcome these limitations, we propose PointMixer, a universal point set operator that facilitates information sharing among unstructured 3D point cloud. By simply replacing token-mixing MLPs with Softmax function, PointMixer can mix features within/between point sets. By doing so, PointMixer can be broadly used for intra-set, inter-set, and hierarchical-set mixing. We demonstrate that various channel-wise feature aggregation in numerous point sets is better than self-attention layers or dense token-wise interaction in a view of parameter efficiency and accuracy. Extensive experiments show the competitive or superior performance of PointMixer in semantic segmentation, classification, and reconstruction against Transformer-based methods.-
dc.language영어-
dc.publisherSpringer Verlag-
dc.titlePointMixer: MLP-Mixer for Point Cloud Understanding-
dc.typeArticle-
dc.identifier.doi10.1007/978-3-031-19812-0_36-
dc.citation.journaltitleLecture Notes in Computer Science-
dc.identifier.wosid000903590200036-
dc.identifier.scopusid2-s2.0-85142704413-
dc.citation.endpage640-
dc.citation.startpage620-
dc.citation.volume13687-
dc.description.isOpenAccessY-
dc.contributor.affiliatedAuthorPark, Jaesik-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Dept. of Computer Science and Engineering
Research Area Computer Graphics, Computer Vision, Machine Learning, Robotics

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share