8 months ago

Action Recognition

Semantic Segmentation

Convolutional Neural Network

Method/Architecture

Computer Vision

Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli

Abstract

Point cloud sequences are irregular and unordered in the spatial dimension while exhibiting regularities and order in the temporal dimension. Therefore, existing grid-based convolutions for conventional video processing cannot be directly applied to spatio-temporal modeling of raw point cloud sequences. In this paper, we propose a point spatio-temporal (PST) convolution to achieve informative representations of point cloud sequences. The proposed PST convolution first disentangles space and time in point cloud sequences. Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension. Furthermore, we incorporate the proposed PST convolution into a deep network, namely PSTNet, to extract features of point cloud sequences in a hierarchical manner. Extensive experiments on widely-used 3D action recognition and 4D semantic segmentation datasets demonstrate the effectiveness of PSTNet to model point cloud sequences.
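
The space/time decomposition described in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation: it assumes a fixed set of anchor points shared by all frames, replaces the learned spatial kernel with a plain average over neighbors inside a radius, and replaces the learned temporal kernel with fixed scalar weights. The names `spatial_step`, `toy_pst_conv`, `anchors`, `radius`, and `temporal_weights` are hypothetical and chosen only for illustration.

```python
import numpy as np


def spatial_step(points, feats, anchors, radius):
    """Average the features of all points within `radius` of each anchor.

    points: (N, 3) coordinates of one frame (N may differ between frames)
    feats:  (N, C) per-point features
    anchors: (M, 3) anchor coordinates
    Returns an (M, C) array; anchors with no neighbors get zeros.
    """
    # Pairwise anchor-to-point distances, shape (M, N).
    dists = np.linalg.norm(anchors[:, None, :] - points[None, :, :], axis=-1)
    mask = (dists <= radius).astype(feats.dtype)
    counts = np.maximum(mask.sum(axis=1, keepdims=True), 1.0)
    return (mask @ feats) / counts


def toy_pst_conv(frames, feats, anchors, radius, temporal_weights):
    """Spatial aggregation per frame, then a weighted sum over a sliding time window.

    frames: list of T arrays of shape (N_t, 3)
    feats:  list of T arrays of shape (N_t, C)
    temporal_weights: sequence of k scalars (assumed fixed, not learned)
    Returns an array of shape (T - k + 1, M, C).
    """
    # Space first: each frame is an unordered point set, so it is handled on its own.
    spatial = np.stack(
        [spatial_step(p, f, anchors, radius) for p, f in zip(frames, feats)]
    )  # (T, M, C)

    # Time second: frames are ordered, so a regular 1D sliding window applies.
    k = len(temporal_weights)
    num_frames = spatial.shape[0]
    return np.stack(
        [
            sum(temporal_weights[j] * spatial[t + j] for j in range(k))
            for t in range(num_frames - k + 1)
        ]
    )


# Three frames with different numbers of points and one feature channel.
frames = [
    np.array([[0.0, 0.0, 0.0], [0.5, 0.0, 0.0], [5.0, 5.0, 5.0]]),
    np.array([[0.1, 0.0, 0.0]]),
    np.array([[0.0, 0.0, 0.2], [0.0, 0.3, 0.0]]),
]
feats = [
    np.array([[1.0], [3.0], [100.0]]),
    np.array([[4.0]]),
    np.array([[6.0], [8.0]]),
]
anchors = np.array([[0.0, 0.0, 0.0]])

out = toy_pst_conv(frames, feats, anchors, radius=1.0, temporal_weights=[0.5, 0.5])
print(out.shape)
```

The spatial step tolerates a different point count in every frame because it only relies on distances to the anchors, while the temporal step is an ordinary windowed sum because frame order is regular. That asymmetry is the point of treating the two dimensions separately.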

