Command Palette
Search for a command to run...
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization
Saito Shunsuke ; Huang Zeng ; Natsume Ryota ; Morishima Shigeo ; Kanazawa Angjoo ; Li Hao

Abstract
We introduce Pixel-aligned Implicit Function (PIFu), a highly effectiveimplicit representation that locally aligns pixels of 2D images with the globalcontext of their corresponding 3D object. Using PIFu, we propose an end-to-enddeep learning method for digitizing highly detailed clothed humans that caninfer both 3D surface and texture from a single image, and optionally, multipleinput images. Highly intricate shapes, such as hairstyles, clothing, as well astheir variations and deformations can be digitized in a unified way. Comparedto existing representations used for 3D deep learning, PIFu can producehigh-resolution surfaces including largely unseen regions such as the back of aperson. In particular, it is memory efficient unlike the voxel representation,can handle arbitrary topology, and the resulting surface is spatially alignedwith the input image. Furthermore, while previous techniques are designed toprocess either a single image or multiple views, PIFu extends naturally toarbitrary number of views. We demonstrate high-resolution and robustreconstructions on real world images from the DeepFashion dataset, whichcontains a variety of challenging clothing types. Our method achievesstate-of-the-art performance on a public benchmark and outperforms the priorwork for clothed human digitization from a single image.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-reconstruction-on-4d-dress | PIFu_Inner | Chamfer (cm): 2.696 IoU: 0.690 Normal Consistency: 0.792 |
| 3d-human-reconstruction-on-4d-dress | PIFu_Outer | Chamfer (cm): 2.783 IoU: 0.697 Normal Consistency: 0.759 |
| 3d-human-reconstruction-on-cape | PIFu (THuman2.0) | Chamfer (cm): 3.573 NC: 0.186 P2S (cm): 1.483 |
| 3d-human-reconstruction-on-customhumans | PIFu | Chamfer Distance P-to-S: 2.209 Chamfer Distance S-to-P: 2.582 Normal Consistency: 0.805 f-Score: 34.881 |
| 3d-object-reconstruction-from-a-single-image | PIFu | Chamfer (cm): 1.5 Point-to-surface distance (cm): 1.52 Surface normal consistency: 0.084 |
| 3d-object-reconstruction-from-a-single-image-1 | PIFu | Chamfer (cm): 1.14 Point-to-surface distance (cm): 1.15 Surface normal consistency: 0.0928 |
| 3d-object-reconstruction-on-renderpeople | PIFu (3 views) | Chamfer (cm): 0.567 Point-to-surface distance (cm): 0.554 Surface normal consistency: 0.094 |
| lifelike-3d-human-generation-on-thuman2-0 | PIFu | CLIP Similarity: 0.8501 LPIPS: 0.1615 PSNR: 15.0248 SSIM: 0.8884 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.