- Introduction
Problem definition: the original NeRF requires a large number of images and a very long per-scene optimization, which makes it impractical
► Unlike NeRF, which uses no image features, pixelNeRF takes spatial image features aligned to each pixel as input
► Unlike NeRF, pixelNeRF works well from only a few input images
Framework
- Single Image
- Input image → Fully convolutional image feature grid
- Sample the corresponding image feature via projection and bilinear interpolation
- The query specification (3D position and viewing direction) is sent along with the image features to the NeRF network
- Multiple Images
- Input image → Latent representation in each camera's coordinate frame
- Per-view features are pooled in an intermediate layer
View space: Viewer-centered coordinate system
Canonical space: Object-centered coordinate system
pixelNeRF predicts in view space rather than canonical space, which improves generalization to unseen categories and scenes
- Method
Overall architecture
- Fully-convolutional image encoder $E$: Encodes the input image into a pixel-aligned feature grid
- Pretrained ResNet-34
- NeRF network $f$: Outputs color and density
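A minimal sketch of how such a pixel-aligned encoder could be built, assuming a torchvision ResNet-34 backbone; the tapped stages and the shared upsampling resolution are assumptions for illustration, not the paper's exact configuration.

```python
# Sketch of a pixel-aligned feature encoder E (assumptions: torchvision
# ResNet-34 backbone, taps after the stem and first three residual stages,
# bilinear upsampling to a shared resolution).
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision

class PixelAlignedEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        resnet = torchvision.models.resnet34(weights="IMAGENET1K_V1")
        # Keep only the convolutional trunk; discard the avgpool / fc head.
        self.stem = nn.Sequential(resnet.conv1, resnet.bn1, resnet.relu)
        self.pool = resnet.maxpool
        self.stages = nn.ModuleList([resnet.layer1, resnet.layer2, resnet.layer3])

    def forward(self, image):                      # image: (B, 3, H, W)
        x = self.stem(image)                       # (B, 64, H/2, W/2)
        feats = [x]
        x = self.pool(x)
        for stage in self.stages:                  # strides 4, 8, 16
            x = stage(x)
            feats.append(x)
        # Upsample every scale to the stem resolution and concatenate,
        # giving one feature grid W that stays aligned with image pixels.
        size = feats[0].shape[-2:]
        feats = [F.interpolate(f, size=size, mode="bilinear", align_corners=True)
                 for f in feats]
        return torch.cat(feats, dim=1)             # W = E(I): (B, C, H/2, W/2)
```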
Single-Image pixelNeRF
Input image $\mathbf{I}$
Feature volume $\mathbf{W}=E(\mathbf{I})$
Query point $\mathbf{x}$ on a camera ray: Retrieve the corresponding image feature by projecting $\mathbf{x}$ onto the image plane at coordinates $\pi(\mathbf{x})$
Feature vector $\mathbf{W}(\pi(\mathbf{x}))$: Obtained by bilinearly interpolating between the pixelwise features
The image feature, position, and viewing direction are passed into the NeRF network
$f(\gamma(\mathbf{x}), \mathbf{d} ; \mathbf{W}(\pi(\mathbf{x})))=(\sigma, \mathbf{c})$
$\gamma(\cdot)$: Positional encoding
$\mathbf{d}$: Viewing direction
$\mathbf{x}$: Query point
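A sketch of one such query, under assumptions: the points $\mathbf{x}$ are already in the input view's camera frame, a pinhole intrinsic matrix `K` with the principal point at the image center, and `nerf_mlp` as a hypothetical stand-in for the NeRF network $f$ (the paper's network injects the feature through residual blocks rather than plain concatenation).

```python
# Sketch of a single-image pixelNeRF query: project x, bilinearly sample
# the feature grid, and evaluate f(gamma(x), d; W(pi(x))).
import torch
import torch.nn.functional as F

def positional_encoding(x, num_freqs=6):
    """gamma(x): [sin(2^k pi x), cos(2^k pi x)] for k = 0..num_freqs-1."""
    freqs = (2.0 ** torch.arange(num_freqs)) * torch.pi   # (num_freqs,)
    angles = x[..., None] * freqs                          # (N, 3, num_freqs)
    enc = torch.cat([angles.sin(), angles.cos()], dim=-1)
    return enc.flatten(-2)                                 # (N, 6 * num_freqs)

def query_single_view(nerf_mlp, W, K, x, d):
    """W: (1, C, Hf, Wf) feature grid; x, d: (N, 3) points and directions
    in the input view's camera frame; K: (3, 3) pinhole intrinsics."""
    # pi(x): pinhole projection to pixel coordinates.
    uvz = (K @ x.T).T                                      # (N, 3)
    uv = uvz[:, :2] / uvz[:, 2:3]                          # (N, 2)
    # Normalize to [-1, 1] for grid_sample (principal point assumed centered).
    H_img, W_img = 2.0 * K[1, 2], 2.0 * K[0, 2]
    grid = torch.stack([uv[:, 0] / W_img, uv[:, 1] / H_img], dim=-1) * 2 - 1
    grid = grid.view(1, -1, 1, 2)                          # (1, N, 1, 2)
    # W(pi(x)): bilinear interpolation of the pixelwise features.
    feat = F.grid_sample(W, grid, mode="bilinear", align_corners=True)
    feat = feat.squeeze(0).squeeze(-1).T                   # (N, C)
    # f(gamma(x), d; W(pi(x))) -> (sigma, c)
    out = nerf_mlp(torch.cat([positional_encoding(x), d, feat], dim=-1))
    sigma, rgb = out[:, 0].relu(), out[:, 1:4].sigmoid()
    return sigma, rgb
```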
Multiple Views
Unlike prior work, which used only a single input view at test time, pixelNeRF can use an arbitrary number of input views at test time to incorporate additional information
$i$th input image $\mathbf{I}^{(i)}$
Camera transform from world space to view $i$'s coordinate frame: $\mathbf{P}^{(i)}=\left[\begin{array}{ll} \mathbf{R}^{(i)} & \mathbf{t}^{(i)} \end{array}\right]$
$\mathbf{x}^{(i)}=\mathbf{P}^{(i)} \mathbf{x}, \quad \mathbf{d}^{(i)}=\mathbf{R}^{(i)} \mathbf{d}$
$\mathbf{V}^{(i)}=f_1\left(\gamma\left(\mathbf{x}^{(i)}\right), \mathbf{d}^{(i)} ; \mathbf{W}^{(i)}\left(\pi\left(\mathbf{x}^{(i)}\right)\right)\right)$
The intermediate vectors $\mathbf{V}^{(1)}, \ldots, \mathbf{V}^{(n)}$ are aggregated with the average-pooling operator $\psi$ and passed to $f_2$
► $(\sigma, \mathbf{c})=f_2\left(\psi\left(\mathbf{V}^{(1)}, \ldots, \mathbf{V}^{(n)}\right)\right)$
In the single-view case, this reduces to $f=f_2 \circ f_1$
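A sketch of this multi-view aggregation, assuming the per-view features $\mathbf{W}^{(i)}(\pi(\mathbf{x}^{(i)}))$ have already been sampled and `f1` / `f2` are hypothetical callables for the two halves of the NeRF network ($\gamma$ and the feature injection are folded into `f1` for brevity).

```python
# Sketch of the multi-view case: transform the query into each input view's
# camera frame, run the shared trunk f1 per view, average-pool the
# intermediate vectors V^(i) (the operator psi), and decode with f2.
import torch

def query_multi_view(f1, f2, feats, poses, x_world, d_world):
    """feats[i]: (C,) sampled feature W^(i)(pi(x^(i))) for view i;
    poses[i]: (R, t) world-to-view rotation (3, 3) and translation (3,)."""
    V = []
    for feat, (R, t) in zip(feats, poses):
        x_i = R @ x_world + t             # x^(i) = P^(i) x
        d_i = R @ d_world                 # d^(i) = R^(i) d
        V.append(f1(x_i, d_i, feat))      # V^(i)
    pooled = torch.stack(V).mean(dim=0)   # psi: average pooling across views
    return f2(pooled)                     # (sigma, c)
```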
- Experiment
Datasets
- ShapeNet
- Category-specific
- Category-agnostic
- ShapeNet scenes
- Unseen categories
- Multiple objects
- Domain transfer to real car photos
- DTU MVS dataset
- Real scenes
Baselines
- SRN
- DVR
- SoftRas for category-agnostic setting
Metrics
- PSNR
- SSIM
- LPIPS
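For reference, PSNR is simple to compute directly, while SSIM and LPIPS are usually taken from existing libraries (e.g. `skimage.metrics` and the `lpips` package) rather than re-implemented.

```python
# PSNR for images with values in [0, 1].
import torch

def psnr(pred: torch.Tensor, target: torch.Tensor, max_val: float = 1.0) -> torch.Tensor:
    """PSNR = 20 log10(max_val) - 10 log10(MSE)."""
    mse = torch.mean((pred - target) ** 2)
    return 20 * torch.log10(torch.tensor(max_val)) - 10 * torch.log10(mse)
```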
Category-specific
Separate models trained for cars and chairs
Reconstruction quality improves when two input views are provided at test time
Category-agnostic
Single model for the 13 largest ShapeNet categories
Unseen categories
Multiple objects
- Discussion
- Reference
[1] Yu, Alex, Vickie Ye, Matthew Tancik, and Angjoo Kanazawa. "pixelNeRF: Neural Radiance Fields from One or Few Images." CVPR 2021.