Hierarchical Architecture Optimization for Efficient Transformer-based Monte Carlo Denoising
Hämtar...
Ladda ner
Publicerad
Författare
Typ
Examensarbete för masterexamen
Master's Thesis
Master's Thesis
Modellbyggare
Tidskriftstitel
ISSN
Volymtitel
Utgivare
Sammanfattning
Physically-based Monte Carlo rendering heavily relies on deep learning denoisers
to reconstruct photorealistic images from low Sample-Per-Pixel (SPP) inputs. Re
cently, the Joint Self-Attention (JSA) framework, a Transformer-base method uti
lizing auxiliary G-buffers, has achieved state-of-the-art visual fidelity. However, the
quadratic computational complexity of standard multi-head self-attention disquali
fies it from interactive or real-time rendering applications. To bridge this efficiency
gap, this thesis proposes a highly efficient denoising architecture by systematically
eliminating the computational redundancy of standard JSA frameworks. The pro
posed optimization strategy is executed on two architectural levels. At the micro
level, we adapt the Single-Head Joint Self-Attention (SH-JSA) module with a partial
channel ratio to preserve high-frequency structural features while reducing compu
tational cost. Furthermore, guided by hardware profiling, we replace early-stage
attention blocks with optimized Convolutional Neural Networks (CNNs). At the
macro-level, we progressively streamline the global U-Net structure by implement
ing a symmetric decoder, reducing the network depth to three stages, and expanding
the input patch size. Extensive evaluations demonstrate a leap in efficiency. For
high-resolution 1024 × 1024 inputs, the proposed framework reduces the network
parameter count by 86.6% and slashes the inference latency by 91.6%. While this
extreme acceleration introduces a minor 4.9% drop in Peak Signal-to-Noise Ratio
(PSNR), it successfully transitions the JSA-based denoiser from offline computation
to interactive frame rates.
Beskrivning
Ämne/nyckelord
Computer graphics, Monte Carlo rendering, real-time rendering, Monte Carlo denoising, transformer-based denoising.
