CUDA Path Tracer

University of Pennsylvania, CIS 5650: GPU Programming and Architecture, Project 3

Michael Rabbitz
- LinkedIn
Tested on: Windows 10, i7-9750H @ 2.60GHz 32GB, RTX 2060 6GB (Personal)

Part 1: Introduction

This project takes an incomplete skeleton of a C++/CUDA path tracer, transforms it to a functional state by implementing core features, then enhances it by implementing physically-based visual improvements, mesh enhancements, and performance optimizations.

Path Tracing Overview

Path tracing is a sophisticated rendering technique in computer graphics designed to achieve photorealistic images by accurately simulating the behavior of light in a scene. This technique flips the conventional perspective on light: instead of light traveling from sources to the eye, rays are cast from the camera into the scene, exploring how light interacts with the surfaces of objects in the scene and determining how those surfaces are illuminated. The Bidirectional Scattering Distribution Function (BSDF) plays a key role in this process, governing how light scatters when it hits a surface, accounting for both reflection and refraction.

As rays bounce off surfaces, they generate multiple reflections and/or refractions until they reach a light source, exit the scene, or hit the bounce limit. Path tracing employs Monte Carlo integration to estimate pixel colors by averaging many random samples, enhancing image quality at the cost of increased rendering time. A standout feature of path tracing is its ability to simulate global illumination, capturing the complex interplay of light as it bounces between surfaces. While it delivers high-quality results and handles a wide range of materials and lighting conditions, path tracing is computationally demanding and produces noisy images at low sample counts; the noise falls off roughly with the square root of the number of samples per pixel.

Global Illumination = Direct Illumination + Indirect Illumination

Left: A light ray directly illuminates the point on the floor seen by the viewer, with no intermediate bounces.
Right: A light ray indirectly illuminates the point on the floor seen by the viewer, via one intermediate bounce.

Bidirectional Scattering Distribution Function (BSDF)

A BSDF is a mathematical model that describes how light is scattered when it interacts with a surface. It provides a way to characterize the reflection and transmission of light at a surface by specifying how incoming light is redistributed into outgoing directions. The BSDF can be divided into two components: the Bidirectional Reflectance Distribution Function (BRDF), which describes reflection, and the Bidirectional Transmittance Distribution Function (BTDF), which describes transmission. BSDFs are essential in rendering and computer graphics, as they help simulate realistic materials and lighting interactions, contributing to the overall realism of rendered images.
Image Source

"Scattering" in Path Tracing using BSDFs

This series illustrates a single ray cast from the eye in path tracing (yellow ray) as it focuses on a point on the floor. It demonstrates how global illumination at that point is achieved through multiple bounces of the ray, interacting with surfaces based on their BSDFs. Each bounce scatters additional rays according to the surface's BSDF, creating many ray paths. The contribution to the illumination of the initial point decreases with each bounce and ultimately concludes when all ray paths either hit a light source, exit the scene, or reach the bounce/depth limit.

Part 2: Core Features

Ideal Diffuse BSDF Evaluation

Ideal Diffuse (Lambertian) BSDF evaluation models perfectly diffuse surfaces, which scatter light equally in all directions over the hemisphere around the surface normal. The light a surface point receives from a given direction is proportional to the cosine of the angle between that direction and the surface normal, following Lambert's cosine law, so directions closer to the normal contribute more. In this path tracer, the evaluation is computed by randomly sampling directions in the hemisphere around the surface normal with a cosine-weighted distribution, which concentrates samples where the contribution is largest and reduces noise.
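The cosine-weighted sampling described above can be sketched as follows. This is a minimal host-side C++ illustration, not the project's actual kernel code: the `Vec3` type, its helpers, and the name `cosineSampleHemisphere` are assumptions made for the example, and `u1`, `u2` stand for two uniform random numbers in [0, 1].

```cpp
#include <cassert>
#include <cmath>

// Minimal 3D vector type for illustration (hypothetical; a real renderer
// would typically use a math library instead).
struct Vec3 { float x, y, z; };

Vec3 operator+(Vec3 a, Vec3 b) { return {a.x + b.x, a.y + b.y, a.z + b.z}; }
Vec3 operator*(float s, Vec3 v) { return {s * v.x, s * v.y, s * v.z}; }
float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }
Vec3 cross(Vec3 a, Vec3 b) {
    return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
}
Vec3 normalize(Vec3 v) { return (1.0f / std::sqrt(dot(v, v))) * v; }

// Maps two uniform random numbers to a unit direction in the hemisphere
// around the unit normal n, distributed with pdf = cos(theta) / pi.
Vec3 cosineSampleHemisphere(Vec3 n, float u1, float u2) {
    assert(u1 >= 0.0f && u1 <= 1.0f && u2 >= 0.0f && u2 <= 1.0f);
    const float kPi = 3.14159265358979f;
    float r = std::sqrt(u1);                // radius of the point on the unit disk
    float phi = 2.0f * kPi * u2;            // angle around the normal
    float cosTheta = std::sqrt(1.0f - u1);  // height of the sample above the surface

    // Build a tangent frame (t, b, n) from an axis that is not parallel to n.
    Vec3 helper = std::fabs(n.x) < 0.5f ? Vec3{1.0f, 0.0f, 0.0f} : Vec3{0.0f, 1.0f, 0.0f};
    Vec3 t = normalize(cross(n, helper));
    Vec3 b = cross(n, t);

    return (r * std::cos(phi)) * t + (r * std::sin(phi)) * b + cosTheta * n;
}
```

With this distribution, the cosine term of the rendering equation and the 1/pi of a Lambertian BRDF cancel against the sample's probability density, so a path's throughput is simply multiplied by the surface albedo at each diffuse bounce.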

Lambertian Reflectance

Image Source

All objects besides the light have diffuse surface materials

Perfect Specular Reflection BSDF Evaluation

Perfect Specular Reflection (Mirrored) BSDF evaluation models surfaces that reflect light in a single, mirror-like direction. Incoming light rays are reflected at an angle equal to the incident angle relative to the surface normal, creating sharp reflections without any scattering. In this path tracer, the reflection is computed by reflecting the incoming ray about the surface normal, effectively simulating the behavior of ideal mirrors.
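Reflecting a ray about the surface normal comes down to one formula, r = i - 2 (i · n) n, which is the same convention used by the GLSL `reflect` function. The sketch below is an illustrative C++ version; the `Vec3` type and helper names are assumptions for the example rather than the project's own code.

```cpp
#include <cassert>
#include <cmath>

// Minimal 3D vector type for illustration (hypothetical).
struct Vec3 { float x, y, z; };

Vec3 operator-(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
Vec3 operator*(float s, Vec3 v) { return {s * v.x, s * v.y, s * v.z}; }
float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }

// Mirrors the incoming direction `incident` (pointing toward the surface)
// about the unit surface normal `normal`.
Vec3 reflect(Vec3 incident, Vec3 normal) {
    assert(std::fabs(dot(normal, normal) - 1.0f) < 1e-4f);  // normal must be unit length
    return incident - (2.0f * dot(incident, normal)) * normal;
}
```

For example, a ray travelling along (1, -1, 0) that hits a floor with normal (0, 1, 0) leaves along (1, 1, 0): the component along the normal flips sign and the tangential component is unchanged.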

Mirrored Reflectance

Image Source

Perfect Specular Sphere + Cuboid	Perfect Specular Floor

Stochastic Sampled Antialiasing

Stochastic Sampled Antialiasing enhances the visual quality of rendered images by reducing aliasing artifacts through jittering ray directions within a single pixel. In this implementation, rays are generated from the camera's position and directed toward the scene, incorporating a random offset to the pixel coordinates. When stochastic sampling is enabled, each ray is slightly perturbed within the pixel range, creating a more varied sampling pattern. This randomization averages out pixel colors over multiple samples, resulting in smoother transitions and improved image fidelity.
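The jittering step can be illustrated with a short host-side C++ sketch. The `Vec2` type and the `samplePixel` function are hypothetical names introduced for this example, and `std::mt19937` stands in for whatever random number generator the GPU code uses.

```cpp
#include <random>

// Sub-pixel sample position in continuous image coordinates (hypothetical type).
struct Vec2 { float x, y; };

// Returns the point inside pixel (px, py) that the camera ray for this
// iteration should pass through. With antialiasing off it is always the
// pixel center; with it on, a fresh random offset is drawn on each axis.
Vec2 samplePixel(int px, int py, bool antialias, std::mt19937& rng) {
    if (!antialias) {
        return {px + 0.5f, py + 0.5f};
    }
    std::uniform_real_distribution<float> offset(0.0f, 1.0f);
    float dx = offset(rng);
    float dy = offset(rng);
    return {px + dx, py + dy};
}
```

Because a path tracer already averages many iterations per pixel, drawing a different offset on every iteration yields antialiasing without tracing any extra rays.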

Antialiasing OFF - Shooting a ray through the center of each pixel
The ray either hits yellow or gray. The pixel gets the associated color, leading to an image with jagged edges.

Image Source

Antialiasing ON - Shooting multiple rays in the space of a single pixel
This example shows 25 ray samples taken for a single pixel (the top-left pixel in the above image), where each sample either returns yellow or gray. The color is averaged over all samples taken.

Image Source

Antialiasing OFF - Sharp corners

Antialiasing ON - Sharp corners

Path Continuation/Termination using Stream Compaction

In the path tracer implementation, Path Continuation/Termination is handled using stream compaction with thrust::stable_partition. During each iteration, rays (or path segments) are traced through the scene, and after each bounce, path segments that have not yet terminated are compacted in memory to improve computational efficiency. Path segments that either hit light sources or escape the scene (without hitting any objects) are removed from the pool of active path segments.

This use of stream compaction reduces wasted computation on terminated segments and makes more efficient use of GPU resources. By focusing only on active path segments, the path tracer optimizes workload distribution, ensuring that only relevant segments contribute to the image. This approach enhances the scalability and efficiency of the path tracer, especially in complex scenes or deep bounce scenarios.
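The idea can be shown on the CPU with `std::stable_partition`, which takes the same iterator range and predicate arguments as `thrust::stable_partition`. The sketch below is an illustration under assumptions: the `PathSegment` fields and the `compactPaths` function are hypothetical and not taken from the project source.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical, simplified path segment: which pixel it belongs to and how
// many bounces it may still take (0 means the path has terminated).
struct PathSegment { int pixelIndex; int remainingBounces; };

// Moves the still-active segments among the first `numActive` entries to the
// front, keeping their relative order, and returns the new active count.
// Terminated segments stay in the buffer, behind the active ones.
std::size_t compactPaths(std::vector<PathSegment>& paths, std::size_t numActive) {
    assert(numActive <= paths.size());
    auto first = paths.begin();
    auto last = first + static_cast<std::ptrdiff_t>(numActive);
    auto firstTerminated = std::stable_partition(
        first, last,
        [](const PathSegment& p) { return p.remainingBounces > 0; });
    return static_cast<std::size_t>(firstTerminated - first);
}
```

Calling this after every bounce and launching the next kernel over only the returned count is what keeps threads from being spent on finished paths. Partitioning, rather than erasing, keeps terminated segments in the buffer so their accumulated color can still be gathered into the image at the end of the iteration.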

Path Segments contiguous in memory by Material before BSDF Evaluation and Shading

Sorting path segments by material before BSDF evaluation helps improve memory coherence and efficiency during shading by grouping similar materials together. This allows the path tracer to batch shading operations, reducing divergence in GPU kernels, especially in scenes with many different materials. When path segments are contiguous in memory by material, similar shading tasks (e.g., diffuse, reflective, or refractive) are processed more efficiently.

However, this sorting step can introduce overhead in simpler scenes with few materials, where the cost of sorting outweighs the benefits. In such cases, sorting adds computational expense without significantly improving performance, leading to slower runtimes. The advantage of sorting is most noticeable in complex scenes with a diverse range of materials.
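A CPU-side sketch of the reordering is given below. It assumes that each path segment has a matching intersection record carrying a material ID; the struct layouts and the `sortByMaterial` name are hypothetical. On the GPU, a key-value sort such as `thrust::sort_by_key` would be a natural way to do the same thing, though the source does not say which call the project uses.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical, simplified records: one intersection per path segment.
struct Intersection { int materialId; float t; };
struct PathSegment { int pixelIndex; };

// Reorders both arrays in lockstep so that entries with the same material
// become contiguous in memory.
void sortByMaterial(std::vector<Intersection>& isects, std::vector<PathSegment>& paths) {
    assert(isects.size() == paths.size());

    // Sort an index permutation by material ID instead of the records themselves.
    std::vector<std::size_t> order(isects.size());
    std::iota(order.begin(), order.end(), std::size_t{0});
    std::stable_sort(order.begin(), order.end(),
                     [&isects](std::size_t a, std::size_t b) {
                         return isects[a].materialId < isects[b].materialId;
                     });

    // Apply the same permutation to both arrays.
    std::vector<Intersection> sortedIsects;
    std::vector<PathSegment> sortedPaths;
    sortedIsects.reserve(order.size());
    sortedPaths.reserve(order.size());
    for (std::size_t i : order) {
        sortedIsects.push_back(isects[i]);
        sortedPaths.push_back(paths[i]);
    }
    isects.swap(sortedIsects);
    paths.swap(sortedPaths);
}
```

The sort runs once per bounce over every active segment, which is the overhead that can outweigh the benefit in scenes with only a handful of materials.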

Part 3: Physically-based Visual Improvements

Dielectric BSDF Evaluation

Dielectric BSDF Evaluation models the behavior of materials that exhibit both reflective and refractive properties, such as glass or water. In this implementation, the evaluation begins by determining the index of refraction based on whether the incoming ray hits the front or back face of the surface. Using Snell's Law, the algorithm computes the direction of the refracted ray, while also considering Fresnel effects, which dictate that some portion of the incoming light will be reflected. If total internal reflection occurs, the ray is reflected instead of refracted. This approach allows for a realistic simulation of light interactions at boundaries, capturing the subtleties of refraction and reflection based on the angle of incidence and the material's properties.
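The steps above (choosing the indices of refraction, applying Snell's Law, handling total internal reflection, and picking reflection or refraction by the Fresnel term) can be sketched as follows. This is an illustration under assumptions, not the project's code: it uses Schlick's approximation for the Fresnel reflectance, which the source does not confirm, assumes the surrounding medium is air (index 1.0), and uses hypothetical names (`Vec3`, `schlick`, `scatterDielectric`). The parameter `u` is a uniform random number in [0, 1).

```cpp
#include <cassert>
#include <cmath>

// Minimal 3D vector type for illustration (hypothetical).
struct Vec3 { float x, y, z; };

Vec3 operator+(Vec3 a, Vec3 b) { return {a.x + b.x, a.y + b.y, a.z + b.z}; }
Vec3 operator-(Vec3 v) { return {-v.x, -v.y, -v.z}; }
Vec3 operator*(float s, Vec3 v) { return {s * v.x, s * v.y, s * v.z}; }
float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }

// Schlick's approximation of the Fresnel reflectance.
float schlick(float cosTheta, float etaI, float etaT) {
    float r0 = (etaI - etaT) / (etaI + etaT);
    r0 *= r0;
    return r0 + (1.0f - r0) * std::pow(1.0f - cosTheta, 5.0f);
}

// wi: unit direction the ray is travelling in; geomNormal: unit outward
// surface normal; ior: index of refraction of the material.
// Returns the scattered (reflected or refracted) direction.
Vec3 scatterDielectric(Vec3 wi, Vec3 geomNormal, float ior, float u) {
    assert(ior > 0.0f);
    // Front face: the ray travels against the outward normal and enters the material.
    bool entering = dot(wi, geomNormal) < 0.0f;
    Vec3 n = entering ? geomNormal : -geomNormal;
    float etaI = entering ? 1.0f : ior;  // index on the incident side
    float etaT = entering ? ior : 1.0f;  // index on the transmitted side
    float eta = etaI / etaT;

    float cosI = -dot(wi, n);
    Vec3 reflected = wi + (2.0f * cosI) * n;

    // Snell's Law: sin^2(thetaT) = eta^2 * sin^2(thetaI).
    float sin2T = eta * eta * (1.0f - cosI * cosI);
    if (sin2T > 1.0f) {
        return reflected;  // total internal reflection
    }
    float cosT = std::sqrt(1.0f - sin2T);

    // Evaluate Schlick with the angle on the lower-index side of the boundary.
    float fresnel = schlick(etaI > etaT ? cosT : cosI, etaI, etaT);
    if (u < fresnel) {
        return reflected;
    }
    return eta * wi + (eta * cosI - cosT) * n;  // refracted direction
}
```

Choosing one of the two directions at random with probability equal to the Fresnel reflectance keeps a single ray per bounce, and the expected result matches splitting the light between reflection and refraction.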

Glass Cuboids	Glass Sphere

Metal BSDF Evaluation

Metal BSDF evaluation models the reflective properties of metallic surfaces, capturing both roughness and Fresnel effects. When light hits a metal surface, most of it is reflected; the portion that enters the material is absorbed rather than transmitted, which reduces the overall brightness of the reflection. Surface roughness spreads the reflected light around the mirror direction, producing softer highlights. The evaluation takes into account the angle of incidence and the roughness of the surface to create realistic reflections, balancing a metal's shiny appearance with the subtle dimming caused by absorption.

Metal Sphere + Cuboid
Rough	Rougher	Roughest

Part 4: Mesh Enhancements

OBJ Loading and Rendering

OBJ Loading and Rendering enables the loading and rendering of 3D models from OBJ files. It uses the tinyobjloader library to parse the OBJ file, extracting material and geometry data, including vertices and normals. The extracted materials are categorized into types such as dielectric, metal, diffuse, and light-emitting.

The renderer implements the Möller–Trumbore intersection algorithm to test whether a ray intersects a triangle in the scene. The algorithm solves for the distance along the ray and the barycentric coordinates of the hit, which are then used to determine the intersection point and the surface normal.
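For reference, a compact C++ version of the Möller–Trumbore test is sketched below. The `Vec3` type, the epsilon value, and the function signature are assumptions made for the example; the project's own implementation may differ in these details.

```cpp
#include <cmath>

// Minimal 3D vector type for illustration (hypothetical).
struct Vec3 { float x, y, z; };

Vec3 operator-(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }
Vec3 cross(Vec3 a, Vec3 b) {
    return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
}

// Tests the ray origin + t * dir against triangle (v0, v1, v2).
// On a hit, writes the ray distance t and the barycentric coordinates (u, v)
// of the hit point, so that hit = (1 - u - v) * v0 + u * v1 + v * v2.
bool intersectTriangle(Vec3 origin, Vec3 dir, Vec3 v0, Vec3 v1, Vec3 v2,
                       float& t, float& u, float& v) {
    const float kEps = 1e-7f;  // assumed tolerance for this sketch
    Vec3 e1 = v1 - v0;
    Vec3 e2 = v2 - v0;
    Vec3 p = cross(dir, e2);
    float det = dot(e1, p);
    if (std::fabs(det) < kEps) return false;  // ray is parallel to the triangle

    float invDet = 1.0f / det;
    Vec3 s = origin - v0;
    u = dot(s, p) * invDet;
    if (u < 0.0f || u > 1.0f) return false;

    Vec3 q = cross(s, e1);
    v = dot(dir, q) * invDet;
    if (v < 0.0f || u + v > 1.0f) return false;

    t = dot(e2, q) * invDet;
    return t > kEps;  // reject hits behind the ray origin
}
```

The same weights (1 - u - v, u, v) can be used to interpolate per-vertex normals loaded from the OBJ file, giving a smooth shading normal at the hit point.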

Stanford Bunny	Mario	Homer Simpson

OBJ Source	OBJ Source	OBJ Source

Part 5: Performance Optimizations

Bounding Volume Hierarchy (BVH)

Reference 1 Reference 2

The Bounding Volume Hierarchy (BVH) implementation uses the Surface Area Heuristic (SAH) to build a hierarchical structure over the scene's geometries, which significantly improves ray intersection performance. Each geometry is enclosed in an Axis-Aligned Bounding Box (AABB), calculated for each supported shape: triangles, cubes, and spheres. The BVH nodes store the AABBs and reference the geometries, while the subdivision process partitions the geometries based on their centroids. The SAH guides this subdivision by evaluating candidate split positions and axes and choosing the one that minimizes the expected cost of ray-object intersection tests, weighing the surface area of each child's bounding box (a proxy for how likely a ray is to hit it) against the number of geometries it contains.
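The cost that the SAH minimizes can be written in a few lines. The sketch below uses the common unnormalized form, in which each child's cost is its bounding-box surface area times the number of geometries it holds; this form, the `AABB` layout, and the function names are assumptions for illustration, since the source does not spell out the exact formula used.

```cpp
#include <cassert>

// Minimal types for illustration (hypothetical).
struct Vec3 { float x, y, z; };
struct AABB { Vec3 lo, hi; };  // minimum and maximum corners

// Surface area of an axis-aligned box.
float surfaceArea(const AABB& box) {
    float dx = box.hi.x - box.lo.x;
    float dy = box.hi.y - box.lo.y;
    float dz = box.hi.z - box.lo.z;
    return 2.0f * (dx * dy + dy * dz + dz * dx);
}

// SAH cost of a candidate split: the chance of a ray hitting a child scales
// with its surface area, and the work done there scales with its geometry count.
float sahCost(const AABB& left, int leftCount, const AABB& right, int rightCount) {
    assert(leftCount >= 0 && rightCount >= 0);
    return leftCount * surfaceArea(left) + rightCount * surfaceArea(right);
}

// A node is only worth splitting if the best split is cheaper than testing
// all of its geometries directly as a leaf.
bool shouldSplit(float bestSplitCost, const AABB& parent, int parentCount) {
    return bestSplitCost < parentCount * surfaceArea(parent);
}
```

A builder would evaluate `sahCost` for each candidate axis and split position, keep the cheapest, and stop subdividing when `shouldSplit` returns false.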

This hierarchical organization allows the BVH to reduce the number of geometry checks during ray tracing. The algorithm first tests for intersections with the bounding boxes of BVH nodes, enabling early exits if a node is not hit. If a leaf node is reached, it checks for intersections with the individual geometries it contains. This efficient approach to spatial data organization results in faster rendering times, making it ideal for complex scenes with numerous geometries.
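The bounding-box test that enables these early exits is commonly implemented with the slab method, sketched below. The types, the function name, and the choice to pass a precomputed reciprocal direction are assumptions for illustration, not details taken from the project.

```cpp
#include <algorithm>

// Minimal types for illustration (hypothetical).
struct Vec3 { float x, y, z; };
struct AABB { Vec3 lo, hi; };  // minimum and maximum corners

// Slab test: intersects the ray with the three pairs of axis-aligned planes
// and checks that the resulting parameter intervals overlap.
// invDir holds 1 / direction per component; tMax is the closest hit found so
// far, so boxes that lie entirely behind it are skipped as well.
bool intersectAABB(Vec3 origin, Vec3 invDir, const AABB& box, float tMax) {
    float t1 = (box.lo.x - origin.x) * invDir.x;
    float t2 = (box.hi.x - origin.x) * invDir.x;
    float tNear = std::min(t1, t2);
    float tFar = std::max(t1, t2);

    t1 = (box.lo.y - origin.y) * invDir.y;
    t2 = (box.hi.y - origin.y) * invDir.y;
    tNear = std::max(tNear, std::min(t1, t2));
    tFar = std::min(tFar, std::max(t1, t2));

    t1 = (box.lo.z - origin.z) * invDir.z;
    t2 = (box.hi.z - origin.z) * invDir.z;
    tNear = std::max(tNear, std::min(t1, t2));
    tFar = std::min(tFar, std::max(t1, t2));

    return tFar >= tNear && tFar > 0.0f && tNear < tMax;
}
```

During traversal, a node whose box fails this test is discarded together with its entire subtree, which is where the reduction in geometry checks comes from.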

Scene	Geometry Count	FPS - BVH OFF	FPS - BVH ON	% Speedup
Sphere + Cuboid	8	37.00	34.00	-8.1
Homer Simpson	12,006	0.50	21.00	4100
Mario	36,488	0.17	10.60	6135
Stanford Bunny	69,457	0.09	13.00	14344

The results indicate that while a BVH improves performance overall, its impact varies based on scene complexity. Without a BVH, rendering time increases linearly with the number of geometries, as each ray must check for intersections with every object. However, the overhead of the BVH can slow down simpler scenes, such as the Sphere + Cuboid example, where the costs of subdivision and traversal outweigh the benefits.

In contrast, the speedup for more complex scenes is dramatic. In the Homer Simpson scene, the FPS increases by over 4000%, while in the Mario scene, it jumps by over 6000%. The most impressive improvement is seen in the Stanford Bunny scene, with a remarkable 14,344% speedup. This highlights the efficiency of the BVH in handling highly complex geometries, as it reduces the number of intersection tests by effectively grouping and organizing the objects spatially.

Interestingly, with the BVH enabled, the Stanford Bunny scene outperforms the Mario scene (13.00 vs. 10.60 FPS) despite having nearly double the number of geometries. A likely explanation is the shape of the models: the Mario figure has thin, perpendicular limbs (T-shaped), which produce large, mostly empty bounding boxes and make BVH traversal less efficient. In contrast, the Stanford Bunny's compact, roughly spherical form allows the BVH to organize its geometry into tighter boxes, resulting in better performance.

Part 6: Running the code

The main function requires a scene description file. Call the program with one as an argument: cis565_path_tracer scenes/sphere.json. (In Visual Studio, ../scenes/sphere.json.)

If you are using Visual Studio, you can set this in the Debugging > Command Arguments section in the Project Properties. Make sure you get the path right - read the console for errors.

Controls

Esc to save an image and exit.
S to save an image. Watch the console for the output filename.
Space to re-center the camera at the original scene lookAt point.
Left mouse button to rotate the camera.
Right mouse button on the vertical axis to zoom in/out.
Middle mouse button to move the LOOKAT point in the scene's X/Z plane.
