FilterHN

Measuring Acceleration Structures

79 points

by ibobev

3 months ago

| 3 comments

| zeux.io

whizzter

3 months ago

I wrote a non-RTX on-GPU raytracer a while back (naive compared to this), and it's super interesting to read about the advances in compressing BVH structures.

But the changes also highlight a shift in focus, from just implementing this naively (RDNA3 is technically not too far removed from the naive raytracer I wrote) to something carefully engineered and optimized for memory bandwidth (with bandwidth-saving circuitry even built into the silicon?).

ahartmetz

3 months ago

Seems very likely that the hardware decompresses the data more or less on the fly. The acceleration structures are built for the hardware, arithmetic hardware is cheap (compared to memory access), and if hardware support weren't necessary, they could use the compressed structures on older hardware with new drivers.
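
To make "decompressing on the fly" concrete, here is a minimal sketch of one common way to shrink BVH nodes: storing child boxes as 8-bit offsets on a power-of-two grid anchored at the parent box. The layout, the function names, and the 4-child node width are all assumptions for illustration, not the format of any particular GPU. The decode step is just a multiply-add per coordinate, which is the kind of arithmetic that is cheap next to a memory access.

```python
import math
import struct

# Hypothetical node layout (an assumption, not a real vendor format):
# a float32 origin per axis, a signed 8-bit power-of-two exponent per axis,
# and 8-bit min/max grid coordinates for each child box.

def choose_exponents(parent_lo, parent_hi):
    """Pick a power-of-two grid step per axis so 255 steps span the parent box."""
    exps = []
    for lo, hi in zip(parent_lo, parent_hi):
        extent = max(hi - lo, 1e-30)  # avoid log2(0) for flat boxes
        exps.append(math.ceil(math.log2(extent / 255.0)))
    return exps

def quantize_box(origin, exps, lo, hi):
    """Round mins down and maxes up so the decoded box still encloses the child."""
    qlo, qhi = [], []
    for i in range(3):
        step = 2.0 ** exps[i]
        qlo.append(max(0, min(255, math.floor((lo[i] - origin[i]) / step))))
        qhi.append(max(0, min(255, math.ceil((hi[i] - origin[i]) / step))))
    return qlo, qhi

def dequantize_box(origin, exps, qlo, qhi):
    """Decode: one multiply-add per coordinate."""
    lo = [origin[i] + qlo[i] * 2.0 ** exps[i] for i in range(3)]
    hi = [origin[i] + qhi[i] * 2.0 ** exps[i] for i in range(3)]
    return lo, hi

# Box data for a 4-wide node, child pointers left out in both cases.
FULL_NODE_BYTES = struct.calcsize("<24f")        # 4 boxes * 6 float32 = 96 bytes
PACKED_NODE_BYTES = struct.calcsize("<3f3b24B")  # origin + exponents + 4 * 6 bytes = 39 bytes

parent_lo, parent_hi = (0.0, 0.0, 0.0), (10.0, 20.0, 5.0)
exps = choose_exponents(parent_lo, parent_hi)
qlo, qhi = quantize_box(parent_lo, exps, (1.3, 2.7, 0.4), (4.1, 9.9, 3.3))
print(dequantize_box(parent_lo, exps, qlo, qhi), FULL_NODE_BYTES, PACKED_NODE_BYTES)
```

The decoded box is slightly larger than the original, so traversal can only produce extra box hits, never missed ones; the trade is a few more intersection tests for far fewer bytes fetched per node.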

whizzter

3 months ago

Right, the point of raytracing extensions is that there can definitely be wins thanks to specialized circuitry.

What I do wonder about, since you mention that older chips could probably use the more optimized structures via software (after all, my naive-ish raytracer is written entirely in OpenGL and could be modified to use these structures instead): with memory being the big pain point, which hardware optimizations/specializations matter most for big gains compared to what can be done in "microcode"? Circuitry for triangle intersections and bit unpacking, presumably, but considering stack management there are probably other parts left to microcode.

vardump

3 months ago

Smaller data is where it’s at when optimizing nowadays. Less bandwidth required and higher cache hit rate.

You can compute a ton per bit transferred from DRAM. On both CPUs and GPUs.

genpfault

3 months ago
