batch rays together, decimate allocations and speedup ~3x by fjbarter · Pull Request #4 · fjbarter/PRICK.jl

fjbarter · 2026-03-19T01:03:18Z

rather large effort to batch rays together for passing to ImplicitBVH. overloads the LVT traversal algorithm to only trace 'active' rays, i.e. those that have not been terminated (hit a sink, bbox, max bounces, max length etc)

this aims to effectively eliminate allocations in the ray tracing loop: traversal caches are now properly utilised for ray tracing, and direction + position matrices no longer need to be created per traverse_rays call. the RayBatchBuffer is simply mutated in place
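To illustrate the "mutate a preallocated buffer, skip terminated rays" idea described above, here is a minimal sketch. `ToyRayBatch`, its fields and `advance!` are hypothetical names made up for this example; they are not the PR's actual `RayBatchBuffer` layout, and the only termination criterion shown is a maximum step length.

```julia
# Hypothetical stand-in for a batch buffer: everything is allocated once
# and then mutated in place on every bounce.
struct ToyRayBatch
    points::Matrix{Float64}      # 3 × n ray origins (assumed layout)
    directions::Matrix{Float64}  # 3 × n ray directions (assumed layout)
    active::BitVector            # true while a ray has not been terminated
end

ToyRayBatch(n::Integer) = ToyRayBatch(zeros(3, n), zeros(3, n), trues(n))

# Move every active ray by its hit distance t[j]; rays whose step exceeds
# max_length are deactivated instead (illustrative criterion only).
function advance!(buf::ToyRayBatch, t::AbstractVector{Float64}, max_length::Float64)
    @inbounds for j in eachindex(buf.active)
        buf.active[j] || continue            # terminated rays are skipped
        if t[j] > max_length
            buf.active[j] = false
            continue
        end
        for k in 1:3
            # p <- p + t * d, written into the existing matrix
            buf.points[k, j] = muladd(t[j], buf.directions[k, j], buf.points[k, j])
        end
    end
    return buf
end

buf = ToyRayBatch(2)
buf.directions[1, :] .= 1.0                  # both rays travel along +x
advance!(buf, [0.5, 10.0], 5.0)
```

After the call, the first ray has moved and stays active, while the second has been deactivated and left where it was. No new matrices are created inside the loop.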

strong scaling is decent but not amazing: 1000 rays -> 11.3 s on 1 thread, 3.6 s on 4 threads for a ~3.1x speedup
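For reference, the arithmetic behind those timings can be sketched as follows. The serial-fraction estimate assumes Amdahl's law applies to this workload, which is an assumption of this example rather than something measured in the PR.

```julia
t1 = 11.3       # seconds on 1 thread (from the timings above)
t4 = 3.6        # seconds on 4 threads (from the timings above)
p  = 4          # thread count

speedup    = t1 / t4        # roughly 3.1
efficiency = speedup / p    # fraction of ideal linear scaling

# Amdahl's law: speedup = 1 / (f + (1 - f) / p), solved for the serial fraction f
serial_fraction = (1 / speedup - 1 / p) / (1 - 1 / p)
```

Under that assumption the parallel efficiency comes out a little under 80%, and the implied serial fraction is somewhere around 9%.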

abhirup-roy · 2026-06-12T23:46:45Z

AK dep missing in Project.toml!

abhirup-roy

See comments

abhirup-roy · 2026-06-12T23:51:12Z

-    verts::Vector{SVector{3,Float64}}
+    verts::Vector{NTuple{3,Float64}}
    tris::Vector{NTuple{3,Int32}}
    kinds::Vector{SurfaceKind}


We can make this a BitVector seeing as there are 2 types? Maybe make into an is_sink var
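A rough sketch of what that could look like. The enum members `Wall` and `Sink` are assumed names for the two kinds; the real `SurfaceKind` definition is not shown in this diff.

```julia
# Assumed two-member enum standing in for the real SurfaceKind
@enum SurfaceKind::UInt8 Wall Sink

kinds = [Wall, Sink, Wall]

# Broadcasting a Bool-valued comparison yields a BitVector (1 bit per triangle)
is_sink = kinds .== Ref(Sink)
```

The trade-off is that a `BitVector` only works while there are exactly two kinds; adding a third later would mean going back to an enum or a small integer code.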

abhirup-roy · 2026-06-13T00:04:08Z

+        tcur = wall_t[ray_idx]
+        idxcur = wall_idx[ray_idx]
+        if (t < tcur) || ((t == tcur) && (idxcur == 0 || leaf_idx < idxcur))
+            n = triangle_unit_normal(v0, v1, v2)


I think we might get a speedup by precomputing the normals and storing in SurfaceBVH?
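Something along these lines, as a sketch only. The body of `triangle_unit_normal` here is a guess at what the existing function does, `precompute_normals` is a hypothetical helper, and where the resulting vector would live in `SurfaceBVH` is left open.

```julia
const Vec3 = NTuple{3,Float64}

sub3(a::Vec3, b::Vec3) = (a[1] - b[1], a[2] - b[2], a[3] - b[3])
cross3(a::Vec3, b::Vec3) = (a[2] * b[3] - a[3] * b[2],
                            a[3] * b[1] - a[1] * b[3],
                            a[1] * b[2] - a[2] * b[1])

# Assumed implementation: normalised cross product of two triangle edges
function triangle_unit_normal(v0::Vec3, v1::Vec3, v2::Vec3)
    n = cross3(sub3(v1, v0), sub3(v2, v0))
    inv_len = 1.0 / sqrt(n[1]^2 + n[2]^2 + n[3]^2)
    return (n[1] * inv_len, n[2] * inv_len, n[3] * inv_len)
end

# Hypothetical helper: one normal per triangle, computed once at build time
precompute_normals(verts::Vector{Vec3}, tris::Vector{NTuple{3,Int32}}) =
    [triangle_unit_normal(verts[i], verts[j], verts[k]) for (i, j, k) in tris]

verts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
tris = [(Int32(1), Int32(2), Int32(3))]
normals = precompute_normals(verts, tris)
```

The hit loop would then read `normals[leaf_idx]` instead of recomputing the cross product and square root per candidate hit, at a cost of 24 extra bytes per triangle. Whether that is a net win would need benchmarking.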

abhirup-roy · 2026-06-13T00:27:16Z

-            hit_found = true
+        tcur = sphere_t[ray_idx]
+        idxcur = sphere_idx[ray_idx]
+        if (t < tcur) || ((t == tcur) && (idxcur == 0 || Int(leaf_idx) < idxcur))


Not sure how much performance this gives but... if we change the ray_triangle_intersect negative case to Inf, we 1. get a Float64 (instead of a Union) and 2. get to use a boolean op here. In my head the gains from the boolean op add up over time?
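A sketch of the idea, using a textbook Möller-Trumbore test rather than the PR's actual intersection routine. `ray_triangle_t` and `tol` are names made up for this example.

```julia
const Vec3 = NTuple{3,Float64}

sub3(a::Vec3, b::Vec3) = (a[1] - b[1], a[2] - b[2], a[3] - b[3])
dot3(a::Vec3, b::Vec3) = a[1] * b[1] + a[2] * b[2] + a[3] * b[3]
cross3(a::Vec3, b::Vec3) = (a[2] * b[3] - a[3] * b[2],
                            a[3] * b[1] - a[1] * b[3],
                            a[1] * b[2] - a[2] * b[1])

# Always returns a Float64: the hit distance, or Inf for a miss.
function ray_triangle_t(p::Vec3, d::Vec3, v0::Vec3, v1::Vec3, v2::Vec3; tol::Float64=1e-12)
    e1 = sub3(v1, v0)
    e2 = sub3(v2, v0)
    h = cross3(d, e2)
    a = dot3(e1, h)
    abs(a) < tol && return Inf               # ray parallel to the triangle plane
    f = 1.0 / a
    s = sub3(p, v0)
    u = f * dot3(s, h)
    (u < 0.0 || u > 1.0) && return Inf       # outside first barycentric bound
    q = cross3(s, e1)
    v = f * dot3(d, q)
    (v < 0.0 || u + v > 1.0) && return Inf   # outside second barycentric bound
    t = f * dot3(e2, q)
    return t > tol ? t : Inf                 # hits behind the origin count as misses
end

tri = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
t_hit  = ray_triangle_t((0.25, 0.25, 1.0), (0.0, 0.0, -1.0), tri...)
t_miss = ray_triangle_t((2.0, 2.0, 1.0), (0.0, 0.0, -1.0), tri...)
```

With this convention a miss never passes `t < tcur`, so the caller needs no separate `nothing` check before the comparison, provided `tcur` itself starts at `Inf` and ties at `Inf` are not treated as hits.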

abhirup-roy · 2026-06-13T00:35:13Z

+@inline add3(a::NTuple{3,Float64}, b::NTuple{3,Float64}) = (a[1] + b[1], a[2] + b[2], a[3] + b[3])
+@inline sub3(a::NTuple{3,Float64}, b::NTuple{3,Float64}) = (a[1] - b[1], a[2] - b[2], a[3] - b[3])
+@inline mul3(a::NTuple{3,Float64}, s::Float64) = (a[1] * s, a[2] * s, a[3] * s)
+@inline madd3(a::NTuple{3,Float64}, s::Float64, b::NTuple{3,Float64}) = (a[1] + s * b[1], a[2] + s * b[2], a[3] + s * b[3])


Yk there's a builtin func called muladd (found out by accident when i was showing someone what mullah means in arabic)

I reckon we can use it here and in dot3 and cross3
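For example (a sketch, not benchmarked): `muladd(x, y, z)` computes `x*y + z` and lets the compiler fuse it into an FMA instruction where the hardware supports that, so results can differ from the unfused version in the last bit.

```julia
const Vec3 = NTuple{3,Float64}

# a + s * b, one muladd per component
@inline madd3(a::Vec3, s::Float64, b::Vec3) =
    (muladd(s, b[1], a[1]), muladd(s, b[2], a[2]), muladd(s, b[3], a[3]))

# a . b as a chain of muladds
@inline dot3(a::Vec3, b::Vec3) = muladd(a[1], b[1], muladd(a[2], b[2], a[3] * b[3]))

# a x b, each component written as x*y + (-(z*w))
@inline cross3(a::Vec3, b::Vec3) = (muladd(a[2], b[3], -(a[3] * b[2])),
                                    muladd(a[3], b[1], -(a[1] * b[3])),
                                    muladd(a[1], b[2], -(a[2] * b[1])))

r = madd3((1.0, 2.0, 3.0), 2.0, (1.0, 1.0, 1.0))
s = dot3((1.0, 2.0, 3.0), (4.0, 5.0, 6.0))
c = cross3((1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
```

The compiler may already emit fused instructions for the plain versions in some contexts, so the gain is worth measuring before committing to it.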

abhirup-roy · 2026-06-13T01:05:13Z

    tmin = -Inf
    tmax = Inf
    @inbounds for k in 1:3
        dk = d[k]


Lowkey i think we're overthinking on this function? We can make it like

    invd = 1.0 / d[k]
    t1 = (mins[k] - p[k]) * invd
    t2 = (maxs[k] - p[k]) * invd
    tmin = max(tmin, min(t1, t2))
    tmax = min(tmax, max(t1, t2))

and then after the loop:

    if tmax < max(tmin, eps)
        return nothing  # or Inf if you like my previous idea
    end
    return tmin > eps ? tmin : tmax

cus if the ray is exactly parallel to the axis, invd will be Inf ygm?

Hopefully this means it will parallelise better?
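Put together as a runnable function, the branch-free slab test might look like the sketch below. `ray_aabb_t` and `tol` are names chosen for this example, and it uses the `Inf`-for-miss convention rather than `nothing`. One caveat worth checking before adopting it: if the ray origin lies exactly on a slab plane while `d[k] == 0`, then `0 * Inf` gives `NaN`, and Julia's `min`/`max` propagate `NaN`, so that edge case may still need a guard.

```julia
const Vec3 = NTuple{3,Float64}

# Slab test without a per-axis parallel-ray branch.
# Returns the nearest positive hit distance, or Inf for a miss.
function ray_aabb_t(p::Vec3, d::Vec3, mins::Vec3, maxs::Vec3; tol::Float64=1e-12)
    tmin = -Inf
    tmax = Inf
    @inbounds for k in 1:3
        invd = 1.0 / d[k]                    # ±Inf when d[k] == 0
        t1 = (mins[k] - p[k]) * invd
        t2 = (maxs[k] - p[k]) * invd
        tmin = max(tmin, min(t1, t2))        # latest slab entry
        tmax = min(tmax, max(t1, t2))        # earliest slab exit
    end
    tmax < max(tmin, tol) && return Inf      # slabs do not overlap, or box is behind the ray
    return tmin > tol ? tmin : tmax          # origin inside the box: report the exit distance
end

lo, hi = (0.0, 0.0, 0.0), (1.0, 1.0, 1.0)
t_outside = ray_aabb_t((-1.0, 0.5, 0.5), (1.0, 0.0, 0.0), lo, hi)
t_inside  = ray_aabb_t((0.5, 0.5, 0.5), (1.0, 0.0, 0.0), lo, hi)
t_miss    = ray_aabb_t((-1.0, 2.0, 0.5), (1.0, 0.0, 0.0), lo, hi)
```

All three test rays here are axis-parallel, which exercises the `invd = ±Inf` path the comment above relies on.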

batch rays together, decimate allocations and speedup ~3x

6af0643

fjbarter wants to merge 1 commit into main from batch_tracing
