YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

rayv9_learnt_baseline_snap

S23DR 2026 submission.

Pipeline

  1. v1d heatmap model β€” 7-channel (gestalt+depth+ade) CNN predicts vertex heatmaps per view; spatial NMS across views β†’ ~80 rays/scene
  2. v9 voxel model β€” rays + SfM points β†’ 6-channel 64Γ—32Γ—64 voxel volume β†’ RayVoxelTransformerV7a β†’ hybrid ray/dense NMS β†’ world-space vertices
  3. Learned baseline β€” point fusion (COLMAP + depth) β†’ EdgeDepthSegmentsModel β†’ wireframe (vertices + edges)
  4. Snap+unmatched β€” midpoint-snap baseline verts to nearest v9 vert within 1m; append unmatched v9 verts

Validation results (100 scenes, hoho22k_2026_trainval)

Method Mean HSS Median HSS
Learned baseline 0.352 0.369
+ Snap to v9 0.411 0.453

Vertex F1@0.5=0.494, F1@1.0=0.685

Checkpoints

File Description Size
v1d_checkpoint.pt v1d heatmap model (GroupNorm CNN) 12M
v9_checkpoint.pt RayVoxelTransformerV7a 22M
baseline_checkpoint.pt EdgeDepthSegmentsModel 102M
Downloads last month
370
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support