YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
rayv9_learnt_baseline_snap
S23DR 2026 submission.
Pipeline
- v1d heatmap model β 7-channel (gestalt+depth+ade) CNN predicts vertex heatmaps per view; spatial NMS across views β ~80 rays/scene
- v9 voxel model β rays + SfM points β 6-channel 64Γ32Γ64 voxel volume β RayVoxelTransformerV7a β hybrid ray/dense NMS β world-space vertices
- Learned baseline β point fusion (COLMAP + depth) β EdgeDepthSegmentsModel β wireframe (vertices + edges)
- Snap+unmatched β midpoint-snap baseline verts to nearest v9 vert within 1m; append unmatched v9 verts
Validation results (100 scenes, hoho22k_2026_trainval)
| Method | Mean HSS | Median HSS |
|---|---|---|
| Learned baseline | 0.352 | 0.369 |
| + Snap to v9 | 0.411 | 0.453 |
Vertex F1@0.5=0.494, F1@1.0=0.685
Checkpoints
| File | Description | Size |
|---|---|---|
v1d_checkpoint.pt |
v1d heatmap model (GroupNorm CNN) | 12M |
v9_checkpoint.pt |
RayVoxelTransformerV7a | 22M |
baseline_checkpoint.pt |
EdgeDepthSegmentsModel | 102M |
- Downloads last month
- 370
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support