pallet-dope-cropaug-truncation (DOPE, crop-aug, truncation-best)
νλ νΈ 6D ν¬μ¦ μΆμ μ© DOPE λͺ¨λΈ. crop + padding μ¦κ°μΌλ‘ fine-tune νμ¬, μ€μ μΉ΄λ©λΌκ° μ’μ°λ‘ ν¨λνλ©° νλ νΈκ° νλ©΄ λ°μΌλ‘ μ리λ(truncation) μν©μμ κ°κ±΄νκ² λμνλ κ²μ΄ νΉμ§.
- best checkpoint:
final_net_epoch_0180.pth(λμ epoch 180) - convention: camera-facing 0123 (0~3 μλ©΄, {0,1,4,5}=μ / {2,3,6,7}=μλ, 8=centroid)
- backbone: VGG-19, 9 belief maps + 16 affinity fields
- input: 448Γ448, sigma=4.0
μ μ΄ λͺ¨λΈμΈκ° (truncation best)
baseline DOPE λλΉ real truncation νκ²½μμ:
metric baseline crop-aug (this)
βββββββββββββββββββββββββββββββββββββββββββββ
detection rate 13% 94%
PnP success 23% 99%
YOLO-pose μ crop+padding μ¦κ° λ°©μμ DOPE λ‘ μ΄μνμ¬ μ»μ κ²°κ³Ό.
νμ΅ μ€μ
net_path : dope_cropaug_ft_s1/final_net_epoch_0150.pth (μ΄μ΄μ fine-tune)
epochs : 180 (λμ λͺ©ν, base 150 + μΆκ° 30)
lr : 5e-5
batchsize : 4
imagesize : 448
sigma : 4.0
seed : 3709
data : capturepallet{02~09,cad} + capturenight{04~09}
+ forklift_raw + truncation_crops_dope/ft_real (manual GT)
μ 체 νμ΅ μΈμλ header.txt μ°Έμ‘°.
νκ° (synthetic val, 200 frames)
eval_summary.json:
PCK@3px : 0.224 PnP success : 0.555
PCK@5px : 0.277 reproj median: 164.6 px*
PCK@10px : 0.416 volume ratio median: 0.836
* reproj μμΉλ evaluate_on_val μ convention λΆμΌμΉ λ²κ·Έλ‘ κ³Όλνκ°λ¨ (μ€μ reproj λ ν μ리μ px). κ²μΆλ₯ Β·PnP success λ μ λ’° κ°λ₯. truncation κ°κ±΄μ±μ real νκ²½(μ ν)μμ νμΈ.
μ¬μ©
import torch
# DOPE DreamNetwork μ μλ Deep_Object_Pose/common/models.py μ°Έμ‘°
state = torch.load("final_net_epoch_0180.pth", map_location="cpu")
model.load_state_dict(state)
μ£Όμ
- μ΄ λͺ¨λΈμ λ
Όλ¬Έμ© truncation κ°κ±΄μ± νΈλ μ°μΆλ¬Ό. challenge(κ³Όμ μ©) νΈλμ
pallet-dope-challenge0123-ft-manual/pallet-dope-challengenightμλ λ³κ°. - YOLO backend μ νΌμ© μ R_fix=diag(-1,1,-1) 보μ νμ(+Z λΆνΈ λ°λ). DOPE λ¨λ μ λΆνμ.
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support