jsanzolac/qwen3_emb_512_packed
LoRA drifts on top of the frozen BPE-GloVe-512 embeddings (jsanzolac/bpe_glove_512), distilled
from Qwen/Qwen3-Embedding-8B (MRL-truncated to 512 dims).
Variant 1 loss: λ_c·InfoNCE + λ_D·||ρ_T - ρ_S||_F² with λ_c=1.0, λ_D=0.1.
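For readers who want to see the shape of this objective, here is a minimal NumPy sketch. It is not the training code from this repo. It assumes that ρ_T and ρ_S are the in-batch cosine-similarity matrices of the teacher and student embeddings, and that InfoNCE pairs student row i with teacher row i using in-batch negatives. The function names and the `temperature` value are hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit L2 norm."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def info_nce(student, teacher, temperature=0.05):
    """InfoNCE with in-batch negatives: student row i should match teacher row i.

    The temperature is an assumed value, not taken from this card.
    """
    logits = l2_normalize(student) @ l2_normalize(teacher).T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_probs)))


def similarity_matrix(x):
    """Pairwise cosine similarities within a batch (assumed meaning of rho)."""
    x = l2_normalize(x)
    return x @ x.T


def variant1_loss(student, teacher, lambda_c=1.0, lambda_d=0.1, temperature=0.05):
    """lambda_c * InfoNCE + lambda_D * ||rho_T - rho_S||_F^2."""
    contrastive = info_nce(student, teacher, temperature)
    diff = similarity_matrix(teacher) - similarity_matrix(student)
    structural = float(np.sum(diff ** 2))  # squared Frobenius norm
    return lambda_c * contrastive + lambda_d * structural
```

Under these assumptions the second term vanishes when student and teacher induce the same similarity structure, so only the contrastive term remains.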
Each rank_<r>/ folder contains:
- checkpoint_final.pt
- config.json
- vectors_drifted.txt
- train_log.jsonl

Ranks shipped: [512, 256, 128, 64, 32, 16, 8, 4, 2]
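A possible way to read vectors_drifted.txt is sketched below. The file format is not documented here, so this assumes the usual GloVe text layout: one token per line followed by space-separated floats, 512 values per token. `parse_vectors` and `expected_dim` are hypothetical names, and the path in the usage comment is only an example.

```python
def parse_vectors(lines, expected_dim=512):
    """Parse GloVe-style text lines into a {token: [float, ...]} dict.

    Assumed layout: "<token> <v1> <v2> ... <v_dim>" per line.
    Lines with a different field count (blank lines, or a word2vec-style
    "count dim" header) are skipped.
    """
    vectors = {}
    for line in lines:
        parts = line.rstrip("\n").split(" ")
        if len(parts) != expected_dim + 1:
            continue
        token, values = parts[0], parts[1:]
        vectors[token] = [float(v) for v in values]
    return vectors


# Usage (example path; adjust to the rank folder you downloaded):
# with open("rank_64/vectors_drifted.txt", encoding="utf-8") as f:
#     vecs = parse_vectors(f)
```

Tokens that themselves contain spaces would not survive this split, so check a few lines of the file before relying on it.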
Base model
jsanzolac/bpe_glove_512