English
glove
lora
distillation
bpe
cl100k_base

bpe_glove_512_lora_v1

LoRA drifts on top of frozen jsanzolac/bpe_glove_512 BPE-GloVe-512 embeddings, distilled from Qwen/Qwen3-Embedding-8B (MRL-truncated to 512 dims).

Variant 1 loss: λ_c·InfoNCE + λ_D·||ρ_T - ρ_S||_F² with λ_c=1.0, λ_D=0.1.

Each rank_<r>/ folder contains:

  • checkpoint_final.pt
  • config.json
  • vectors_drifted.txt
  • train_log.jsonl

Ranks shipped: [512, 256, 128, 64, 32, 16, 8, 4, 2]

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for jsanzolac/bpe_glove_512_lora_v1

Adapter
(2)
this model

Datasets used to train jsanzolac/bpe_glove_512_lora_v1