Commit History

fix: has_no_defaults_at_init=True to avoid RecursionError in to_diff_dict
c54f415
verified

ThingsAI committed on

Update README.md
86ea512
verified

ThingsAI committed on

Update README.md
6b0d704
verified

ThingsAI committed on

feat: repetition penalty in generate_text
3a388e1
verified

ThingsAI committed on

fix: inv_freq computed at runtime, not as a buffer (avoids meta-device corruption)
38feb38
verified

ThingsAI committed on

fix: RotaryEmbedding lazy cache build (avoids garbage from meta-device init)
4e1e026
verified

ThingsAI committed on

fix: safetensors rebuilt from named_parameters only
176de1f
verified

ThingsAI committed on

fix: inv_freq persistent=False, safetensors from named_parameters only
257acf0
verified

ThingsAI committed on

fix: cast to float32 before RoPE to avoid overflow
9716697
verified

ThingsAI committed on

fix: SDPA in float32 for numerical stability
23eadd7
verified

ThingsAI committed on

fix: cast q,k to the dtype of v after RoPE, identical to train.py
434cb12
verified

ThingsAI committed on

fix: added inv_freq to safetensors
674494c
verified

ThingsAI committed on

fix: inv_freq persistent=True + included in safetensors
5f72285
verified

ThingsAI committed on

fix: weight tying via embed_tokens.weight.T, removes lm_head
d660488
verified

ThingsAI committed on

fix: safetensors from named_parameters() with NaN check
4d92421
verified

ThingsAI committed on

fix: cast q,k,v to original dtype after RoPE
b0ed64c
verified

ThingsAI committed on

fix: v.to(q.dtype) before SDPA
85e1731
verified

ThingsAI committed on

fix: exact copy of the architecture from train.py, RoPE without dtype cast
5c0671e
verified

ThingsAI committed on

fix: tie_word_embeddings false
3c77b79
verified

ThingsAI committed on

fix: explicit lm_head.weight in safetensors
3db9f1c
verified

ThingsAI committed on

fix: NaN guard in generate_text + _keys_to_ignore_on_load_missing
91da9fd
verified

ThingsAI committed on

fix: tie_weights(**kwargs) for compatibility with transformers 4.40+
e0d9139
verified

ThingsAI committed on

fix: dtype cast in SDPA + explicit tie_weights()
cd6716a
verified

ThingsAI committed on

Export Quark Instruct checkpoint
92a7d9d
verified

ThingsAI committed on

initial commit
3c3cbe3
verified

ThingsAI committed on