Commit History

Update README.md
c52514a
Running
verified

IMJONEZZ commited on

Update README.md
641b11a
verified

IMJONEZZ commited on

Update README.md
c001a1d
verified

IMJONEZZ commited on

Update README.md
64b0c0d
verified

IMJONEZZ commited on

Update LICENSE
909fa57
verified

IMJONEZZ commited on

Create LICENSE
d4f4dd2
verified

IMJONEZZ commited on

menu: loopback api backend reads 'running on this machine', not generic 'api' (the hosted Space serves the real Warden over localhost)
8d2b52f

IMJONEZZ commited on

space: ssr_mode=False on launch β€” gradio 6 SSR's Node proxy doesn't forward the raw /pty websocket
95ab054

IMJONEZZ commited on

space: transformers 5 apply_chat_template returns BatchEncoding β€” use return_dict + **enc into generate (fixes AttributeError on .shape)
aac926a

IMJONEZZ commited on

space: load Nemotron the normal way β€” transformers-native (no trust_remote_code), NO mamba_ssm/causal_conv1d. Those custom Triton CUDA kernels were the segfault (THCPModule_initExtension); native falls back to pure-torch Mamba on ZeroGPU.
0c2e095

IMJONEZZ commited on

space: adopt the org's proven NPCverse structure β€” gradio 6 Server + @app .api + app.launch() (installs ZeroGPU hooks), transformers 5 (compatible with gradio 6; trust_remote_code uses our repo's modeling). Replaces the custom engine.launch+route-surgery that broke the hooks and segfaulted.
9203831

IMJONEZZ commited on

space: drive @spaces.GPU through Gradio's API (gr.api + gradio_client), not run_in_threadpool β€” the threadpool call inits CUDA off-thread and segfaults. Matches how the org's NPCverse/the-deal spaces invoke GPU work.
13015f6

IMJONEZZ commited on

space: pin python_version 3.12 β€” ZeroGPU defaulted to 3.10.13, breaking the cp312 mamba wheels
505cbc6

IMJONEZZ commited on

space: finetuned Warden on ZeroGPU the documented way β€” bf16 + .to('cuda') module-level + @spaces.GPU(xlarge), no bitsandbytes/device_map (the actual fix). Direct run_in_threadpool call verified by the probe.
321303b

IMJONEZZ commited on

space: ZeroGPU diagnostic β€” measure CPU RAM/disk/VRAM + confirm a @spaces.GPU call works, before loading the model the documented .to('cuda') way
10c83ac

IMJONEZZ commited on

space: move /static mount ahead of gradio catch-all (styling regression fix)
ee38482

IMJONEZZ commited on

space: add /api/probe to verify live Warden generation end-to-end
6152ad5

IMJONEZZ commited on

space: revert to Gradio SDK + CPU llama-cpp-python (keeps the prize; ZeroGPU was the problem, not the SDK)
e577af2

IMJONEZZ commited on

space: load model lazily inside the GPU worker β€” module-level device_map=cuda + bnb poisoned the ZeroGPU fork's CUDA context
c1a8f99

IMJONEZZ commited on

space: route GPU calls through Gradio (gr.api + gradio_client) so the ZeroGPU per-request CUDA hooks fire
4468bdc

IMJONEZZ commited on

space: duration=120 for cold start + /api/status fast-path (causal_conv1d) probe
3af751e

IMJONEZZ commited on

space: blocking GPU generate instead of threaded streamer (hung across ZeroGPU fork); 503 on failure so the game falls back cleanly
255e227

IMJONEZZ commited on

play: reserve a bottom row + taller frame so the board prompt isn't clipped
a0de8fb

IMJONEZZ commited on

play: autosize the terminal so the full board always fits (cards were clipping)
6f42620

IMJONEZZ commited on

space: gradio 5.49 β€” transformers<5 needs hub<1.0, which gradio 6 forbids
1330ecb

IMJONEZZ commited on

space: serve via Blocks.launch (ZeroGPU handshake) + pin transformers<5 for the bnb4 checkpoint format
8051e61

IMJONEZZ commited on

space: torch 2.10 β€” mamba wheels demand triton>=3.5, impossible under torch 2.8
f68cd1e

IMJONEZZ commited on

merge Space model-dir removal
69f9134

IMJONEZZ commited on

space: load the released nf4 Warden from the hub (1GB Space LFS cap rules out in-repo weights)
caef9bc

IMJONEZZ commited on

Delete files model/* with huggingface_hub
0e95292
verified

IMJONEZZ commited on

merge Space model upload commit
e2bb0c6

IMJONEZZ commited on

gitignore: allow the shipped Space model shards
29e3001

IMJONEZZ commited on

space: load the Warden shipped in the repo (no boot download)
a6fd68b

IMJONEZZ commited on

Upload folder using huggingface_hub
2635f57
verified

IMJONEZZ commited on

space: surface mamba install diagnostics in /api/status; bnb4 prequant script
b5186d6

IMJONEZZ commited on

space: WebGL renderer + customGlyphs β€” card art was warping in the browser
34b513d

IMJONEZZ commited on

game: totem ritual now downloads the finetuned Warden GGUF
713805c

IMJONEZZ commited on

space: pin torch 2.8 + prebuilt mamba-ssm/causal-conv1d wheels
0abba5d

IMJONEZZ commited on

space: bootstrap mamba-ssm/causal-conv1d at runtime for Nemotron-H
52d29cc

IMJONEZZ commited on

space: disable gradio SSR on mount β€” the Node shell was stealing port 7860
40ab456

IMJONEZZ commited on

space: /api/status β€” expose Warden load state for ops
d49d2f3

IMJONEZZ commited on

space: merge Space creation commit (keep our tree; drop template app.py stub)
f12838c

IMJONEZZ commited on

space: adopt Space template frontmatter pins + LFS gitattributes
0365225

IMJONEZZ commited on

space: ZeroGPU port β€” Gradio SDK runtime, on-Space Warden inference
d94c85e

IMJONEZZ commited on

finetune: explicit LoRAMerge before HF export + GGUF conversion rails
d2fa034

IMJONEZZ commited on

initial commit
d391909

IMJONEZZ commited on

finetune: merge LoRA via streaming HF export (avoids 2x-model save peak)
7bd2c00

IMJONEZZ commited on

DEPLOY: switch plan to ZeroGPU (Gradio SDK, on-Space inference, no API key)
11143b6

IMJONEZZ commited on

SCRYPT: initial commit β€” game, sandbox, Warden, Space web layer
9fca766

IMJONEZZ commited on