Spaces: build-small-hackathon/Scrypt

Commit History

Update README.md

c52514a

verified

IMJONEZZ committed 6 days ago

Update README.md

641b11a
verified

IMJONEZZ committed 6 days ago

Update README.md

c001a1d
verified

IMJONEZZ committed 6 days ago

Update README.md

64b0c0d
verified

IMJONEZZ committed 6 days ago

Update LICENSE

909fa57
verified

IMJONEZZ committed 8 days ago

Create LICENSE

d4f4dd2
verified

IMJONEZZ committed 8 days ago

menu: loopback api backend reads 'running on this machine', not generic 'api' (the hosted Space serves the real Warden over localhost)

8d2b52f

IMJONEZZ committed 8 days ago

space: ssr_mode=False on launch — gradio 6 SSR's Node proxy doesn't forward the raw /pty websocket

95ab054

IMJONEZZ committed 8 days ago

space: transformers 5 apply_chat_template returns BatchEncoding — use return_dict + **enc into generate (fixes AttributeError on .shape)

aac926a

IMJONEZZ committed 8 days ago

space: load Nemotron the normal way — transformers-native (no trust_remote_code), NO mamba_ssm/causal_conv1d. Those custom Triton CUDA kernels were the segfault (THCPModule_initExtension); native falls back to pure-torch Mamba on ZeroGPU.

0c2e095

IMJONEZZ committed 8 days ago

space: adopt the org's proven NPCverse structure — gradio 6 Server + @app.api + app.launch() (installs ZeroGPU hooks), transformers 5 (compatible with gradio 6; trust_remote_code uses our repo's modeling). Replaces the custom engine.launch+route-surgery that broke the hooks and segfaulted.

9203831

IMJONEZZ committed 8 days ago

space: drive @spaces.GPU through Gradio's API (gr.api + gradio_client), not run_in_threadpool — the threadpool call inits CUDA off-thread and segfaults. Matches how the org's NPCverse/the-deal spaces invoke GPU work.

13015f6

IMJONEZZ committed 8 days ago

space: pin python_version 3.12 — ZeroGPU defaulted to 3.10.13, breaking the cp312 mamba wheels

505cbc6

IMJONEZZ committed 8 days ago

space: finetuned Warden on ZeroGPU the documented way — bf16 + .to('cuda') module-level + @spaces.GPU(xlarge), no bitsandbytes/device_map (the actual fix). Direct run_in_threadpool call verified by the probe.

321303b

IMJONEZZ committed 8 days ago

space: ZeroGPU diagnostic — measure CPU RAM/disk/VRAM + confirm a @spaces.GPU call works, before loading the model the documented .to('cuda') way

10c83ac

IMJONEZZ committed 8 days ago

space: move /static mount ahead of gradio catch-all (styling regression fix)

ee38482

IMJONEZZ committed 8 days ago

space: add /api/probe to verify live Warden generation end-to-end

6152ad5

IMJONEZZ committed 9 days ago

space: revert to Gradio SDK + CPU llama-cpp-python (keeps the prize; ZeroGPU was the problem, not the SDK)

e577af2

IMJONEZZ committed 9 days ago

space: load model lazily inside the GPU worker — module-level device_map=cuda + bnb poisoned the ZeroGPU fork's CUDA context

c1a8f99

IMJONEZZ committed 9 days ago

space: route GPU calls through Gradio (gr.api + gradio_client) so the ZeroGPU per-request CUDA hooks fire

4468bdc

IMJONEZZ committed 9 days ago

space: duration=120 for cold start + /api/status fast-path (causal_conv1d) probe

3af751e

IMJONEZZ committed 9 days ago

space: blocking GPU generate instead of threaded streamer (hung across ZeroGPU fork); 503 on failure so the game falls back cleanly

255e227

IMJONEZZ committed 9 days ago

play: reserve a bottom row + taller frame so the board prompt isn't clipped

a0de8fb

IMJONEZZ committed 9 days ago

play: autosize the terminal so the full board always fits (cards were clipping)

6f42620

IMJONEZZ committed 9 days ago

space: gradio 5.49 — transformers<5 needs hub<1.0, which gradio 6 forbids

1330ecb

IMJONEZZ committed 9 days ago

space: serve via Blocks.launch (ZeroGPU handshake) + pin transformers<5 for the bnb4 checkpoint format

8051e61

IMJONEZZ committed 9 days ago

space: torch 2.10 — mamba wheels demand triton>=3.5, impossible under torch 2.8

f68cd1e

IMJONEZZ committed 9 days ago

merge Space model-dir removal

69f9134

IMJONEZZ committed 9 days ago

space: load the released nf4 Warden from the hub (1GB Space LFS cap rules out in-repo weights)

caef9bc

IMJONEZZ committed 9 days ago

Delete files model/* with huggingface_hub

0e95292
verified

IMJONEZZ committed 9 days ago

merge Space model upload commit

e2bb0c6

IMJONEZZ committed 9 days ago

gitignore: allow the shipped Space model shards

29e3001

IMJONEZZ committed 9 days ago

space: load the Warden shipped in the repo (no boot download)

a6fd68b

IMJONEZZ committed 9 days ago

Upload folder using huggingface_hub

2635f57
verified

IMJONEZZ committed 9 days ago

space: surface mamba install diagnostics in /api/status; bnb4 prequant script

b5186d6

IMJONEZZ committed 9 days ago

space: WebGL renderer + customGlyphs — card art was warping in the browser

34b513d

IMJONEZZ committed 9 days ago

game: totem ritual now downloads the finetuned Warden GGUF

713805c

IMJONEZZ committed 9 days ago

space: pin torch 2.8 + prebuilt mamba-ssm/causal-conv1d wheels

0abba5d

IMJONEZZ committed 9 days ago

space: bootstrap mamba-ssm/causal-conv1d at runtime for Nemotron-H

52d29cc

IMJONEZZ committed 9 days ago

space: disable gradio SSR on mount — the Node shell was stealing port 7860

40ab456

IMJONEZZ committed 9 days ago

space: /api/status — expose Warden load state for ops

d49d2f3

IMJONEZZ committed 9 days ago

space: merge Space creation commit (keep our tree; drop template app.py stub)

f12838c

IMJONEZZ committed 9 days ago

space: adopt Space template frontmatter pins + LFS gitattributes

0365225

IMJONEZZ committed 9 days ago

space: ZeroGPU port — Gradio SDK runtime, on-Space Warden inference

d94c85e

IMJONEZZ committed 9 days ago

finetune: explicit LoRAMerge before HF export + GGUF conversion rails

d2fa034

IMJONEZZ committed 9 days ago

initial commit

d391909

IMJONEZZ committed 9 days ago

finetune: merge LoRA via streaming HF export (avoids 2x-model save peak)

7bd2c00

IMJONEZZ committed 9 days ago

DEPLOY: switch plan to ZeroGPU (Gradio SDK, on-Space inference, no API key)

11143b6

IMJONEZZ committed 9 days ago

SCRYPT: initial commit — game, sandbox, Warden, Space web layer

9fca766

IMJONEZZ committed 9 days ago