Commit History

Documents serverless motivation and testing instructions
5da2aef

Luke Stanley commited on

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments
469f650

Luke Stanley commited on

Fix RUNPOD_ENDPOINT_ID environment variable
ce5ad5f

Luke Stanley commited on

Add more serverless GPU endpoint setup instruction detail
b51ce5c

Luke Stanley commited on

Document serverless setup
f2e80c9

Luke Stanley commited on

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git
83e4d57

Luke Stanley commited on

Introduces worker mode env var
56e785c

Luke Stanley commited on

Make GPU detection and llama-cpp-python re-installation conditional
434144a

Luke Stanley commited on

Initialise global variables in improvement_loop function
e30b729

Luke Stanley commited on

Ensure N_GPU_LAYERS is int
9475016

Luke Stanley commited on

Expose json typed LLM interface for RunPod
976ea17

Luke Stanley commited on

RunPod Mixtral JSON output test
233efeb

Luke Stanley commited on

Add hello world RunPod setup
feeb679

Luke Stanley commited on

Update default GPU layer, temperature values
e327a9e

lukestanley commited on

Add env vars to set GPU layer count and context size, make verbose
e01e28e

lukestanley commited on

Fix gif link since LFS related gif binary purge due to HF requirments
0945e5b

lukestanley commited on

Add n_gpu_layers parameter to Llama initialization
88e6118

lukestanley commited on

Fix: Move n_ctx parameter to model setup!
358cd20

lukestanley commited on

Fix check for LLM_MODEL_PATH to avoid load error
ff938c3

lukestanley commited on

Correct Space metadata
f5a3b9d

lukestanley commited on

Add HuggingFace space metadata
994c606

lukestanley commited on

Adds Gradio app wrapper and Dockerfile
c355718

Luke Stanley commited on

Auto-downloads model if env var is not set
74d6e52

Luke Stanley commited on

Make llm_stream_sans_network actually stream to stdout
a0f49a0

Luke Stanley commited on

Default to in-memory LLM interface
ddb0d91

Luke Stanley commited on

Updates README with some realist hope
5c4f1cd

lukestanley commited on

Bug fix for undefined last_edit
f065ef3

lukestanley commited on

Add comment about adaptability
6c32632

lukestanley commited on

Documents command line usage and module import functionality
3e32321

lukestanley commited on

Print new line after LLM output end and some linting
3ebb6e1

lukestanley commited on

Make reusable via CLI, and modue, moved core logic to improvement_loop.
fbb0bdf

lukestanley commited on

Makes URL more obvious, update comments, lowers temp
a96b492

lukestanley commited on

Update comments and remove unused code
68a2a07

lukestanley commited on

Make goal more clear
e2b2995

lukestanley commited on

Removes unused line and tidying
f932ac0

lukestanley commited on

Move prompt strings and types to own file, reorder code a bit
139217d

lukestanley commited on

Move prompts to own file
550885e

lukestanley commited on

Add more detailed setup notes with GPU, fork, and other pip dependencies
2831e2c

lukestanley commited on

Removed unused lines
f84c1a6

lukestanley commited on

Slight refactor tidying
327982a

lukestanley commited on

Minor changes, typo fixes
9412837

lukestanley commited on

Adds README
428143c

Luke Stanley commited on

Add script based on my old Gist https://gist.github.com/lukestanley/881d3c30c64362126352a9cecb069a3b
ec7a11c

Luke Stanley commited on

Initial commit
6d5b429

Luke Stanley commited on