Commits · lukestanley/ChillTranslator

Documents serverless motivation and testing instructions

5da2aef

Luke Stanley commited on Feb 28

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments

469f650

Luke Stanley commited on Feb 28

Fix RUNPOD_ENDPOINT_ID environment variable

ce5ad5f

Luke Stanley commited on Feb 28

Add more serverless GPU endpoint setup instruction detail

b51ce5c

Luke Stanley commited on Feb 28

Document serverless setup

f2e80c9

Luke Stanley commited on Feb 28

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git

83e4d57

Luke Stanley commited on Feb 28

Introduces worker mode env var

56e785c

Luke Stanley commited on Feb 28

Make GPU detection and llama-cpp-python re-installation conditional

434144a

Luke Stanley commited on Feb 28

Initialise global variables in improvement_loop function

e30b729

Luke Stanley commited on Feb 28

Ensure N_GPU_LAYERS is int

9475016

Luke Stanley commited on Feb 27

Expose json typed LLM interface for RunPod

976ea17

Luke Stanley commited on Feb 27

RunPod Mixtral JSON output test

233efeb

Luke Stanley commited on Feb 27

Add hello world RunPod setup

feeb679

Luke Stanley commited on Feb 26

Update default GPU layer, temperature values

e327a9e

lukestanley commited on Feb 26

Add env vars to set GPU layer count and context size, make verbose

e01e28e

lukestanley commited on Feb 26

Fix gif link since LFS related gif binary purge due to HF requirments

0945e5b

lukestanley commited on Feb 25

Add n_gpu_layers parameter to Llama initialization

88e6118

lukestanley commited on Feb 25

Fix: Move n_ctx parameter to model setup!

358cd20

lukestanley commited on Feb 25

Fix check for LLM_MODEL_PATH to avoid load error

ff938c3

lukestanley commited on Feb 25

Correct Space metadata

f5a3b9d

lukestanley commited on Feb 25

Add HuggingFace space metadata

994c606

lukestanley commited on Feb 25

Adds Gradio app wrapper and Dockerfile

c355718

Luke Stanley commited on Feb 25

Auto-downloads model if env var is not set

74d6e52

Luke Stanley commited on Feb 25

Make llm_stream_sans_network actually stream to stdout

a0f49a0

Luke Stanley commited on Feb 25

Default to in-memory LLM interface

ddb0d91

Luke Stanley commited on Feb 25

Updates README with some realist hope

5c4f1cd

lukestanley commited on Feb 24

Bug fix for undefined last_edit

f065ef3

lukestanley commited on Feb 24

Add comment about adaptability

6c32632

lukestanley commited on Feb 24

Documents command line usage and module import functionality

3e32321

lukestanley commited on Feb 24

Print new line after LLM output end and some linting

3ebb6e1

lukestanley commited on Feb 24

Make reusable via CLI, and modue, moved core logic to improvement_loop.

fbb0bdf

lukestanley commited on Feb 24

Makes URL more obvious, update comments, lowers temp

a96b492

lukestanley commited on Feb 23

Update comments and remove unused code

68a2a07

lukestanley commited on Feb 23

Make goal more clear

e2b2995

lukestanley commited on Feb 23

Removes unused line and tidying

f932ac0

lukestanley commited on Feb 23

Move prompt strings and types to own file, reorder code a bit

139217d

lukestanley commited on Feb 23

Move prompts to own file

550885e

lukestanley commited on Feb 23

Linting with Black

e4b918c

lukestanley commited on Feb 23

Removes unused code

811e485

lukestanley commited on Feb 23

Emoji refactor

d641f9a

lukestanley commited on Feb 23

Add more detailed setup notes with GPU, fork, and other pip dependencies

2831e2c

lukestanley commited on Feb 23

Adds demo gif

a97efcc

lukestanley commited on Feb 23

Add clone example

ce18752

lukestanley commited on Feb 23

Removed unused lines

f84c1a6

lukestanley commited on Feb 23

Slight refactor tidying

327982a

lukestanley commited on Feb 23

Emoji changes

0519e07

lukestanley commited on Feb 23

Minor changes, typo fixes

9412837

lukestanley commited on Feb 23

Adds README

428143c

Luke Stanley commited on Feb 23

Add script based on my old Gist https://gist.github.com/lukestanley/881d3c30c64362126352a9cecb069a3b

ec7a11c

Luke Stanley commited on Feb 23

Initial commit

6d5b429

Luke Stanley commited on Feb 23

Commit History

Documents serverless motivation and testing instructions 5da2aef

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments 469f650

Fix RUNPOD_ENDPOINT_ID environment variable ce5ad5f

Add more serverless GPU endpoint setup instruction detail b51ce5c

Document serverless setup f2e80c9

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git 83e4d57

Introduces worker mode env var 56e785c

Make GPU detection and llama-cpp-python re-installation conditional 434144a

Initialise global variables in improvement_loop function e30b729

Ensure N_GPU_LAYERS is int 9475016

Expose json typed LLM interface for RunPod 976ea17

RunPod Mixtral JSON output test 233efeb

Add hello world RunPod setup feeb679

Update default GPU layer, temperature values e327a9e

Add env vars to set GPU layer count and context size, make verbose e01e28e

Fix gif link since LFS related gif binary purge due to HF requirments 0945e5b

Add n_gpu_layers parameter to Llama initialization 88e6118

Fix: Move n_ctx parameter to model setup! 358cd20

Fix check for LLM_MODEL_PATH to avoid load error ff938c3

Correct Space metadata f5a3b9d

Add HuggingFace space metadata 994c606

Adds Gradio app wrapper and Dockerfile c355718

Auto-downloads model if env var is not set 74d6e52

Make llm_stream_sans_network actually stream to stdout a0f49a0

Default to in-memory LLM interface ddb0d91

Updates README with some realist hope 5c4f1cd

Bug fix for undefined last_edit f065ef3

Add comment about adaptability 6c32632

Documents command line usage and module import functionality 3e32321

Print new line after LLM output end and some linting 3ebb6e1

Make reusable via CLI, and modue, moved core logic to improvement_loop. fbb0bdf

Makes URL more obvious, update comments, lowers temp a96b492

Update comments and remove unused code 68a2a07

Make goal more clear e2b2995

Removes unused line and tidying f932ac0

Move prompt strings and types to own file, reorder code a bit 139217d

Move prompts to own file 550885e

Linting with Black e4b918c

Removes unused code 811e485

Emoji refactor d641f9a

Add more detailed setup notes with GPU, fork, and other pip dependencies 2831e2c

Adds demo gif a97efcc

Add clone example ce18752

Removed unused lines f84c1a6

Slight refactor tidying 327982a

Emoji changes 0519e07

Minor changes, typo fixes 9412837

Adds README 428143c

Add script based on my old Gist https://gist.github.com/lukestanley/881d3c30c64362126352a9cecb069a3b ec7a11c

Initial commit 6d5b429

Documents serverless motivation and testing instructions

5da2aef

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments

469f650

Fix RUNPOD_ENDPOINT_ID environment variable

ce5ad5f

Add more serverless GPU endpoint setup instruction detail

b51ce5c

Document serverless setup

f2e80c9

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git

83e4d57

Introduces worker mode env var

56e785c

Make GPU detection and llama-cpp-python re-installation conditional

434144a

Initialise global variables in improvement_loop function

e30b729

Ensure N_GPU_LAYERS is int

9475016

Expose json typed LLM interface for RunPod

976ea17

RunPod Mixtral JSON output test

233efeb

Add hello world RunPod setup

feeb679

Update default GPU layer, temperature values

e327a9e

Add env vars to set GPU layer count and context size, make verbose

e01e28e

Fix gif link since LFS related gif binary purge due to HF requirments

0945e5b

Add n_gpu_layers parameter to Llama initialization

88e6118

Fix: Move n_ctx parameter to model setup!

358cd20

Fix check for LLM_MODEL_PATH to avoid load error

ff938c3

Correct Space metadata

f5a3b9d

Add HuggingFace space metadata

994c606

Adds Gradio app wrapper and Dockerfile

c355718

Auto-downloads model if env var is not set

74d6e52

Make llm_stream_sans_network actually stream to stdout

a0f49a0

Default to in-memory LLM interface

ddb0d91

Updates README with some realist hope

5c4f1cd

Bug fix for undefined last_edit

f065ef3

Add comment about adaptability

6c32632

Documents command line usage and module import functionality

3e32321

Print new line after LLM output end and some linting

3ebb6e1

Make reusable via CLI, and modue, moved core logic to improvement_loop.

fbb0bdf

Makes URL more obvious, update comments, lowers temp

a96b492

Update comments and remove unused code

68a2a07

Make goal more clear

e2b2995

Removes unused line and tidying

f932ac0

Move prompt strings and types to own file, reorder code a bit

139217d

Move prompts to own file

550885e

Linting with Black

e4b918c

Removes unused code

811e485

Emoji refactor

d641f9a

Add more detailed setup notes with GPU, fork, and other pip dependencies

2831e2c

Adds demo gif

a97efcc

Add clone example

ce18752

Removed unused lines

f84c1a6

Slight refactor tidying

327982a

Emoji changes

0519e07

Minor changes, typo fixes

9412837

Adds README

428143c

Add script based on my old Gist https://gist.github.com/lukestanley/881d3c30c64362126352a9cecb069a3b

ec7a11c

Initial commit

6d5b429