Commit History

fix workflow since we use tag instead of releasE
326c217

winglian commited on

fix docker run
1590858

winglian commited on

fix docker tag
ea14ce0

winglian commited on

ctransformers inconsistencise
7cca7c3

winglian commited on

fix paths
c30045e

winglian commited on

build in /build root
4cb6306

winglian commited on

more fixes
ee33b00

winglian commited on

fix pipeline names
43f11da

winglian commited on

more fixes to actions
b5da0d9

winglian commited on

Fix ggml
e31a4eb

winglian commited on

move v in version to part of the arg
ca86014

winglian commited on

support for ctransformers
6025266

winglian commited on

upgrade to v0.1.53
a5688af

winglian commited on

update to latest 0.1.52 llama.cpp
2c81a6b

winglian commited on

remove revision from yml
a6bc28e

winglian commited on

various fixes post-release
dc8c932

winglian commited on

fix for ZWSP
3a335b7

winglian commited on

fix chat prompts for vicuna
dfe6be1

winglian commited on

use stable vicuna as the base for everyone to demo
24b11c7

winglian commited on

fix streaming on instruct
fd55e6d

winglian commited on

use new wizard mega ggml models
b03fb98

winglian commited on

fix stop tokens to match new prompt formatting, stream instruct response, add comments about concurrency to config
e0bf185

winglian commited on

remove chat and instruct, fix instruct prompt format
44eb762

winglian commited on

fix prompt for sharegpt chat
db54789

winglian commited on

fix layout, max size back to 1, llama.cpp doesn't like parallel calls
80c7d2e

winglian commited on

loading torch doesn't play nice, just use config for now
54d2e53

winglian commited on

try to fix combining gr.interface with blocks, try to increase concurrency on larger gpus
dce6894

winglian commited on

support tabbed interface and fix instruct to use GPU
ec779d5

winglian commited on

patience
58f7b09

winglian commited on

update copy
a68164e

winglian commited on

more fixes to config file based llama
776e6fd

winglian commited on

update instruct to use gpu, add verbose description
3143561

winglian commited on

default to instruct for now
2ade250

winglian commited on

cleanup instruct app. add queue so UI notifies users
ed16925

winglian commited on

single concurrency is probably safest
2b019dd

winglian commited on

base model in config is unnecessary, and fix grammar
946b2e7

winglian commited on

move byline around
d90dc27

winglian commited on

tweak the system prompt
496b4f8

winglian commited on

add prediction settings to ui
ce7dde7

winglian commited on

docker cp doesn't support wildcards
988bc04

winglian commited on

link model attributions, use config.yml for some of the chat settings, increase context size
1dc6c65

winglian commited on

rm docker implementation, add llama-cpp-python builder github actions, update copy to identify model in ui
e3ba05b

winglian commited on

fix chat history, update settings to use GPU
4cc03d2

winglian commited on

require torch and nvidia-cublas
0a981aa

winglian commited on

rollback to gradio
51c11ce

winglian commited on

try out docker sdk
6f7e42d

winglian commited on

Update requirements.txt
29b3b7f

winglian commited on

let's see what happens on a GPU
39f7680

winglian commited on

retry on failed hub fetch
de2057e

winglian commited on