Transformers
llama
TheBloke commited on
Commit
6d9e7de
·
1 Parent(s): fc99f64

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -32,7 +32,7 @@ quantized_by: TheBloke
32
 
33
  This repo contains GGML format model files for [Jon Durbin's Airoboros L2 70B GPT4 m2.0](https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-m2.0).
34
 
35
- GPU acceleration is now available for Llama 2 70B GGML files, with both CUDA (NVidia) and Metal (macOS). The following clients/libraries are known to work with these files, including with CUDA GPU acceleration:
36
  * [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later.
37
  * [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI.
38
  * [KoboldCpp](https://github.com/LostRuins/koboldcpp), version 1.37 and later. A powerful GGML web UI, especially good for story telling.
 
32
 
33
  This repo contains GGML format model files for [Jon Durbin's Airoboros L2 70B GPT4 m2.0](https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-m2.0).
34
 
35
+ GPU acceleration is now available for Llama 2 70B GGML files, with both CUDA (NVidia) and Metal (macOS). The following clients/libraries are known to work with these files, including with GPU acceleration:
36
  * [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later.
37
  * [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI.
38
  * [KoboldCpp](https://github.com/LostRuins/koboldcpp), version 1.37 and later. A powerful GGML web UI, especially good for story telling.