--- language: - en tags: - ggml - causal-lm - gpt2 license: mit --- ``` ▄▄▄ ██▓ ▓█████▄ █ ██ ███▄ █ ▄████ ▓█████ ▒█████ ███▄ █ ▒████▄ ▓██▒ ▒██▀ ██▌ ██ ▓██▒ ██ ▀█ █ ██▒ ▀█▒▓█ ▀ ▒██▒ ██▒ ██ ▀█ █ ▒██ ▀█▄ ▒██▒ ░██ █▌▓██ ▒██░▓██ ▀█ ██▒▒██░▄▄▄░▒███ ▒██░ ██▒▓██ ▀█ ██▒ ░██▄▄▄▄██ ░██░ ░▓█▄ ▌▓▓█ ░██░▓██▒ ▐▌██▒░▓█ ██▓▒▓█ ▄ ▒██ ██░▓██▒ ▐▌██▒ ▓█ ▓██▒░██░ ░▒████▓ ▒▒█████▓ ▒██░ ▓██░░▒▓███▀▒░▒████▒░ ████▓▒░▒██░ ▓██░ ▒▒ ▓▒█░░▓ ▒▒▓ ▒ ░▒▓▒ ▒ ▒ ░ ▒░ ▒ ▒ ░▒ ▒ ░░ ▒░ ░░ ▒░▒░▒░ ░ ▒░ ▒ ▒ ▒ ▒▒ ░ ▒ ░ ░ ▒ ▒ ░░▒░ ░ ░ ░ ░░ ░ ▒░ ░ ░ ░ ░ ░ ░ ▒ ▒░ ░ ░░ ░ ▒░ ░ ▒ ▒ ░ ░ ░ ░ ░░░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ▒ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ░ ``` ### This repository contains quantized conversions of the AI Dungeon 2 checkpoint, "model_v5". *For use with frontends that support GGML quantized GPT-2 models.* *Last updated on 2023-09-23.* Model | RAM usage (KoboldCpp) | RAM usage (Oobabooga) :--:|:--:|:--: aid2classic-ggml-q4_0.bin | 984.1 MiB | 1.4 GiB aid2classic-ggml-q4_1.bin | 1.1 GiB | 1.5 GiB aid2classic-ggml-q5_0.bin | 1.2 GiB | 1.6 GiB aid2classic-ggml-q5_1.bin | 1.2 GiB | 1.7 GiB aid2classic-ggml-q8_0.bin | 1.7 GiB | 2.2 GiB aid2classic-ggml-f16.bin | 3.2 GiB | 3.6 GiB **Description:** 2019 AI Dungeon users may recognize the model quantized was used by [the open-source AI Dungeon 2 project](https://github.com/Latitude-Archives/AIDungeon) and its various forks. This was before it moved to its own website and rebranded to "AI Dungeon". 2020-2021 AI Dungeon users may recognize this model as "Classic". If you want a better model trained on the same dataset, see [Spring Dragon 13B](https://huggingface.co/TheBloke/Spring-Dragon-GGUF), intended to replicate 2020 AI Dungeon's "Dragon" experience on local hardware. **Notes:** - KoboldCpp [[bfc696f]](https://github.com/LostRuins/koboldcpp/tree/bfc696fcc452975dbe8967c39301ba856d04a030) was tested without OpenBLAS. - Oobabooga [[895ec9d]](https://github.com/oobabooga/text-generation-webui/tree/895ec9dadb96120e8202a83052bf9032ca3245ae) was tested with with the `--model --loader ctransformers --model_type gpt2` launch arguments. - ggerganov/ggml [[8ca2c19]](https://github.com/ggerganov/ggml/tree/8ca2c19a3bb8622954d858fbf6383522684eaf34)'s gpt-2 conversion script was used for conversion and quantization. - The original model was found in the `generator/gpt2/models/model_v5` directory of [AI Dungeon 2 Unleashed](https://henk.tech/aid/).