prepare for exllamav3

Browse files

Files changed (12) hide show

README.md +14 -13
{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/INSTRUCTIONS.txt +0 -0
{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/auto-exl2-upload.zip +0 -0
{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/exl2-quant.py +0 -0
{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/linux-setup.sh +0 -0
{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/windows-setup.bat +0 -0
{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/INSTRUCTIONS.txt +0 -0
{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/exl2-multi-quant-local.zip +0 -0
{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/exl2-quant.py +0 -0
{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/linux-setup.sh +0 -0
{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/windows-setup.bat +0 -0
exllamav3 scripts/placeholder +0 -0

README.md CHANGED Viewed

@@ -11,24 +11,25 @@ Feel free to send in PRs or use this code however you'd like.\
 **For GitHub**: Would recommend creating pull requests and discussions on the [offical huggingface repo](https://huggingface.co/Anthonyg5005/hf-scripts)
-## existing files
-- [Auto EXL2 HF upload](https://huggingface.co/Anthonyg5005/hf-scripts/resolve/main/auto-exl2-upload/auto-exl2-upload.zip?download=true)
-- [EXL2 Local Quants](https://huggingface.co/Anthonyg5005/hf-scripts/resolve/main/exl2-multi-quant-local/exl2-multi-quant-local.zip?download=true)
-- [Upload folder to HF](https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/upload%20folder%20to%20repo.py)
 - [Manage branches (create/delete)](https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/manage%20branches.py)
 - [EXL2 Single Quant V3](https://colab.research.google.com/#fileId=https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/ipynb/EXL2_Private_Quant_V3.ipynb) **(COLAB)**
-## work in progress/not tested (ordered by priority)
-- Easy exl2 quants
-  - Add custom safetensors shard size.
-  - Allow using finegrained tokens to login scripts
 ## other recommended stuff
 - [Exllama Discord server](https://discord.gg/NSFwVuCjRq)
@@ -44,14 +45,14 @@ Feel free to send in PRs or use this code however you'd like.\
 - EXL2 Local Quants
   - Easily creates environment to quantize models to exl2 to your local machine. Supports both Windows and Linux.
 - Upload folder to repo
-  - Uploads user specified folder to specified repo, can create private repos too. Not the same as git commit and push, instead uploads any additional files. This is more of a practice for me than for actual usage.
 - Manage branches
   - Run script and follow prompts. You will be required to be logged in to HF Hub. If you are not logged in, you will need a WRITE token. You can get one in your [HuggingFace settings](https://huggingface.co/settings/tokens). Colab and Kaggle secret keys are supported.
-- EXL2 Single Quant
-  - Allows you to quantize to exl2 using colab. This version creates a exl2 quant to upload to private repo. Only 7B tested on colab.
 - Download models (oobabooga)
   - To use the script, open a terminal and run '`python download-model.py USER/MODEL:BRANCH`'. There's also a '`--help`' flag to show the available arguments. To download from private repositories, make sure to login using '`huggingface-cli login`' or (not recommended) `HF_TOKEN` environment variable.

 **For GitHub**: Would recommend creating pull requests and discussions on the [offical huggingface repo](https://huggingface.co/Anthonyg5005/hf-scripts)
+## main files
+- [Auto EXL2 HF upload](https://huggingface.co/Anthonyg5005/hf-scripts/resolve/main/exllamav2%20scripts/auto-exl2-upload/auto-exl2-upload.zip?download=true)
+- [EXL2 Local Quants](https://huggingface.co/Anthonyg5005/hf-scripts/resolve/main/exllamav2%20scripts/exl2-multi-quant-local/exl2-multi-quant-local.zip?download=true)
 - [Manage branches (create/delete)](https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/manage%20branches.py)
+## outdated or not main focus
 - [EXL2 Single Quant V3](https://colab.research.google.com/#fileId=https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/ipynb/EXL2_Private_Quant_V3.ipynb) **(COLAB)**
+- [Upload folder to HF](https://huggingface.co/Anthonyg5005/hf-scripts/blob/main/upload%20folder%20to%20repo.py)
+<!--
+## work in progress/not tested (ordered by priority)
+none for now. perhaps adding finegrained token support to my hf login code
+-->
 ## other recommended stuff
 - [Exllama Discord server](https://discord.gg/NSFwVuCjRq)
 - EXL2 Local Quants
   - Easily creates environment to quantize models to exl2 to your local machine. Supports both Windows and Linux.
+  - EXL2 Single Quant
+  - Allows you to quantize to exl2 using colab. This version creates a exl2 quant to upload to private repo. Only 7B tested on colab.
 - Upload folder to repo
+  - Uploads user specified folder to specified repo, can create private repos too. Not the same as git commit and push, instead uploads any additional files. This is more of a practice for me than for actual usage as most of the time it crashes on the quantizing process due to lack of ram.
 - Manage branches
   - Run script and follow prompts. You will be required to be logged in to HF Hub. If you are not logged in, you will need a WRITE token. You can get one in your [HuggingFace settings](https://huggingface.co/settings/tokens). Colab and Kaggle secret keys are supported.
 - Download models (oobabooga)
   - To use the script, open a terminal and run '`python download-model.py USER/MODEL:BRANCH`'. There's also a '`--help`' flag to show the available arguments. To download from private repositories, make sure to login using '`huggingface-cli login`' or (not recommended) `HF_TOKEN` environment variable.

{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/INSTRUCTIONS.txt RENAMED Viewed

File without changes

{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/auto-exl2-upload.zip RENAMED Viewed

File without changes

{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/exl2-quant.py RENAMED Viewed

File without changes

{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/linux-setup.sh RENAMED Viewed

File without changes

{auto-exl2-upload → exllamav2 scripts/auto-exl2-upload}/windows-setup.bat RENAMED Viewed

File without changes

{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/INSTRUCTIONS.txt RENAMED Viewed

File without changes

{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/exl2-multi-quant-local.zip RENAMED Viewed

File without changes

{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/exl2-quant.py RENAMED Viewed

File without changes

{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/linux-setup.sh RENAMED Viewed

File without changes

{exl2-multi-quant-local → exllamav2 scripts/exl2-multi-quant-local}/windows-setup.bat RENAMED Viewed

File without changes

exllamav3 scripts/placeholder ADDED Viewed

File without changes