Nice results

#1
by darraghd - opened

Very nice work, after some initial teething problems, like here, I have this running on A100. The results on my prompts are showing to be slightly better than the google/tf-flan-xxl with full precision. Maybe this is be down to chance but there was an improvement on a number of prompts.
In any case, the memory usage is way down, 40GB for full precision and a short prompt, down to around 18GB for your 8-bit version. This is great, it allows me to experiment with much larger prompts...
Thanks a lot for releasing this !!

darraghd changed discussion status to closed

Sign up or log in to comment