Updated the architecture descriptions, images and caption text for the display of the architectures cc46ec6 alfraser commited on Feb 6
Added trendline option in the scatterplot and associated update to the project requirements file for deployment on HF spaces 943d243 alfraser commited on Feb 6
Switched from random.choices to random.sample wherever a random distinct set is needed, as choices samples with replacement so the same item can appear twice. Discovered in pricing testing. b897a48 alfraser commited on Feb 5
Added separate key for the question count as getting weird results in the counts f3f6cf6 alfraser commited on Feb 5
Added ability to set the number of testing threads dynamically from the UI fc8884e alfraser commited on Feb 5
Modified test runner to dispatch requests in parallel to make use of the fact that there is a lot of wait time for the LLM. Defaulting to 16 threads. bb7db2c alfraser commited on Feb 1
Added runner for pricing fact checks to assess the level of fact embedding in the latest model c319c31 alfraser commited on Feb 1
Saved test records and refactored reporter UI code into smaller functions a9d1d49 alfraser commited on Jan 31
Added a push button to generate a random question to the UI, so users don't have to phrase something themselves. 7c479ac alfraser commited on Jan 26
Added the option to pause a failed endpoint in order to be able to kick it with a restart 5ecd875 alfraser commited on Jan 26
Added ability to select which models to compare side by side, allowing for more flexibility in testing my fine-tuned llamas 57b94ca alfraser commited on Jan 25
Added a test runner page which allows you to run a batch of tests from the UI ab87be2 alfraser commited on Jan 24
Fixed issue in resolving sleep function caused by import time.time elsewhere f443824 alfraser commited on Jan 23
Removed decimals from group tag and moved where it is called in the side by side flow so they actually share the same ID 2f008c2 alfraser commited on Jan 23
Changed function name as getting a conflict on the server which I am not getting locally 59df961 alfraser commited on Jan 23
Added the display of the question typed into the side by side setup. d0a9930 alfraser commited on Jan 20
Removed debug print statements which cause extra chaff in the logs 1c63a8a alfraser commited on Jan 20
Updated to refresh every 10 seconds when an endpoint is starting up 38dd285 alfraser commited on Jan 19
Added a page to see and control (pause/resume) the model inference endpoints dfe6290 alfraser commited on Jan 19
Removed now unused code which calls Hugging Face via the free HTTP interface - moved everything to endpoints to be equivalent. 227dc25 alfraser commited on Jan 19
Fixed a display issue in the markdown with dollar signs in prices aff284c alfraser commited on Jan 16
Added option to include a diagram of the architecture under test on the architectures page. 2db4636 alfraser commited on Jan 15
Added structure for the final dissertation work product (documents and video). e30a304 alfraser commited on Dec 8, 2023
Integrated the architecture page structure and code from the old codebase and adapted it for Hugging Face deployment cc9a95f alfraser commited on Nov 29, 2023
Tidied up the models page and added another model just to get multi-model view 19f408b alfraser commited on Nov 28, 2023
Set up configuration for models on HF and an associated page on the application to allow end user test chat. 54b3256 alfraser commited on Nov 28, 2023
Migrated the databrowser code into the repo for hugging face. 53dc0ac alfraser commited on Nov 28, 2023
Set up security with secrets on the hugging face side and helper function across pages which manages the security and logo. d934e05 alfraser commited on Nov 23, 2023
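
Note on commit b897a48 (random.choices vs random.sample): the difference described in that commit message can be sketched as below. This is a minimal illustration only, not the repository's code; the helper names `pick_distinct` and `pick_with_replacement` and the `seed` parameter are hypothetical and exist purely for the example.

```python
import random


def pick_distinct(items, k, seed=None):
    """Return k distinct elements of items (hypothetical helper).

    random.sample draws without replacement, so no position in
    items is picked twice. It raises ValueError if k > len(items).
    """
    rng = random.Random(seed)
    return rng.sample(items, k)


def pick_with_replacement(items, k, seed=None):
    """Return k elements of items, possibly with repeats (hypothetical helper).

    random.choices draws with replacement, so the same item can
    come back more than once, and k may exceed len(items).
    """
    rng = random.Random(seed)
    return rng.choices(items, k=k)


if __name__ == "__main__":
    questions = ["q1", "q2", "q3", "q4", "q5"]
    print(pick_distinct(questions, 3))          # three different questions
    print(pick_with_replacement(questions, 3))  # may contain a duplicate
```

When a test batch must not repeat an item, the sampling-without-replacement form is the one that guarantees it, at the cost of failing loudly if more items are requested than exist.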