loading from csv instead of processing data each time
28e8799
Corey Morriscommited on
updated
667f9a4
Corey Morriscommited on
WIP. Loading data from csv
1a1910c
Corey Morriscommited on
Changed error logging from print statements to logger. It is not currently working to save to a file locally
d96fdf9
Corey Morriscommited on
Catching exceptions in processing files. As new data is introduced, I want to know which files may have different formats and cause problems, but the application shouldn't halt if it can't process a single file
68bce52
Corey Morriscommited on
Added new results from hugging face evaluations
ad0b971
Corey Morriscommited on
added code to split moral scenario question from one question to two
65d6581
Corey Morriscommited on
updated gitignore
76c8220
Corey Morriscommited on
updated dev requirements
5ca617c
Corey Morriscommited on
Extracted plotting functions from moral_app to plotting_utils to improve organization and testability
2b55a03
Corey Morriscommited on
copied main streamlit application to one that will specifically investigate moral reasoning
298ba1f
Corey Morriscommited on
updated date and model count
0c07f8b
Corey Morriscommited on
updated results
f1eba6e
Corey Morriscommited on
Added new hugging face results
3f507e0
Corey Morriscommited on
added a test and removed the code to only test a specific file because that code did not work
6ed8672
Corey Morriscommited on
updated to run submodule update
25d217c
Corey Morriscommited on
Update pytest run to only run specific test files. Other test files are not ready to be run on a different system yet
9345a86
Corey Morriscommited on
Merge branch 'main' of https://github.com/c1505/LLM-Dashboard into main
0e575e0
Corey Morriscommited on
Added additional results
7863417
Corey Morriscommited on
Updated to reflect number of models. Previously, I think there were duplicates
d396c1e
Corey Morriscommited on
Create python-app.yml
063ba51
unverified
Coreycommited on
Updated dependencies
73da8d6
Corey Morriscommited on
Show a random question from the moral scenarios evaluation