LennardZuendorf committed on
Commit
3f2a988
1 Parent(s): ad3c909

chore: updating md files and credits

README.md CHANGED
@@ -9,34 +9,43 @@ app_file: main.py
  pinned: true
  license: mit
  app_port: 8080
+ models: microsoft/GODEL-v1_1-large-seq2seq
+ tags: ["CHAT", "XAI", "SHAP", "GODEL", "Gradio"]
+ disable_embedding: true
  ---

- # Bachelor Thesis
- ## Webapp
- #### This is the ui showcase for my thesis about interpretability in a LLM based chatbot application and applications of XAI.
+ # Bachelor Thesis Webapp

- ## 🔗 Links:
+ ## 📜 Info:
+ This is the ui showcase for my thesis about interpretability in a LLM based chatbot application and applications of XAI.
+
+ ### 🔗 Links:

  **[Github Repository](https://github.com/LennardZuendorf/thesis-webapp)**
- **[Huggingface Spaces Showcase](https://huggingface.co/spaces/lennardzuendorf/thesis-webapp-docker**

- ## 🏗️ Tech Stack:
+ **[Huggingface Spaces Showcase](https://huggingface.co/spaces/lennardzuendorf/thesis-webapp-docker)**
+
+ ### 🏗️ Tech Stack:

  **Language and Framework:** Python

  **Noteable Packages:** 🤗 Transformers, FastAPI, Gradio, SHAP, BERTViz

- ## 👨‍💻 Author and Credits:</h2>
+ ## 👨‍💻 Author and Credits:

- **Author:** [@LennardZuendorf](https://github.com/LennardZuendorf)
+ **Author: [@LennardZuendorf](https://github.com/LennardZuendorf)**

- **Thesis Supervisor**: [Prof. Dr. Simbeck](https://www.htw-berlin.de/hochschule/personen/person/?eid=9862)
+ **Thesis Supervisor: [Prof. Dr. Simbeck](https://www.htw-berlin.de/hochschule/personen/person/?eid=9862)**
  <br> Second Corrector: [Prof. Dr. Hochstein](https://www.htw-berlin.de/hochschule/personen/person/?eid=10628)

- See code for in detailed credits, work is based on
+ See code for in detailed credits, work is strongly based on:
+ #### GODEL
+ - [HGF Model Page](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq?text=Hey+my+name+is+Mariama%21+How+are+you%3F)
+ - [Paper on HGF](https://huggingface.co/papers/2206.11309)
+ - [Paper Print](https://arxiv.org/abs/2206.11309)

- - GODEL:
- - SHAP:
- - BERTViz:
+ #### SHAP
+ - [Github](https://github.com/shap/shap)
+ - [Inital Paper](https://arxiv.org/abs/1705.07874)

  This Project was part of my studies of Business Computing at University of Applied Science for Technology and Business Berlin (HTW Berlin).
main.py CHANGED
@@ -60,7 +60,7 @@ with gr.Blocks(
  # markdown component to display the header
  gr.Markdown("""
  # Thesis Demo - AI Chat Application with GODEL
- ## Interpretability powered by SHAP and BERTVIZ
+ Interpretability powered by shap and attention visualization,
  ### Select between tabs below for the different views.
  """)
  # ChatBot tab used to chat with the AI chatbot
public/about.md CHANGED
@@ -8,14 +8,13 @@ This research tackles the rise of LLM based applications such a chatbots and exp

  - [GitHub Repository](https://github.com/LennardZuendorf/thesis-webapp) - The GitHub repository of this project.
  - [HTW Berlin](https://www.htw-berlin.de/) - The University I have built this project for, as part of my thesis.
- - [Thesis Print]() - Link to the thesis pdf (in English), containing more information about the project. And a full list of sources for this work as well as additional evaluations and fundamental information for the project.


  ## Implementation

- This project is an implementation of PartitionSHAP and BERTViz into GODEL by Microsoft - [GODEL Model](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq) which is a generative seq2seq transformer fine-tuned for goal directed dialog. It supports context and knowledge base inputs.
+ This project is an implementation of PartitionSHAP into GODEL by Microsoft - [GODEL Model](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq) which is a generative seq2seq transformer fine-tuned for goal directed dialog. It supports context and knowledge base inputs.

- The UI is build with Gradio.
+ The UI is build with Gradio, utilizing some custom components and FastAPI.

  ### Usage

@@ -39,13 +38,13 @@ Please credit the original authors of this project (Lennard Zündorf) and the cr
  ### Author

  - Lennard Zündorf
- - lennard.zuendorf@student.htw-berlin.de
- - [GitHub](https://github.com/LennardZuendorf)
- - [LinkedIn](https://www.zuendorf.me/linkd)
+ - [lennard.zuendorf@student.htw-berlin.de](mailto:lennard.zuendorf@student.htw-berlin.de) or [lennard@zuendorf.me](mailto:lennard@zuendorf.me)
+ - [GitHub](https://zuendorf.me/github)
+ - [LinkedIn](https://zuendorf.me/github)


  ### University
  Hochschule für Technik und Wirtschaft Berlin (HTW Berlin) - University of Applied Sciences for Engineering and Economics Berlin

- 1. Supervisor: Prof. Dr. Katarina Simbeck
- 2. Supervisor: Prof. Dr. Axel Hochstein
+ Supervisor: Prof. Dr. Katarina Simbeck
+ 2. Corrector : Prof. Dr. Axel Hochstein
public/credits_dataprotection_license.md CHANGED
@@ -1,7 +1,4 @@
-
-
  # Credits
- For full credits, please refer to the [thesis print]()

  ### Models
  This implementation is build on GODEL by Microsoft, Inc.
@@ -10,7 +7,7 @@ This implementation is build on GODEL by Microsoft, Inc.
  GODEL is an open source model by Microsoft. See [offical paper](https://arxiv.org/abs/2206.11309) for more information.

  - the version used in this project is GODEL Large, see [huggingface model hub](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq?text=Hey+my+name+is+Thomas%21+How+are+you%3F)
- - the model as is a generative seq2seq transformer fine tuned for goal directed dialog
+ - the model as is a generative seq2seq transformer fine-tuned for goal directed dialog
  - it supports context and knowledge base inputs

  ### Libraries
@@ -19,14 +16,14 @@ This project uses a number of open source libraries, only the most important one
  ##### Shap
  This application uses a custom version of the shap library, which is available at [GitHub](https://github.com/shap/shap).

- - please refer to the [thesis-custom-shap](https://github.com/LennardZuendorf/thesis-custom-shap) repository for more information about the changes made to the library, specifically the README and CHANGES files
- - the shap library and the used partition SHAP explainer are based on work by Lundberg et al. (2017), see [offical paper](https://arxiv.org/pdf/1705.07874.pdf) for more information
+ - Please refer to the [thesis-shap](https://github.com/LennardZuendorf/thesis-custom-shap) repository for more information about the changes made to the library, specifically the README file.
+ - The shap library and the used partition SHAP explainer are based on work by Lundberg et al. (2017), see [offical paper](https://arxiv.org/pdf/1705.07874.pdf) for more information.

- ##### BertViz
- This application uses a slightly customized version of the bertviz library, which is available at [GitHub](https://github.com/jessevig/bertviz)
+ ##### Visualizations
+ This application uses attention visualization inspired by the bertviz library, which is available at[GitHub](https://github.com/jessevig/bertviz). It doesn't actually use BERTViz.

- - the bertviz was introduced by Vig et al. (2019), see [offical paper](https://arxiv.org/pdf/1906.05714.pdf) for more information
- - there are no changes to the library itself, only to the way it is used in this project (adapted to use Mistral/LlaMa 2 instead of BERT)
+ - The bertviz was introduced by Vig et al. (2019), see [offical paper](https://arxiv.org/pdf/1906.05714.pdf) for more information.
+ - This project only uses cross attention visualization with gradio and matplotlib.


  # Data Protection