Files changed (1) hide show
  1. README.md +27 -13
README.md CHANGED
@@ -25,19 +25,33 @@ It achieves the following results on the evaluation set:
25
  - Rougelsum: 0.3525
26
  - Gen Len: 55.882
27
 
28
- ## Model description
29
-
30
- More information needed
31
-
32
- ## Intended uses & limitations
33
-
34
- More information needed
35
-
36
- ## Training and evaluation data
37
-
38
- More information needed
39
-
40
- ## Training procedure
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
 
42
  ### Training hyperparameters
43
 
 
25
  - Rougelsum: 0.3525
26
  - Gen Len: 55.882
27
 
28
+ ### Project Purpose
29
+
30
+ Our goal is to deliver an effective summarization solution aimed at making doctor discharge notes more structured and comprehensive.
31
+ A physician's job goes far beyond saving lives, doctors are also responsible for providing a comforting environment for their patients.
32
+ With that in mind, while accommodating in a high-stress environment it is difficult to follow a structure and formulate notes with universal interpretability in mind.
33
+ This leads to long and convoluted discharge documentation that becomes very tedious to leverage and understand. Our model is a product that will alleviate
34
+ a significant amount of discomfort when creating and utilizing physician notes, which ultimately will lead to more fluid workflows and increased convenience for healthcare providers.
35
+
36
+ ### Intended Use
37
+ #### Model
38
+ We leveraged Google's Pegasus abstractive text summarization to generate summaries of the discharged transcriptions included in the MTSamples dataset.
39
+ This was later utilized to prompt the Transformer's Masked Language Modeling(MLM) functionality to train the model to generate meaningful text with better structure and organization than the original.
40
+ Additionally, Data Engineers that work with patient electronic records consistently spend an excessive amount of time parsing through the unstructured discharge notes format to accomplish their tasks.
41
+ The solution will be instrumental for agents who are not directly facing patients but hold back-end roles that are also of immense importance.
42
+
43
+ Data Engineer?
44
+ #### Use Cases
45
+ This model allows for the efficient summarization of complexly documented doctor notes. It provides instant access to insight with proper semantic cues in place.
46
+
47
+ ##### Limitations & Future Aspirations
48
+ With an increased amount of data, more deliberate results might be achieved through more training. Also, further improvements on the model's summarization capabilities have been considered.
49
+ One of which is implementing summarization based on clustered titles within the discharge notes. The feature would allow for easier traversal through partitioned summarization and result in better structure.
50
+
51
+ ##### Training and evaluation data
52
+ The generated summaries were assigned to the original transcription and after splitting the data into the train and test sets, the table was converted into a json file.
53
+ The structure allowed us to effectively train the model on the premise of transcription to summarization prompts. After all the metrics were evaluated, a number of medical transcriptions were generated through
54
+ generative transformers to summarize and upon testing the model performed well.
55
 
56
  ### Training hyperparameters
57