renegarza committed on
Commit 9721ce9
1 Parent(s): 9ccbe40

Update README.md

  1. README.md +3 -4
README.md CHANGED
@@ -47,15 +47,14 @@ This was later utilized to prompt the Transformer's Masked Language Modeling(MLM
  Additionally, Data Engineers that work with patient electronic records consistently spend an excessive amount of time parsing through the unstructured discharge notes format to accomplish their tasks.
  The solution will be instrumental for agents who are not directly facing patients but hold back-end roles that are also of immense importance.

- Data Engineer?
  #### Use Cases
  This model allows for the efficient summarization of complexly documented doctor notes. It provides instant access to insight with proper semantic cues in place.

- ##### Limitations & Future Aspirations
- With an increased amount of data, more deliberate results might be achieved through more training. Also, further improvements on the model's summarization capabilities have been considered.
+ #### Limitations & Future Aspirations
+ With an increased amount of data, more deliberate results might be achieved through more training. Synthetic transcriptions could be created with GPT models to in turn train on. Also, further improvements on the model's summarization capabilities have been considered.
  One of which is implementing summarization based on clustered titles within the discharge notes. The feature would allow for easier traversal through partitioned summarization and result in better structure.

- ##### Training and evaluation data
+ #### Training and evaluation data
  The generated summaries were assigned to the original transcription and after splitting the data into the train and test sets, the table was converted into a json file.
  The structure allowed us to effectively train the model on the premise of transcription to summarization prompts. After all the metrics were evaluated, a number of medical transcriptions were generated through
  generative transformers to summarize and upon testing the model performed well.
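
For readers of this change, the data preparation described under "Training and evaluation data" (pair each transcription with its summary, split into train and test sets, serialize to JSON) might look roughly like the sketch below. It is not the author's script: the field names `transcription` and `summary`, the 80/20 split ratio, the fixed seed, and the helper `split_records` are all assumptions made for illustration, and the records are placeholders rather than real discharge notes.

```python
import json
import random


def split_records(records, test_fraction=0.2, seed=0):
    """Shuffle transcription/summary pairs and split them into train and test sets.

    test_fraction and seed are illustrative defaults, not values from the README.
    """
    shuffled = list(records)               # copy so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for a reproducible split
    n_test = max(1, round(len(shuffled) * test_fraction))
    return {"train": shuffled[n_test:], "test": shuffled[:n_test]}


# Placeholder rows standing in for the table of transcriptions and summaries.
records = [
    {"transcription": f"placeholder discharge note {i}", "summary": f"placeholder summary {i}"}
    for i in range(10)
]

splits = split_records(records)

# Serialize the split table to JSON; json.dump(splits, file) would write it to disk instead.
payload = json.dumps(splits, indent=2)
```

Keeping the seed fixed means the same rows land in the test set on every run, which makes evaluation metrics comparable across training runs.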