Update README.md

I'm interested in using encoder-based extraction of named legal document sections.

- RoBERTa base output shape: (batch size, seq length, hidden size)
- RoBERTa base hidden size = 768
- RoBERTa base max input seq length = 512
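
A minimal sketch of how these numbers can be checked, assuming the Hugging Face `transformers` library and the `roberta-base` checkpoint (neither is named above):

```python
import torch
from transformers import RobertaModel, RobertaTokenizer

tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
model = RobertaModel.from_pretrained("roberta-base")

batch = tokenizer(["Section 1. Definitions."], return_tensors="pt")
with torch.no_grad():
    out = model(**batch)

print(out.last_hidden_state.shape)  # (batch size, seq length, hidden size) = (1, n, 768)
print(model.config.hidden_size)     # 768
print(tokenizer.model_max_length)   # 512
```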

### Using RoBERTa for segmentation involves combining sentences A and B into a single input to RoBERTa, as shown below:

[CLS] A [SEP] B

and the embedding for [CLS] can be used in a binary classifier.
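
A minimal sketch of that setup, again assuming Hugging Face `transformers`; the example sentences are invented, and note that RoBERTa's tokenizer actually uses `<s>`/`</s>` rather than the BERT-style `[CLS]`/`[SEP]`:

```python
import torch
from transformers import RobertaForSequenceClassification, RobertaTokenizer

tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
# Untrained 2-label head, e.g. "same section" vs. "new section starts at B".
clf = RobertaForSequenceClassification.from_pretrained("roberta-base", num_labels=2)

A = "This Agreement is entered into by and between the parties."      # example sentence
B = "1. Definitions. The following terms shall have these meanings."  # example sentence

# Encoding the pair yields <s> A </s></s> B </s>, RoBERTa's analogue of [CLS] A [SEP] B.
batch = tokenizer(A, B, return_tensors="pt", truncation=True, max_length=512)
with torch.no_grad():
    logits = clf(**batch).logits  # shape (1, 2); the head reads the <s>/[CLS] position

print(logits.softmax(dim=-1))
```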

### But the architecture used here (implemented via ) is:

1. standard RoBERTa model
2. classification of the [CLS] token embedding:
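
A hypothetical PyTorch sketch of those two pieces (the implementing library is left unnamed above; `ClsBinaryClassifier` and its layer sizes are illustrative, not the repo's actual code):

```python
import torch.nn as nn
from transformers import RobertaModel

class ClsBinaryClassifier(nn.Module):
    """Hypothetical module: a standard RoBERTa encoder (step 1) followed by a
    binary classification head over the [CLS]/<s> token embedding (step 2)."""

    def __init__(self):
        super().__init__()
        self.encoder = RobertaModel.from_pretrained("roberta-base")
        self.head = nn.Linear(self.encoder.config.hidden_size, 2)  # 768 -> 2

    def forward(self, input_ids, attention_mask):
        hidden = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state                # (batch size, seq length, 768)
        cls_embedding = hidden[:, 0, :]    # first position holds the [CLS]/<s> token
        return self.head(cls_embedding)    # logits of shape (batch size, 2)
```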