End of training
Browse files
    	
        README.md
    CHANGED
    
    | 
         @@ -5,23 +5,23 @@ tags: 
     | 
|
| 5 | 
         
             
            metrics:
         
     | 
| 6 | 
         
             
            - rouge
         
     | 
| 7 | 
         
             
            model-index:
         
     | 
| 8 | 
         
            -
            - name: flan-t5-small-taboo-for-llms
         
     | 
| 9 | 
         
             
              results: []
         
     | 
| 10 | 
         
             
            ---
         
     | 
| 11 | 
         | 
| 12 | 
         
             
            <!-- This model card has been generated automatically according to the information the Trainer had access to. You
         
     | 
| 13 | 
         
             
            should probably proofread and complete it, then remove this comment. -->
         
     | 
| 14 | 
         | 
| 15 | 
         
            -
            # flan-t5-small-taboo-for-llms
         
     | 
| 16 | 
         | 
| 17 | 
         
            -
            This model is a fine-tuned version of [ 
     | 
| 18 | 
         
             
            It achieves the following results on the evaluation set:
         
     | 
| 19 | 
         
            -
            - Loss: 2. 
     | 
| 20 | 
         
            -
            - Rouge1:  
     | 
| 21 | 
         
            -
            - Rouge2:  
     | 
| 22 | 
         
            -
            - Rougel:  
     | 
| 23 | 
         
            -
            - Rougelsum:  
     | 
| 24 | 
         
            -
            - Gen Len:  
     | 
| 25 | 
         | 
| 26 | 
         
             
            ## Model description
         
     | 
| 27 | 
         | 
| 
         @@ -46,13 +46,22 @@ The following hyperparameters were used during training: 
     | 
|
| 46 | 
         
             
            - seed: 42
         
     | 
| 47 | 
         
             
            - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
         
     | 
| 48 | 
         
             
            - lr_scheduler_type: linear
         
     | 
| 49 | 
         
            -
            - num_epochs:  
     | 
| 50 | 
         | 
| 51 | 
         
             
            ### Training results
         
     | 
| 52 | 
         | 
| 53 | 
         
            -
            | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 
     | 
| 54 | 
         
            -
             
     | 
| 55 | 
         
            -
            | No log        | 1.0   | 6    | 2. 
     | 
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 56 | 
         | 
| 57 | 
         | 
| 58 | 
         
             
            ### Framework versions
         
     | 
| 
         | 
|
| 5 | 
         
             
            metrics:
         
     | 
| 6 | 
         
             
            - rouge
         
     | 
| 7 | 
         
             
            model-index:
         
     | 
| 8 | 
         
            +
            - name: flan-t5-small-taboo-for-llms-repo
         
     | 
| 9 | 
         
             
              results: []
         
     | 
| 10 | 
         
             
            ---
         
     | 
| 11 | 
         | 
| 12 | 
         
             
            <!-- This model card has been generated automatically according to the information the Trainer had access to. You
         
     | 
| 13 | 
         
             
            should probably proofread and complete it, then remove this comment. -->
         
     | 
| 14 | 
         | 
| 15 | 
         
            +
            # flan-t5-small-taboo-for-llms-repo
         
     | 
| 16 | 
         | 
| 17 | 
         
            +
            This model is a fine-tuned version of [vlkn/flan-t5-small-taboo-for-llms](https://huggingface.co/vlkn/flan-t5-small-taboo-for-llms) on the None dataset.
         
     | 
| 18 | 
         
             
            It achieves the following results on the evaluation set:
         
     | 
| 19 | 
         
            +
            - Loss: 2.4246
         
     | 
| 20 | 
         
            +
            - Rouge1: 31.1045
         
     | 
| 21 | 
         
            +
            - Rouge2: 13.9259
         
     | 
| 22 | 
         
            +
            - Rougel: 30.6689
         
     | 
| 23 | 
         
            +
            - Rougelsum: 31.0064
         
     | 
| 24 | 
         
            +
            - Gen Len: 19.0
         
     | 
| 25 | 
         | 
| 26 | 
         
             
            ## Model description
         
     | 
| 27 | 
         | 
| 
         | 
|
| 46 | 
         
             
            - seed: 42
         
     | 
| 47 | 
         
             
            - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
         
     | 
| 48 | 
         
             
            - lr_scheduler_type: linear
         
     | 
| 49 | 
         
            +
            - num_epochs: 10
         
     | 
| 50 | 
         | 
| 51 | 
         
             
            ### Training results
         
     | 
| 52 | 
         | 
| 53 | 
         
            +
            | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
         
     | 
| 54 | 
         
            +
            |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
         
     | 
| 55 | 
         
            +
            | No log        | 1.0   | 6    | 2.7272          | 29.2133 | 12.3393 | 27.335  | 27.1699   | 16.6667 |
         
     | 
| 56 | 
         
            +
            | No log        | 2.0   | 12   | 2.6153          | 29.2351 | 10.0862 | 27.3832 | 27.3832   | 17.0    |
         
     | 
| 57 | 
         
            +
            | No log        | 3.0   | 18   | 2.5546          | 31.4176 | 11.5699 | 29.5658 | 29.5658   | 17.0    |
         
     | 
| 58 | 
         
            +
            | No log        | 4.0   | 24   | 2.5072          | 31.2198 | 12.9464 | 30.3189 | 30.4981   | 19.0    |
         
     | 
| 59 | 
         
            +
            | No log        | 5.0   | 30   | 2.4758          | 31.6534 | 14.1563 | 30.3831 | 30.5725   | 19.0    |
         
     | 
| 60 | 
         
            +
            | No log        | 6.0   | 36   | 2.4548          | 31.5148 | 14.1563 | 31.0945 | 31.4799   | 19.0    |
         
     | 
| 61 | 
         
            +
            | No log        | 7.0   | 42   | 2.4410          | 31.1045 | 13.9259 | 30.6689 | 31.0064   | 19.0    |
         
     | 
| 62 | 
         
            +
            | No log        | 8.0   | 48   | 2.4319          | 31.1045 | 13.9259 | 30.6689 | 31.0064   | 19.0    |
         
     | 
| 63 | 
         
            +
            | No log        | 9.0   | 54   | 2.4265          | 31.1045 | 13.9259 | 30.6689 | 31.0064   | 19.0    |
         
     | 
| 64 | 
         
            +
            | No log        | 10.0  | 60   | 2.4246          | 31.1045 | 13.9259 | 30.6689 | 31.0064   | 19.0    |
         
     | 
| 65 | 
         | 
| 66 | 
         | 
| 67 | 
         
             
            ### Framework versions
         
     | 
    	
        logs/events.out.tfevents.1685801092.85a114cc9afe.156.5
    CHANGED
    
    | 
         @@ -1,3 +1,3 @@ 
     | 
|
| 1 | 
         
             
            version https://git-lfs.github.com/spec/v1
         
     | 
| 2 | 
         
            -
            oid sha256: 
     | 
| 3 | 
         
            -
            size  
     | 
| 
         | 
|
| 1 | 
         
             
            version https://git-lfs.github.com/spec/v1
         
     | 
| 2 | 
         
            +
            oid sha256:28c533bba136e76fd880e25ddd8a188942038da49113d5143fec60b3657f3de5
         
     | 
| 3 | 
         
            +
            size 10634
         
     | 
    	
        logs/events.out.tfevents.1685801279.85a114cc9afe.156.7
    ADDED
    
    | 
         @@ -0,0 +1,3 @@ 
     | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
| 
         | 
|
| 1 | 
         
            +
            version https://git-lfs.github.com/spec/v1
         
     | 
| 2 | 
         
            +
            oid sha256:ac7691f8ee9f65b5afa837c6794d150ed98e6b46cde76e06a44dfaa5e003061e
         
     | 
| 3 | 
         
            +
            size 603
         
     |