embedding_finetuned / README.md
smokxy's picture
pytorch_model.bin upload/update
97b2d78 verified
|
raw
history blame
41.7 kB
metadata
base_model: BAAI/bge-small-en-v1.5
datasets: []
language: []
library_name: sentence-transformers
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@5
  - cosine_ndcg@10
  - cosine_ndcg@100
  - cosine_mrr@5
  - cosine_mrr@10
  - cosine_mrr@100
  - cosine_map@100
  - dot_accuracy@1
  - dot_accuracy@5
  - dot_accuracy@10
  - dot_precision@1
  - dot_precision@5
  - dot_precision@10
  - dot_recall@1
  - dot_recall@5
  - dot_recall@10
  - dot_ndcg@5
  - dot_ndcg@10
  - dot_ndcg@100
  - dot_mrr@5
  - dot_mrr@10
  - dot_mrr@100
  - dot_map@100
pipeline_tag: sentence-similarity
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:900
  - loss:GISTEmbedLoss
widget:
  - source_sentence: How can I contact the National Co-operative Development Corporation?
    sentences:
      - >-
        '1.1 Chhattisgarh is among the few states in India that have recorded
        impressive growth in agriculture in recent years. Development of farmers
        own institutions catering to their various needs, has kept pace with the
        agricultural growth. As on 30 September 2014, the state had 3,679
        farmers clubs (FCs). There were eight federations of farmer clubs in the
        state, five in Mahasamund, two in Bilaspur and one in Mungeli district.
        In Bilaspur and Mungeli districts (the study area), 300 FCs were formed,
        of which 201 were active. Majority of the farmer clubs (129 clubs) were
        formed by the Regional Rural Bank (Gramin Bank). Other promoting
        institutions include Chhattisgarh Agricon Samiti (30), CARMDAKSH (12),
        SBI (12), ARDB (8) and IFFDC (5). While all the clubs were active in the
        initial three years, many slipped into dormancy through inaction and
        non-availability of hand-holding support. These clubs did not have any
        vision or roadmap for the future. 1.2 The Chhattisgarh RO and DDM
        Bilaspur were keen to make the farmer clubs a sustainable entity and
        felt the need to federate the clubs to a higher tier so as to make the
        entire farmer clubs programme sustainable and the organization a viable
        model. With this in view, the farmer clubs were federated into four
        farmer club federations and were registered under 'Chhattisgarh Society
        Registrikaran Adhiniyam, 1973' in the year 2012.'
      - >-
        '10.1 Under the scheme, financial support to Farmer Producer
        Organization (FPO)      @ up to maximum of Rs. 18 lakh / FPO or actual,
        whichever is lesser is to be provided during three years from the year
        of formation. The financial support is not meant for reimbursing the
        entire administrative and management cost of FPO but it is to provide
        the financial support to the FPOs to the extent provided  to  make them
        sustainable and economically viable. Hence, the fourth year onwards of
        formation, the FPO has to manage their financial support from their own
        business activities. The indicative financial support broadly covers (i)
        the support for salary of its CEO/Manager (maximum up to Rs.25000/month)
        and Accountant (maximum up to Rs. 10000/month); (ii) one time
        registration cost(one time up to  maximum Rs. 40000 or actual whichever
        is lower); (iii) office rent (maximum up to Rs. 48,000/year); (iv)
        utility charges (electricity and telephone charges of office of FPO
        maximum up to Rs. 12000/year); (v) one-time cost for minor equipment
        (including furniture and fixture maximum up to Rs. 20,000); (vi) travel
        and meeting cost (maximum up to Rs.18,000/year); and (vii) misc.
        (cleaning, stationery etc. maximum up to Rs. 12,000/year). Any
        expenditure of operations, management, working capital requirement and
        infrastructure development etc., over and above this, will be met by the
        FPOs from their financial resources.  10.2 FPO being organization of
        farmers, it does not become feasible for FPO itself to  professionally
        administer its activities and day to day business, therefore, FPO
        requires some professionally equipped Manager/CEO to administer its
        activities and day to day business with a sole objective to make FPO
        economically sustainable and farmers' benefiting agri-enterprise. Not
        only for business  development but the value of professional is immense
        in democratizing the FPOs and strengthening its governing system.'
      - >-
        'Risk Analysis    For further information, please contact: Chief General
        Manager, Managing Director Small Farmers' Agri- Business Consortium,
        National Bank for Agriculture & Rural Head office, NCUI Auditorium
        Development, **NABARD**, Building C-24, 'G' Block, 5th floor, 3, Siri
        Institutional Area Bandra-Kurla Complex, August Kranti Marg, Hauz Bandra
        East, Khas, Mumbai - 400051 New Delhi-110016 Tel: 022- Tel:
        011-41060075, 26966017 26539530,26539500 e-mail: sfac@nic.in Website:
        www.sfacindia.com csr.murthy@nabard.org, fsdd@nabard.org Website:
        www.nabard.org Agriculture Marketing Adviser Directorate of Marketing &
        Inspection DAC&FW, New CGO Complex, NH-IV, Faridabad - 121001 Tel: 0129-
        2412518 e-mail: mdrc-dac@gov.in Website: www.dmi.gov.in Managing
        Director National Co-operative Development Corporation, **NCDC**, 4-Siri
        Institutional Area, Hauz Khas, New Delhi - 110016 Tel: 011- 26960796,
        26567140 e-mail:  e-mail: mail@ncdc.in Website: www.ncdc.in Agricultural
        Marketing Division Department of Agriculture, Co-operation & Farmers'
        Welfare Ministry of Agriculture & Farmers' Welfare Krishi Bhawan, New
        Delhi-110001 Tel: 011-23386235, 23388579 Website: www.agricoop.nic.in'
  - source_sentence: >-
      Are any qualifications or certifications required for the VLEs of CSCs in
      facilitating the enrollment of non-loanee farmers?
    sentences:
      - >-
        '13.3.1   Insurance company shall take all necessary steps to take
        appropriate reinsurance cover for their portfolio in order to safeguard
        insured's interest. In case premium to claims ratio exceeds 1:3.5 or
        percentage of claims to Sum Insured exceeds 35%, whichever is higher, at
        the National Level in a crop  season, then Govt. will provide protection
        to Insurance Companies. The losses exceeding the above mentioned level
        in the crop season would be met from equal contribution of the Central
        Govt. and the  concerned State/UT Govts. In case losses are below the
        above mentioned condition, insurers shall be responsible to settle the
        admissible claims.'
      - >-
        '| Picking 1                                             | Picking
        2      | Picking 4   
        |\n|-------------------------------------------------------|----------------|--------------|\n|
        Total Yield Kg)                                       |               
        |              |\n| Picking
        3                                             |               
        |              |\n| Yield
        (Kg)                                            |               
        |              |\n|
        Crop                                                  | Experiment no.
        |              |\n|
        Yield                                                 |               
        |              |\n|
        (Kg)                                                  |               
        |              |\n|
        Yield                                                 |               
        |              |\n|
        (Kg)                                                  |               
        |              |\n|
        Yield                                                 |               
        |              |\n|
        (Kg)                                                  |               
        |              |\n|
        P1                                                    | P2             |
        P3           |\n| Well Conducted CCEs in the  Taluka with 4 pickings   
        |                |              |\n|
        Cotton                                                | E1             |
        1            |\n| Cotton                                               
        | E2             | 1            |\n|
        Cotton                                                | E3             |
        0.75         |\n| Cotton                                               
        | E4             | 0.8          |\n|
        Cotton                                                | E5             |
        0.95         |\n|                                                      
        | Average        | 0.9          |\n|
        6.373                                                 | 2.128          |
        1.282        |\n| (1                                                   
        |                |              |\n|
        st                                                    |               
        |              |\n| +
        2                                                   |               
        |              |\n|
        nd                                                    |               
        |              |\n|
        +3                                                    |               
        |              |\n|
        rd                                                    |               
        |             
        |\n|                                                      
        |                |              |\n| Factor (Total
        yield/                                  |                |             
        |\n| Picking Yield)                                       
        |                |             
        |\n|                                                      
        |                |              |\n|
        (1                                                    |               
        |              |\n|
        st                                                    |               
        |              |\n|
        )                                                     | (1            
        |              |\n|
        st                                                    |               
        |              |\n|
        +                                                     |               
        |              |\n|
        2                                                     |               
        |              |\n|
        nd                                                    |               
        |              |\n|
        )                                                     | )             
        |              |\n| CCEs with Less Pickings in any IU within that 
        Taluka |                |              |\n|
        Cotton                                                | E6 (only 1    
        |              |\n|
        st                                                    |               
        |              |\n|
        Picking)                                              | 1             
        |              |\n|
        Cotton                                                | E7 (1         
        |              |\n|
        st                                                    |               
        |              |\n| and
        2                                                 |               
        |              |\n|
        nd                                                    |               
        |              |\n|
        Picking)                                              | 1.2            |
        1.75         |\n| Cotton                                               
        | E8 (1          |              |\n|
        st                                                    |               
        |              |\n| ,
        2                                                   |               
        |              |\n|
        nd                                                    |               
        |              |\n| &
        3                                                   |               
        |              |\n|
        rd                                                    |               
        |              |\n|
        Picking)                                              | 1.1            |
        1.85         |'
      - >-
        '8.1    CSCs under Ministry of Electronics and Information Technology
        (MeITY) have been engaged to enrol    non-loanee farmers. The Insurance
        Companies are required to enter into a separate agreement with    CSC
        and pay service charges as fixed by DAC&FW, GOI per farmer per village
        per season. No other    agreement or payment is required to be made for
        this purpose. Nodal agency for engagement with    Ministry of
        Agriculture and Farmers Welfare and Insurance Companies will be CSC-SPV,
        a company    established under MeITY for carrying out e-governance
        initiatives of GoI.  8.2    No charges/fee shall be borne or paid by the
        farmers being enrolled through CSCs i.e. CSC-SPV and    CSC-VLE  8.3   
        As per IRDA circular, no separate qualification/certification will be
        required for the VLEs of CSCs to    facilitate enrolment of non-loanee
        farmers.  8.4    All empanelled Insurance Companies will compulsorily be
        required to enter into an agreement with    CSC for enrolment of
        non-loanee farmers and for provision of other defined services to
        farmers.   8.5    Other designated intermediaries may be linked with the
        Portal in due course.   8.6    Empanelled Insurance Companies have to
        necessarily register on the portal and submit list and details    of
        agents/intermediaries engaged for enrolment of non-loanee farmers in the
        beginning of each    season  within 10 days of award of work in the
        State.  Further all agents/intermediaries have to work    strictly as
        per the provisions of the Scheme and IRDA regulations'
  - source_sentence: How many federations of farmer clubs are there in Chhattisgarh?
    sentences:
      - >-
        'i. Shareholder List and Share Capital contribution by each Member
        verified and certified by a Chartered Accountant (CA) prior to
        submission (Format attached, Annexure I- Enclosure-I). ii. Resolution of
        FPO Board/Governing Council to seek Equity Grant for Members (Format
        attached, Annexure I- Enclosure-II).  iii. Consent of Shareholders,
        stating name of shareholder, gender, number of shares held, face value
        of shares, land holding, and signature, signifying consent for
        Implementing Agency to directly transfer the Equity Grant sanctioned to
        the FPC on their behalf, to FPC Bank account, against the consideration
        of additional shares of equivalent value to be issued to them by FPC and
        on exit- transfer of the shares as per rules (Format attached, Annexure
        I-Enclosure-III).   iv. Audited Financials of FPO for a minimum 1
        year/for all years of existence of the FPO if formed less than three
        years prior to application/ for the last 3 years for FPO in existence
        for 3 years or more, verified and certified by a Chartered Accountant
        (CA) prior to submission. v. Photocopy of FPO Bank Account Statement for
        last six months authenticated by Branch Manager. vi. Business plan and
        budget for next 18 months. vii. Names, photographs, and identity proof
        (one from among ration card, Aadhaar card, election identification card,
        and passport of Representatives/ Directors authorized by the Board for
        executing and signing all documents under the Scheme. viii. Each page of
        Application Form   and accompanying documents should be signed by a
        minimum of two Board Member Authorised Representatives of FPO;'
      - >-
        '  2.7    Secured credential/login, preferably linked with Aadhaar
        Number and mobile OTP based, for all    Stakeholders viz, Central
        Government, State Governments, Banks, empanelled Insurance Companies   
        and their designated field functionaries will be provided on the Portal
        to enable them to    enter/upload/download the requisite information.  
        2.8    Insurance Companies shall not distribute/collect/allow any other
        proforma/utility/web Portal etc for    collecting details of insured
        farmers separately. However they may provide all requisite support to   
        facilitate Bank Branches/PACS for uploading the farmer's details on the
        Portal well within the    prescribed cut-off dates.  2.9    Only farmers
        whose data is uploaded on the National Crop Insurance Portal shall be
        eligible for    Insurance coverage and the premium subsidy from State
        and Central Govt. will be released    accordingly.  2.10    All data
        pertaining to crop-wise, area-wise historical yield data, weather data,
        sown area, coverage    and claims data, calamity years and actual yield
        shall be made available on the National Crop    Insurance Portal for the
        purpose of premium rating, claim calculation etc.  2.11   
        Banks/Financial Institutions/other intermediaries need to compulsorily
        transfer the individual farmer's    data electronically to the National
        Crop Insurance Portal. Accordingly Banks/FIs may endeavour to   
        undertake CBS integration in a time bound manner for real time transfer
        of information/data.   2.12     It is also proposed to develop an
        integrated platform/portal for both PMFBY and Interest Subvention   
        Scheme. The data/information of both the Schemes shall be auto
        synchronized to enable real time    sharing of information and better
        program monitoring.  2.13   Insurance Companies shall compulsorily use
        technology/mobile applications for monitoring of crop    health/Crop
        Cutting Experiments (CCEs) in coordination with concerned States. States
        shall also    facilitate Insurance Companies with Satellite
        Imagery/Usage of Drones by way of prior approval of    agency from which
        such data can be sourced. This is required for better monitoring and
        ground-   truthing.'
      - >-
        '1.1 Chhattisgarh is among the few states in India that have recorded
        impressive growth in agriculture in recent years. Development of farmers
        own institutions catering to their various needs, has kept pace with the
        agricultural growth. As on 30 September 2014, the state had 3,679
        farmers clubs (FCs). There were eight federations of farmer clubs in the
        state, five in Mahasamund, two in Bilaspur and one in Mungeli district.
        In Bilaspur and Mungeli districts (the study area), 300 FCs were formed,
        of which 201 were active. Majority of the farmer clubs (129 clubs) were
        formed by the Regional Rural Bank (Gramin Bank). Other promoting
        institutions include Chhattisgarh Agricon Samiti (30), CARMDAKSH (12),
        SBI (12), ARDB (8) and IFFDC (5). While all the clubs were active in the
        initial three years, many slipped into dormancy through inaction and
        non-availability of hand-holding support. These clubs did not have any
        vision or roadmap for the future. 1.2 The Chhattisgarh RO and DDM
        Bilaspur were keen to make the farmer clubs a sustainable entity and
        felt the need to federate the clubs to a higher tier so as to make the
        entire farmer clubs programme sustainable and the organization a viable
        model. With this in view, the farmer clubs were federated into four
        farmer club federations and were registered under 'Chhattisgarh Society
        Registrikaran Adhiniyam, 1973' in the year 2012.'
  - source_sentence: How is the Sum Insured calculated for an individual farmer?
    sentences:
      - >-
        '(i) The software for vertical and horizontal e-Market place will
        provide information  about the details of FPOs produce with its quality
        ready for sale.  (ii) It will provide details of availability of quality
        inputs with price and also the details  of custom hiring facility with
        rate.   (iii) It will also provide detailed information about formation
        of FPO, its registration,  location, major business activity,
        profitability status and all other related information FPO-wise.  (iv) 
        It will have window to highlight the problems being encountered in any
        stage of  formation and promotion of FPOs and quick response by
        respective agency to resolve the problem.  (v) All the software
        developed by Implementing Agencies with respect to FPOs are  to be
        inter-operable with this Integrated portal.'
      - >-
        '12.1   Sum Insured per hectare for both loanee and non-loanee farmers
        will be same and equal to the Scale  of Finance as decided by the
        DLTC/SLTC, and would be pre-declared by SLCCCI and notified. No other
        calculation of Scale of Finance will be applicable. Sum Insured for
        individual farmer is equal to the SOF  per hectare multiplied by area of
        the notified crop proposed by the farmer for insurance. Area under 
        cultivation shall always be expressed in hectare'.  12.2   In cases
        where crops are separately notified under irrigated, un-irrigated
        category by State Govts. Sum  insured for irrigated and un-irrigated
        areas should be separately indicated.'
      - >-
        'i. The credit guarantee cover per FPO will be limited to the project
        loan of Rs. 2  crore. In case of project loan up to Rs. 1 crore, credit
        guarantee cover will be 85% of bankable project loan with ceiling of Rs.
        85 lakh; while in case of project  loan above Rs.1 crore and up to Rs. 2
        crore, credit guarantee cover will be 75% of bankable project loan with
        a maximum ceiling of Rs. 150 lakh. However, for project loan over Rs. 2
        crore of bankable projet loan, credit guarantee cover will be limited
        maximum upto Rs.2.0 crore only.  ii. ELI shall be eligible to seek
        Credit Guarantee Cover for a credit facility  sanctioned in respect of a
        single FPO borrower for a maximum of 2 times over a period of 5 years.  
        iii. In case of default, claims shall be settled up to 85% or 75 % of
        the amount in  default subject to maximum cover as specified above.  
        iv. Other charges such as penal interest, commitment charge, service
        charge, or  any other levies/ expenses, or any  costs whatsoever debited
        to the account of FPO by the ELI other than the contracted interest
        shall not qualify for Credit Guarantee Cover.  v. The Cover shall only
        be granted after the ELI enters into an agreement with  NABARD or NCDC,
        as the case may be, and shall be granted or delivered in accordance with
        the Terms and Conditions decided upon by NABARD or NCDC, as the case may
        be, from time to time.'
  - source_sentence: >-
      What is the requirement of Aadhaar for crop loan or Kisan Credit Card
      (KCC) under the Interest Subvention Scheme?
    sentences:
      - >-
        ' This consent of the beneficiary should be \'to agree that the
        department responsible for implementation of the Pradhan Mantri Kisan
        Samman Nidhi Yojana in Union Government or the State Government / Union
        Territory Administration can use the beneficiary Aadhaar number and
        other information provided in the declaration to verify the eligibility
        of the beneficiary for scheme as per extant scheme guidelines with the
        concerned agencies\'. 6.3  The existing land-ownership system in the
        concerned State / UT will be used for identification of beneficiaries.
        Accordingly, it is of utmost importance that the land records are clear
        and updated. Further, State / UT Governments would also expedite    the
        progress of digitization of the land records and linking the same with
        Aadhaar as well as bank details of the beneficiaries. 6.4  The lists of
        eligible beneficiaries would be published at the village level.
        Farmers'  families who are eligible but have been excluded should be
        provided an opportunity to represent their case.'
      - >-
        ' Date………………………………   ……………………………… Signature of Branch Manager with
        branch seal  Name…………………………………… … Designation ……………………………………
        ………………………………  ……………………………… Signature of Authorized Person in zonal
        office Name………………………………… Designation ……………………………………  5. Promoter's
        request letter  List of Enclosures  1. Recommendation  9. List of
        shareholders  addressed to the Bank Manager on original letter head of
        FPO  confirmed by promoter and bank  with amount of CGC  sought on
        Bank's  Original letterhead with date and dispatch number duly signed by
        the Branch Manager on each page.  2. Sanction letter of  6.
        Implementation Schedule  10. Affidavit of promoters that  confirmed by
        the bank.  they have not availed CGC  from any other institution for 
        sanctioned Credit Facility.  sanctioning authority  addressed to
        recommending  branch.  3. Bank's approved  7. Up-to-date statement of
        account of  11. Field inspection report of  Term loan and Cash Credit
        (if Sanctioned).  Bank official as on recent date.  Appraisal/Process
        note bearing signature of sanctioning authority.  4. Potential Impact
        on  8. a).Equity Certificate, C.A/CS  * Pin Code at Column No. 1. a), 
        certificate/RCS certificate  2. b), 2. c), 4. a) and 9. a) is Mandatory 
        b). FORM-2, FORM-5 and FORM-23  filed with ROC for Company/RCS.  small
        farmer producers  1. Social Impact,  2. Environmental  Impact  3.'
      - >-
        '6.3.1   Aadhaar has been made mandatory for availing Crop insurance
        from Kharif 2017 season onwards.    Therefore, all banks are advised to
        mandatorily obtain Aadhaar number of their farmers and the same   
        applies  for  non-loanee  farmers  enrolled  through  banks/Insurance 
        companies/insurance    intermediaries.  6.3.2   Farmers not having
        Aadhaar ID may also enrol under PMFBY subject to their enrolment for   
        Aadhaar and submission of proof of such enrolment as per notification
        No. 334.dated 8th February,    2017 issued by GOI under Section 7 of
        Aadhaar Act 2016(Targeted Delivery of Financial and other    Subsidies,
        Benefits and Services). Copy of the notification may be perused on
        www.pmfby.gov.in. This    may be  subject to further directions issued
        by Govt. from time to time.  6.3.3    All banks have to compulsorily
        take Aadhaar/Aadhaar enrolment number as per notification under  Aadhaar
        Act before sanction of crop loan/KCC under Interest Subvention Scheme.
        Hence the coverage    of loanee farmers without Aadhaar does not arise
        and such accounts need to be reviewed by the    concerned bank branch
        regularly.'
model-index:
  - name: SentenceTransformer based on BAAI/bge-small-en-v1.5
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: val evaluator
          type: val_evaluator
        metrics:
          - type: cosine_accuracy@1
            value: 0.51
            name: Cosine Accuracy@1
          - type: cosine_accuracy@5
            value: 0.89
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.93
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.51
            name: Cosine Precision@1
          - type: cosine_precision@5
            value: 0.17799999999999996
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.09299999999999997
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.51
            name: Cosine Recall@1
          - type: cosine_recall@5
            value: 0.89
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.93
            name: Cosine Recall@10
          - type: cosine_ndcg@5
            value: 0.7199414259514458
            name: Cosine Ndcg@5
          - type: cosine_ndcg@10
            value: 0.7332305184181033
            name: Cosine Ndcg@10
          - type: cosine_ndcg@100
            value: 0.7506961623500871
            name: Cosine Ndcg@100
          - type: cosine_mrr@5
            value: 0.6626666666666666
            name: Cosine Mrr@5
          - type: cosine_mrr@10
            value: 0.6683611111111114
            name: Cosine Mrr@10
          - type: cosine_mrr@100
            value: 0.6730785505344332
            name: Cosine Mrr@100
          - type: cosine_map@100
            value: 0.6730785505344328
            name: Cosine Map@100
          - type: dot_accuracy@1
            value: 0.51
            name: Dot Accuracy@1
          - type: dot_accuracy@5
            value: 0.89
            name: Dot Accuracy@5
          - type: dot_accuracy@10
            value: 0.93
            name: Dot Accuracy@10
          - type: dot_precision@1
            value: 0.51
            name: Dot Precision@1
          - type: dot_precision@5
            value: 0.17799999999999996
            name: Dot Precision@5
          - type: dot_precision@10
            value: 0.09299999999999997
            name: Dot Precision@10
          - type: dot_recall@1
            value: 0.51
            name: Dot Recall@1
          - type: dot_recall@5
            value: 0.89
            name: Dot Recall@5
          - type: dot_recall@10
            value: 0.93
            name: Dot Recall@10
          - type: dot_ndcg@5
            value: 0.7199414259514458
            name: Dot Ndcg@5
          - type: dot_ndcg@10
            value: 0.7332305184181033
            name: Dot Ndcg@10
          - type: dot_ndcg@100
            value: 0.7506961623500871
            name: Dot Ndcg@100
          - type: dot_mrr@5
            value: 0.6626666666666666
            name: Dot Mrr@5
          - type: dot_mrr@10
            value: 0.6683611111111114
            name: Dot Mrr@10
          - type: dot_mrr@100
            value: 0.6730785505344332
            name: Dot Mrr@100
          - type: dot_map@100
            value: 0.6730785505344328
            name: Dot Map@100

SentenceTransformer based on BAAI/bge-small-en-v1.5

This is a sentence-transformers model finetuned from BAAI/bge-small-en-v1.5. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-small-en-v1.5
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 384 tokens
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("smokxy/embedding_finetuned")
# Run inference
sentences = [
    'What is the requirement of Aadhaar for crop loan or Kisan Credit Card (KCC) under the Interest Subvention Scheme?',
    "'6.3.1   Aadhaar has been made mandatory for availing Crop insurance from Kharif 2017 season onwards.    Therefore, all banks are advised to mandatorily obtain Aadhaar number of their farmers and the same    applies  for  non-loanee  farmers  enrolled  through  banks/Insurance  companies/insurance    intermediaries.  6.3.2   Farmers not having Aadhaar ID may also enrol under PMFBY subject to their enrolment for    Aadhaar and submission of proof of such enrolment as per notification No. 334.dated 8th February,    2017 issued by GOI under Section 7 of Aadhaar Act 2016(Targeted Delivery of Financial and other    Subsidies, Benefits and Services). Copy of the notification may be perused on www.pmfby.gov.in. This    may be  subject to further directions issued by Govt. from time to time.  6.3.3    All banks have to compulsorily take Aadhaar/Aadhaar enrolment number as per notification under  Aadhaar Act before sanction of crop loan/KCC under Interest Subvention Scheme. Hence the coverage    of loanee farmers without Aadhaar does not arise and such accounts need to be reviewed by the    concerned bank branch regularly.'",
    "' Date………………………………   ……………………………… Signature of Branch Manager with branch seal  Name…………………………………… … Designation …………………………………… ………………………………  ……………………………… Signature of Authorized Person in zonal office Name………………………………… Designation ……………………………………  5. Promoter's request letter  List of Enclosures  1. Recommendation  9. List of shareholders  addressed to the Bank Manager on original letter head of FPO  confirmed by promoter and bank  with amount of CGC  sought on Bank's  Original letterhead with date and dispatch number duly signed by the Branch Manager on each page.  2. Sanction letter of  6. Implementation Schedule  10. Affidavit of promoters that  confirmed by the bank.  they have not availed CGC  from any other institution for  sanctioned Credit Facility.  sanctioning authority  addressed to recommending  branch.  3. Bank's approved  7. Up-to-date statement of account of  11. Field inspection report of  Term loan and Cash Credit (if Sanctioned).  Bank official as on recent date.  Appraisal/Process note bearing signature of sanctioning authority.  4. Potential Impact on  8. a).Equity Certificate, C.A/CS  * Pin Code at Column No. 1. a),  certificate/RCS certificate  2. b), 2. c), 4. a) and 9. a) is Mandatory  b). FORM-2, FORM-5 and FORM-23  filed with ROC for Company/RCS.  small farmer producers  1. Social Impact,  2. Environmental  Impact  3.'",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.51
cosine_accuracy@5 0.89
cosine_accuracy@10 0.93
cosine_precision@1 0.51
cosine_precision@5 0.178
cosine_precision@10 0.093
cosine_recall@1 0.51
cosine_recall@5 0.89
cosine_recall@10 0.93
cosine_ndcg@5 0.7199
cosine_ndcg@10 0.7332
cosine_ndcg@100 0.7507
cosine_mrr@5 0.6627
cosine_mrr@10 0.6684
cosine_mrr@100 0.6731
cosine_map@100 0.6731
dot_accuracy@1 0.51
dot_accuracy@5 0.89
dot_accuracy@10 0.93
dot_precision@1 0.51
dot_precision@5 0.178
dot_precision@10 0.093
dot_recall@1 0.51
dot_recall@5 0.89
dot_recall@10 0.93
dot_ndcg@5 0.7199
dot_ndcg@10 0.7332
dot_ndcg@100 0.7507
dot_mrr@5 0.6627
dot_mrr@10 0.6684
dot_mrr@100 0.6731
dot_map@100 0.6731

Training Details

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • gradient_accumulation_steps: 4
  • learning_rate: 1e-05
  • weight_decay: 0.01
  • num_train_epochs: 1.0
  • warmup_ratio: 0.1
  • load_best_model_at_end: True

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 8
  • per_device_eval_batch_size: 8
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 4
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 1e-05
  • weight_decay: 0.01
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 1.0
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: True
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • eval_use_gather_object: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss loss val_evaluator_cosine_map@100
0.531 15 0.5565 0.0661 0.6731
0.9912 28 - 0.0661 0.6731
  • The bold row denotes the saved checkpoint.

Framework Versions

  • Python: 3.10.14
  • Sentence Transformers: 3.0.1
  • Transformers: 4.43.4
  • PyTorch: 2.4.0+cu121
  • Accelerate: 0.33.0
  • Datasets: 2.21.0
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

GISTEmbedLoss

@misc{solatorio2024gistembed,
    title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning}, 
    author={Aivin V. Solatorio},
    year={2024},
    eprint={2402.16829},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}