Upload folder using huggingface_hub
- .gitattributes +2 -0
- LICENSE.md +114 -0
- Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf +3 -0
- Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf +3 -0
- NOTICE +1 -0
- README.md +108 -0
- USE POLICY.md +14 -0
 
    	
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
    	
LICENSE.md ADDED
@@ -0,0 +1,114 @@
+LLAMA 3.1 COMMUNITY LICENSE AGREEMENT
+Llama 3.1 Version Release Date: July 23, 2024
+
+“Agreement” means the terms and conditions for use, reproduction, distribution and modification of the
+Llama Materials set forth herein.
+
+“Documentation” means the specifications, manuals and documentation accompanying Llama 3.1
+distributed by Meta at https://llama.meta.com/doc/overview.
+
+“Licensee” or “you” means you, or your employer or any other person or entity (if you are entering into
+this Agreement on such person or entity’s behalf), of the age required under applicable laws, rules or
+regulations to provide legal consent and that has legal authority to bind your employer or such other
+person or entity if you are entering in this Agreement on their behalf.
+
+“Llama 3.1” means the foundational large language models and software and algorithms, including
+machine-learning model code, trained model weights, inference-enabling code, training-enabling code,
+fine-tuning enabling code and other elements of the foregoing distributed by Meta at
+https://llama.meta.com/llama-downloads.
+
+“Llama Materials” means, collectively, Meta’s proprietary Llama 3.1 and Documentation (and any
+portion thereof) made available under this Agreement.
+
+“Meta” or “we” means Meta Platforms Ireland Limited (if you are located in or, if you are an entity, your
+principal place of business is in the EEA or Switzerland) and Meta Platforms, Inc. (if you are located
+outside of the EEA or Switzerland).
+
+By clicking “I Accept” below or by using or distributing any portion or element of the Llama Materials,
+you agree to be bound by this Agreement.
+
+1. License Rights and Redistribution.
+
+  a. Grant of Rights. You are granted a non-exclusive, worldwide, non-transferable and royalty-free
+limited license under Meta’s intellectual property or other rights owned by Meta embodied in the Llama
+Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the
+Llama Materials.
+
+  b. Redistribution and Use.
+
+      i. If you distribute or make available the Llama Materials (or any derivative works
+thereof), or a product or service (including another AI model) that contains any of them, you shall (A)
+provide a copy of this Agreement with any such Llama Materials; and (B) prominently display “Built with
+Llama” on a related website, user interface, blogpost, about page, or product documentation. If you use
+the Llama Materials or any outputs or results of the Llama Materials to create, train, fine tune, or
+otherwise improve an AI model, which is distributed or made available, you shall also include “Llama” at
+the beginning of any such AI model name.
+
+      ii. If you receive Llama Materials, or any derivative works thereof, from a Licensee as part
+of an integrated end user product, then Section 2 of this Agreement will not apply to you.
+
+      iii. You must retain in all copies of the Llama Materials that you distribute the following
+attribution notice within a “Notice” text file distributed as a part of such copies: “Llama 3.1 is
+licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights
+Reserved.”
+
+      iv. Your use of the Llama Materials must comply with applicable laws and regulations
+(including trade compliance laws and regulations) and adhere to the Acceptable Use Policy for the Llama
+Materials (available at https://llama.meta.com/llama3_1/use-policy), which is hereby incorporated by
+reference into this Agreement.
+
+2. Additional Commercial Terms. If, on the Llama 3.1 version release date, the monthly active users
+of the products or services made available by or for Licensee, or Licensee’s affiliates, is greater than 700
+million monthly active users in the preceding calendar month, you must request a license from Meta,
+which Meta may grant to you in its sole discretion, and you are not authorized to exercise any of the
+rights under this Agreement unless or until Meta otherwise expressly grants you such rights.
+
+3. Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE LLAMA MATERIALS AND ANY
+OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES OF
+ANY KIND, AND META DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED,
+INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT,
+MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR
+DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE LLAMA MATERIALS AND
+ASSUME ANY RISKS ASSOCIATED WITH YOUR USE OF THE LLAMA MATERIALS AND ANY OUTPUT AND
+RESULTS.
+
+4. Limitation of Liability. IN NO EVENT WILL META OR ITS AFFILIATES BE LIABLE UNDER ANY THEORY OF
+LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING
+OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, SPECIAL, CONSEQUENTIAL,
+INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF META OR ITS AFFILIATES HAVE BEEN ADVISED
+OF THE POSSIBILITY OF ANY OF THE FOREGOING.
+
+5. Intellectual Property.
+
+  a. No trademark licenses are granted under this Agreement, and in connection with the Llama
+Materials, neither Meta nor Licensee may use any name or mark owned by or associated with the other
+or any of its affiliates, except as required for reasonable and customary use in describing and
+redistributing the Llama Materials or as set forth in this Section 5(a). Meta hereby grants you a license to
+use “Llama” (the “Mark”) solely as required to comply with the last sentence of Section 1.b.i. You will
+comply with Meta’s brand guidelines (currently accessible at
+https://about.meta.com/brand/resources/meta/company-brand/ ). All goodwill arising out of your use
+of the Mark will inure to the benefit of Meta.
+
+  b. Subject to Meta’s ownership of Llama Materials and derivatives made by or for Meta, with
+respect to any derivative works and modifications of the Llama Materials that are made by you, as
+between you and Meta, you are and will be the owner of such derivative works and modifications.
+
+  c. If you institute litigation or other proceedings against Meta or any entity (including a
+cross-claim or counterclaim in a lawsuit) alleging that the Llama Materials or Llama 3.1 outputs or
+results, or any portion of any of the foregoing, constitutes infringement of intellectual property or other
+rights owned or licensable by you, then any licenses granted to you under this Agreement shall
+terminate as of the date such litigation or claim is filed or instituted. You will indemnify and hold
+harmless Meta from and against any claim by any third party arising out of or related to your use or
+distribution of the Llama Materials.
+
+6. Term and Termination. The term of this Agreement will commence upon your acceptance of this
+Agreement or access to the Llama Materials and will continue in full force and effect until terminated in
+accordance with the terms and conditions herein. Meta may terminate this Agreement if you are in
+breach of any term or condition of this Agreement. Upon termination of this Agreement, you shall delete
+and cease use of the Llama Materials. Sections 3, 4 and 7 shall survive the termination of this
+Agreement.
+
+7. Governing Law and Jurisdiction. This Agreement will be governed and construed under the laws of
+the State of California without regard to choice of law principles, and the UN Convention on Contracts
+for the International Sale of Goods does not apply to this Agreement. The courts of California shall have
+exclusive jurisdiction of any dispute arising out of this Agreement.
    	
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:399b08c30a0db957a3f7fb5711022af1375ea7c41729703d85f0117e945ead0e
+size 5733001536
    	
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:77c6d3b8006cd743e82723448928c7931f99f902497fe03ffcc00066ded6d8ca
+size 8540790016
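
Review note: the two `.gguf` entries in this commit are Git LFS pointer files, not the weights themselves. Each pointer records the SHA-256 digest and byte size of the real file. A minimal sketch of checking a downloaded GGUF against its pointer follows; the helper names `parse_lfs_pointer` and `verify_download` are illustrative and not part of this repository or of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_download(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file matches the pointer's size and digest."""
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first: avoids hashing a truncated file
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        # Hash in chunks so multi-gigabyte files are never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer content copied from the Q8_0 entry in this commit.
POINTER_Q8_0 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:77c6d3b8006cd743e82723448928c7931f99f902497fe03ffcc00066ded6d8ca
size 8540790016
"""

pointer = parse_lfs_pointer(POINTER_Q8_0)
print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `verify_download(Path("Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf"), pointer)` would report whether the download is complete and uncorrupted; the local path here is an assumption about where the file was saved.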
    	
NOTICE ADDED
@@ -0,0 +1 @@
+Llama 3.1 is licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.
    	
README.md ADDED
@@ -0,0 +1,108 @@
+---
+language: [pl]
+license: llama3.1
+pipeline_tag: text-generation
+library_name: llama.cpp
+tags:
+  - gguf
+  - quantized
+  - q8_0
+  - f16
+base_model: ARTEXIT/Llama-PLLuM-8B-instruct-ArtexIT-reasoning
+base_model_relation: quantized
+quantization:
+  - Q8_0
+  - Q5_K_M
+---
+
+# Llama-PLLuM-8B-instruct-ArtexIT-reasoning
+
+**Built with Llama**
+
+This repository contains a GRPO fine‑tune of `CYFRAGOVPL/Llama-PLLuM-8B-instruct` trained on **GSM8K** (MIT).
+We publish both **Hugging Face (safetensors)** and **GGUF** artifacts (Q8_0, Q5_K_M) for use with `llama.cpp`.
+
+
+## What is this?
+- **Base**: Meta Llama 3.1 → PLLuM 8B Instruct (Polish) → GRPO fine‑tune (math / word problems).
+- **Context**: ~131k (based on GGUF header).
+- **Message format**: Llama `[INST] ... [/INST]` + explicit reasoning / answer tags (see below).
+- **Default chat template**: The tokenizer includes a default system instruction enforcing the two‑block format.
+
+
+## Prompt format
+
+The model expects Llama chat formatting and supports explicit tags:
+
+- **Reasoning**: `<think> ... </think>`
+- **Final answer**: `<answer> ... </answer>`
+
+**Example**
+```text
+[INST] Rozwiąż: 12 * 13 = ? [/INST]
+<think>12*13 = 156.</think>
+<answer>156</answer>
+```
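
Review note: since the README defines a two-block output (`<think>` followed by `<answer>`), downstream code will usually need to separate the two. A minimal sketch of one way to do that with regular expressions follows; `ParsedReply` and `parse_reply` are hypothetical names introduced here, and the sketch assumes the tags appear literally in the decoded text as in the README example.

```python
import re
from typing import NamedTuple, Optional


class ParsedReply(NamedTuple):
    reasoning: Optional[str]  # text inside <think>...</think>, if present
    answer: Optional[str]     # text inside <answer>...</answer>, if present


# DOTALL lets the reasoning or the answer span several lines.
_THINK = re.compile(r"<think>(.*?)</think>", re.DOTALL)
_ANSWER = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def parse_reply(text: str) -> ParsedReply:
    """Extract the first reasoning block and the first answer block."""
    think = _THINK.search(text)
    answer = _ANSWER.search(text)
    return ParsedReply(
        reasoning=think.group(1).strip() if think else None,
        answer=answer.group(1).strip() if answer else None,
    )


# The completion from the README example above.
reply = "<think>12*13 = 156.</think>\n<answer>156</answer>"
parsed = parse_reply(reply)
print(parsed.answer)  # 156
```

Returning `None` for a missing block, rather than raising, lets the caller decide how to handle a completion that was cut off before the closing tag.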
+
+## Quickstart
+
+### Transformers (PyTorch)
+
+```python
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+repo = "ARTEXIT/Llama-PLLuM-8B-instruct-ArtexIT-reasoning"
+tok = AutoTokenizer.from_pretrained(repo, use_fast=True)
+model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype="auto", device_map="auto")
+
+prompt = tok.apply_chat_template(
+    [{"role": "user", "content": "Podaj 3 miasta w Polsce."}],
+    add_generation_prompt=True,
+    tokenize=False,
+)
+inputs = tok(prompt, return_tensors="pt").to(model.device)
+out = model.generate(**inputs, max_new_tokens=64)
+print(tok.decode(out[0], skip_special_tokens=False))
+```
+
+
+## Training (brief)
+
+- **Method**: GRPO (policy‑gradient reinforcement learning with multiple reward functions).
+- **Data**: `openai/gsm8k` (license: **MIT**).
+- **Goal**: consistent two‑block outputs (reasoning + final answer) using the training tags.
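
Review note: the README mentions "multiple reward functions" without listing them. Purely as an illustration of what such rewards could look like for the stated goal, the sketch below scores a completion on format (exactly one `<think>` block followed by one `<answer>` block) and on answer correctness. The function names, the weighting, and the assumption that the reference is the bare final number are all hypothetical; none of this is taken from the authors' training code.

```python
import re

# One reasoning block followed by one answer block, nothing else around them.
TWO_BLOCK = re.compile(
    r"\s*<think>.+?</think>\s*<answer>.+?</answer>\s*", re.DOTALL
)
ANSWER = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)
TAGS = ("<think>", "</think>", "<answer>", "</answer>")


def format_reward(completion: str) -> float:
    """1.0 if the completion is exactly one think block then one answer block."""
    if any(completion.count(tag) != 1 for tag in TAGS):
        return 0.0  # repeated or missing tags break the two-block format
    return 1.0 if TWO_BLOCK.fullmatch(completion) else 0.0


def normalize(number_text: str) -> str:
    """Strip whitespace and thousands separators before comparing answers."""
    return number_text.strip().replace(",", "")


def correctness_reward(completion: str, reference: str) -> float:
    """1.0 if the extracted answer equals the reference after normalization."""
    match = ANSWER.search(completion)
    if match is None:
        return 0.0
    return 1.0 if normalize(match.group(1)) == normalize(reference) else 0.0


def total_reward(completion: str, reference: str, format_weight: float = 0.5) -> float:
    """Weighted sum of the two rewards; the weight is an arbitrary choice."""
    return format_weight * format_reward(completion) + correctness_reward(
        completion, reference
    )


good = "<think>12*13 = 156.</think>\n<answer>156</answer>"
print(total_reward(good, "156"))  # 1.5
```

Keeping the format and correctness signals as separate functions makes it possible to inspect or reweight each one independently during training.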
+
+
+## License & Attribution
+
+This repository contains derivatives of **Llama 3.1** and **PLLuM**:
+
+- **Llama 3.1 Community License** applies. When redistributing, you must:
+  - include a copy of the license and **prominently display “Built with Llama”**,
+  - include **“Llama” at the beginning of any distributed model’s name** if it was created, trained or fine‑tuned using Llama materials,
+  - keep a **NOTICE** file with the following line:
+    `Llama 3.1 is licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.`
+  - comply with the **Acceptable Use Policy (AUP)**.
+- **PLLuM**: please cite the PLLuM work (see **Citation** below).
+- **Data**: GSM8K is MIT‑licensed; include dataset attribution.
+
+This repo includes:
+- `LICENSE.md`: full text of the **Llama 3.1 Community License**
+- `USE POLICY.md`: pointer to the official **Acceptable Use Policy**
+- `NOTICE`: required Llama attribution line
+
+> If your (or your affiliates’) products exceeded **700M monthly active users** on the Llama 3.1 release date, you must obtain a separate license from Meta before exercising the rights in the Llama 3.1 license.
+
+
+## Citation
+
+If you use PLLuM in research or deployments, please cite:
+
+```bibtex
+@unpublished{pllum2025,
+    title={PLLuM: A Family of Polish Large Language Models},
+    author={PLLuM Consortium},
+    year={2025}
+}
+```
    	
USE POLICY.md ADDED
@@ -0,0 +1,14 @@
+# Llama 3.1 Acceptable Use Policy (AUP)
+
+This repository distributes a model derived from Llama 3.1. By accessing or using this model, you agree to the Llama 3.1 Acceptable Use Policy.
+
+**The most recent, authoritative copy of the AUP is maintained by Meta at:**
+https://llama.meta.com/llama3_1/use-policy
+
+For convenience only (non-exhaustive summary), the AUP requires responsible and lawful use and prohibits, among other things, uses that:
+- Violate laws or regulations;
+- Exploit, harm, or endanger people (including harassment, discrimination, or incitement to violence);
+- Infringe privacy or intellectual property rights;
+- Facilitate creation or distribution of malicious code or high-risk illegal activities.
+
+If this summary conflicts with the official AUP, **the official AUP controls**. Please read the full AUP at the link above before using the model.