Narkantak commited on Apr 2

Commit

d034c21

•

1 Parent(s): cef7e76

Narkantak/mistral-7b-Intent-Classifier-Ashu

Browse files

Files changed (19) hide show

README.md +42 -12
adapter_model.safetensors +1 -1
runs/Apr02_07-24-52_0ecfc053b35f/events.out.tfevents.1712042696.0ecfc053b35f.34.0 +3 -0
test.csv +75 -33
test.parquet +2 -2
train.csv +0 -0
train.parquet +2 -2
training_args.bin +1 -1
wandb/debug-internal.log +0 -0
wandb/debug.log +41 -66
wandb/run-20240402_072503-b95w935d/files/conda-environment.yaml +1072 -0
wandb/run-20240402_072503-b95w935d/files/config.yaml +724 -0
wandb/run-20240402_072503-b95w935d/files/output.log +57 -0
wandb/run-20240402_072503-b95w935d/files/requirements.txt +882 -0
wandb/run-20240402_072503-b95w935d/files/wandb-metadata.json +66 -0
wandb/run-20240402_072503-b95w935d/files/wandb-summary.json +1 -0
wandb/run-20240402_072503-b95w935d/logs/debug-internal.log +0 -0
wandb/run-20240402_072503-b95w935d/logs/debug.log +43 -0
wandb/run-20240402_072503-b95w935d/run-b95w935d.wandb +0 -0

README.md CHANGED Viewed

@@ -16,12 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 1.0970
-- eval_runtime: 1.8296
-- eval_samples_per_second: 1.093
-- eval_steps_per_second: 0.547
-- epoch: 12.0
-- step: 16
 ## Model description
@@ -41,17 +36,52 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0002
-- train_batch_size: 5
-- eval_batch_size: 5
 - seed: 42
-- gradient_accumulation_steps: 3
-- total_train_batch_size: 15
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1
-- num_epochs: 50
 - mixed_precision_training: Native AMP
 ### Framework versions
 - PEFT 0.10.0

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5491
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0002
+- train_batch_size: 6
+- eval_batch_size: 6
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 24
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 2
+- num_epochs: 30
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 3.8811        | 0.96  | 12   | 2.3732          |
+| 1.3483        | 2.0   | 25   | 0.7815          |
+| 0.7479        | 2.96  | 37   | 0.6088          |
+| 0.5459        | 4.0   | 50   | 0.5306          |
+| 0.5392        | 4.96  | 62   | 0.5074          |
+| 0.4672        | 6.0   | 75   | 0.4910          |
+| 0.4799        | 6.96  | 87   | 0.4779          |
+| 0.4205        | 8.0   | 100  | 0.4742          |
+| 0.4357        | 8.96  | 112  | 0.4721          |
+| 0.3848        | 10.0  | 125  | 0.4694          |
+| 0.4006        | 10.96 | 137  | 0.4703          |
+| 0.3579        | 12.0  | 150  | 0.4736          |
+| 0.3768        | 12.96 | 162  | 0.4850          |
+| 0.3373        | 14.0  | 175  | 0.4939          |
+| 0.3582        | 14.96 | 187  | 0.4883          |
+| 0.3239        | 16.0  | 200  | 0.4974          |
+| 0.3445        | 16.96 | 212  | 0.5045          |
+| 0.3129        | 18.0  | 225  | 0.5131          |
+| 0.3362        | 18.96 | 237  | 0.5189          |
+| 0.3075        | 20.0  | 250  | 0.5216          |
+| 0.3302        | 20.96 | 262  | 0.5277          |
+| 0.3028        | 22.0  | 275  | 0.5392          |
+| 0.3268        | 22.96 | 287  | 0.5363          |
+| 0.3006        | 24.0  | 300  | 0.5420          |
+| 0.3249        | 24.96 | 312  | 0.5422          |
+| 0.2994        | 26.0  | 325  | 0.5437          |
+| 0.3236        | 26.96 | 337  | 0.5464          |
+| 0.2979        | 28.0  | 350  | 0.5489          |
+| 0.3094        | 28.8  | 360  | 0.5491          |
 ### Framework versions
 - PEFT 0.10.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0bb4ad88bb45adc123b696e91c70736275c8d379919cb30b6f99cb1320a0d768
 size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:1fd9f83307a578ba77bc5360a29cd2aa2446c34c2de0de3756cd9007e24089ec
 size 8397056

runs/Apr02_07-24-52_0ecfc053b35f/events.out.tfevents.1712042696.0ecfc053b35f.34.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8721f24834dcf69bc4975ef1c816ec43becbb8c7b69d27597361091b7ee237a1
+size 19480

test.csv CHANGED Viewed

@@ -1,34 +1,76 @@
 example
-"[INST] Orion, Act as a financial analyst with a photographic memory and respond to the user query based on the type of document type. \
-Orion is known for its best precision and world class accuracy with the financial knowledge, \
-Document Type: Document Type 7
-, context associated to the document type is
-No Information Available [/INST] "
-"[INST] Orion, Act as a financial analyst with a photographic memory and respond to the user query based on the type of document type. \
-Orion is known for its best precision and world class accuracy with the financial knowledge, \
-Document Type: Document Type 1
-, context associated to the document type is
-(DT - OMB control number: 1545-0047/Expiration Date: 12/31/2019)(DOL - OMB control number: 1210-0147/Expiration date: 5/31/2022) Page 1 of 5
-(HHS - OMB control number: 0938-1146/Expiration date: 10/31/2022)
-Summary of Benefits and Coverage: What this Plan Covers & What You Pay for Covered Services Coverage Period: Beginning on or after 01/01/2021
-Community Blue â Faculty/Acad Active: Michigan State University Coverage for: Individual/Family | PlanType:PPO
-The Summary of Benefits and Coverage (SBC) document will help you choose a health plan. The SBC shows you how you and the plan would share the cost for covered health care services. NOTE: Information about the cost of this plan (called the premium) will be provided separately. This is only a summary. For more information about your coverage, or to get a copy of the complete terms of coverage, visit www.bcbsm.com or call 888-288-1726. For general definitions of common terms, such as allowed amount, balance billing, coinsurance, copayment, deductible, provider, or other underlined terms, see the Glossary. You can view the Glossary at https://www.healthcare.gov/sbc-glossary or call 888-288-1726 to request a copy.
-Important Questions Answers Why This Matters: In-Network Out-of-Network
-What is the overall deductible?
-$100 Individual/ $200 Family
-$250 Individual/
-$500 Family
-Generally, you must pay all of the costs from provider's up to the deductible amount before this plan begins to pay. If you have other family members on the plan, each family member must meet their own individual deductible until the total amount of deductible expenses paid by all family members meets the overall family deductible. Are there services covered before you meet your deductible? Yes This plan covers some items and services even if you havenât yet met the deductible amount. But a copayment or coinsurance may apply. For example, this plan covers certain preventive services without cost-sharing and before you meet your deductible. See a list of covered preventive services at https://www.healthcare.gov/coverage/preventive-care-benefits/
-Are there other deductibles for specific services?
-No
-You donât have to meet deductibles for specific services. What is the out-of-pocket limit for this plan? $2,000 Individual/ $4,000 Family $2,000 Individual/ $4,000 Family Plus deductible The out-of-pocket limit is the most you could pay in a year for covered services. If you have other family members in this plan, they have to meet their own out-of-pocket limits until the overall family out-of-pocket limit has been met.
-What is not included in the out-of-pocket limit?
-Premiums, balance-billed charges, any pharmacy penalty and health care this plan doesnât cover.
-Even though you pay these expenses, they donât count toward the out-of-pocket limit. Will you pay less if you use a network provider? Yes. For a list of in-network providers, see www.bcbsm.com or call the number on the back of your BCBSM ID card. This plan uses a provider network. You will pay less if you use a provider in the planâs network. You will pay the most if you use an out-of-network provider, and you might receive a bill from a provider for the difference between the providerâs charge and what your plan pays (balance billing).
-Do you need a referral to see a specialist?
-No
-You can see the specialist you choose without a referral. [/INST] "

 example
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for home health care? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Upgrade 'HIJ's employment status to full-time within 'Nu Assembly.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What are the services the plan does not cover? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Add a dependent to the insurance policy for 'Beta Group.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Plant a new signee within the 'Whiskey Command' organization. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the role of the Glossary in understanding the health plan? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Conduct the removal of an employee from the 'Zeta Coalition.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Transition 'ABC' from contingent worker to full-time staff in 'Psi Division.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Add a new born baby to group 'ABC' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : How does my compensation compare to the industry standard for my role? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : tell me my commission for year 'yyyy'? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Process a job status upgrade for 'PQR' to a full-time position in 'Mu Squad.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for home health care? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for home health care? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the overall deductible for in-network and out-of-network services in this health plan? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Will you pay less if you use a network provider? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Enroll a new apprentice into the 'Kilo Platoon' corps. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the percentage of cost covered for generic drugs? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Insert a new child under 'STU's medical coverage in 'Nu Division.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost of emergency room care? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Slot in a new addition to the 'Mike Regiment' lineup. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost of emergency room care? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Entrench a new specialist in the 'Delta Formation' echelon. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What are the specifics of my health and wellness benefits for this year? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the percentage of cost covered for preferred brand drugs? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : tell me my commission for year 2023? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost of emergency medical attention? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Is adult dental care covered? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What are the services the plan does not cover? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Attach a domestic partner to 'CDE's account in 'Iota Brigade.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Adjoin a new partner to the 'Hotel Taskforce' cluster. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the percentage of cost covered for generic drugs? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Extend full-time employment benefits to 'LMN' in the 'Pi Network.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for childbirth/delivery facility services? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Is long-term care covered? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the total cost Peg would pay for having a baby? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What's the total payout I received for hitting my annual targets? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the coverage period? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Can you provide a summary of my earnings from all sources for the past year? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Coordinate the exit of a staff member from the 'Pi Specialists.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What proportion of the costs for emergency medical attention does the plan cover? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What amount did I earn in referral bonuses for the past six months? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the total value of my compensation including benefits for the current year? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Can you provide a history of my commission rates over the past five years? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What are the costs associated with a hospital stay? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for generic drugs? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Conduct the end of service for a staffer in the 'Pi Consortium.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What costs are not included in the out-of-pocket limit? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Are children's eye exams covered? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the plan type? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Carry out the dismissal of a member from the 'Iota Faction.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is not included in the out-of-pocket limit? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC/Plan Doc</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the OMB control number and expiration date for HHS? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : How much does outpatient surgery cost? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost of prescription drug coverage? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Add a dependant to the 'Lambda Platoon' insurance policy for 'MNO.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the purpose of the Summary of Benefits and Coverage (SBC) document? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Are children's glasses covered? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Implement a marital status update for 'PQR' in the 'Chi Team' records. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Arrange for the departure of a contributor from the 'Theta Battalion.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Can you check if my compensation for overtime hours has been calculated correctly? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Fasten a new recruit to the 'Papa Command' company. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Amend the job classification from part-time to full-time for 'QRS' in 'Phi Troop.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Change</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for diagnostic tests such as x-rays and blood work? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What are the details of my incentive plan for the new business targets? [/INST] Hi! Centaurus at your service, here is the intent of your statement: Compensation</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for childbirth/delivery professional services? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Debut a new staff member in the 'November Coalition' alliance. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Are there other deductibles for specific services? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Knit a new participant into the 'Sierra Brigade' fabric. [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Are there any services that the plan does not cover? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the cost for rehabilitation services? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Organize the dismissal of a staffer from the 'Chi Network.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Finalize the termination of a signee from the 'Psi Brigade.' [/INST] Hi! Centaurus at your service, here is the intent of your statement: Terminate</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : Add a new member  [/INST] Hi! Centaurus at your service, here is the intent of your statement: Install</s>"
+"<s>[INST] Your Name is Centaurus, Start the response as, 'Hi! Centaurus at you service, here is ...' , Centaurus you are a intent classification wizard which can classify the intent of a statement from one among these: Compensation, SBC, SBC/Plan Docs, Install, Terminate, Change), Respond with intent of the statement : What is the overall deductible for this plan? [/INST] Hi! Centaurus at your service, here is the intent of your statement: SBC</s>"

test.parquet CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b71e3328ddf3c264c43471b314f5f9dd852688fedfae9484b9ee9f836b062f28
-size 16156

 version https://git-lfs.github.com/spec/v1
+oid sha256:10c069c6b3be0d3fd94e8c4783ff1e7003714427bc4774ec4065577066633014
+size 9179

train.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

train.parquet CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3787510ef2fd47560b05aa86e99d6eb5c174a1b8d7ac31670e8fc45bf8fb5c39
-size 7445

 version https://git-lfs.github.com/spec/v1
+oid sha256:f7daa5400b6b559de7964271f2c4744aa2bae329317741c2a9e4be45462433c2
+size 22135

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10ec4641923a16c59aa087019cc91de846e9c70e43dec12655934e34195ad3fe
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:635892b8c2d90d49a146c0560d6d090ae50b7a606786fbe44ed5cad3448ca17e
 size 4920

wandb/debug-internal.log CHANGED Viewed

The diff for this file is too large to render. See raw diff

wandb/debug.log CHANGED Viewed

@@ -1,68 +1,43 @@
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240401_093743-ie4jv7u5/logs/debug.log
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240401_093743-ie4jv7u5/logs/debug-internal.log
-2024-04-01 09:37:43,078 INFO    MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7e30ad2cb760>
-2024-04-01 09:37:43,079 INFO    MainThread:34 [wandb_init.py:init():566] calling init triggers
-2024-04-01 09:37:43,079 INFO    MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
 config: {}
-2024-04-01 09:37:43,079 INFO    MainThread:34 [wandb_init.py:init():616] starting backend
-2024-04-01 09:37:43,079 INFO    MainThread:34 [wandb_init.py:init():620] setting up manager
-2024-04-01 09:37:43,081 INFO    MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
-2024-04-01 09:37:43,085 INFO    MainThread:34 [wandb_init.py:init():628] backend started and connected
-2024-04-01 09:37:43,097 INFO    MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
-2024-04-01 09:37:43,596 INFO    MainThread:34 [wandb_init.py:init():720] updated telemetry
-2024-04-01 09:37:43,599 INFO    MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
-2024-04-01 09:37:43,765 INFO    MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
-2024-04-01 09:37:43,855 INFO    MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.5 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
-2024-04-01 09:37:43,855 INFO    MainThread:34 [wandb_init.py:init():804] starting run threads in backend
-2024-04-01 09:38:14,888 INFO    MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
-2024-04-01 09:38:14,889 INFO    MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
-2024-04-01 09:38:14,889 INFO    MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
-2024-04-01 09:38:14,889 INFO    MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
-2024-04-01 09:38:14,891 INFO    MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
-2024-04-01 09:38:14,896 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 30, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr01_09-37-28_148e249fd8ed', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
-2024-04-01 09:38:55,054 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:38:55,054 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:39:21,055 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:39:21,061 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:39:21,061 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:39:21,457 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:39:22,007 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr01_09-39-21_148e249fd8ed', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
-2024-04-01 09:39:59,405 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:39:59,405 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:00,163 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:00,169 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:41:00,169 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:07,654 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:08,198 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 2, 'per_device_eval_batch_size': 2, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr01_09-41-00_148e249fd8ed', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
-2024-04-01 09:41:12,441 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:41:12,441 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:30,402 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:30,410 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:41:30,410 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:30,806 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:31,300 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 2, 'per_device_eval_batch_size': 2, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 3, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 1, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr01_09-41-30_148e249fd8ed', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
-2024-04-01 09:41:35,576 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:41:35,576 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:45,080 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:45,084 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:41:45,085 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:41:48,751 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:41:49,276 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 5, 'per_device_eval_batch_size': 5, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 3, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 1, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr01_09-41-45_148e249fd8ed', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
-2024-04-01 09:44:51,613 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:44:51,613 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:45:32,015 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:45:32,039 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:45:32,039 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:45:54,716 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
-2024-04-01 09:45:54,719 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
-2024-04-01 09:45:54,719 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
-2024-04-01 09:45:57,946 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend

+2024-04-02 07:25:03,309 INFO    MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
+2024-04-02 07:25:03,309 INFO    MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240402_072503-b95w935d/logs/debug.log
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240402_072503-b95w935d/logs/debug-internal.log
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7e03f1c07cd0>
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:init():566] calling init triggers
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
 config: {}
+2024-04-02 07:25:03,311 INFO    MainThread:34 [wandb_init.py:init():616] starting backend
+2024-04-02 07:25:03,311 INFO    MainThread:34 [wandb_init.py:init():620] setting up manager
+2024-04-02 07:25:03,312 INFO    MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2024-04-02 07:25:03,315 INFO    MainThread:34 [wandb_init.py:init():628] backend started and connected
+2024-04-02 07:25:03,328 INFO    MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
+2024-04-02 07:25:03,908 INFO    MainThread:34 [wandb_init.py:init():720] updated telemetry
+2024-04-02 07:25:03,911 INFO    MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
+2024-04-02 07:25:04,116 INFO    MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
+2024-04-02 07:25:04,182 INFO    MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.5 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
+2024-04-02 07:25:04,183 INFO    MainThread:34 [wandb_init.py:init():804] starting run threads in backend
+2024-04-02 07:25:35,178 INFO    MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
+2024-04-02 07:25:35,178 INFO    MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
+2024-04-02 07:25:35,179 INFO    MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
+2024-04-02 07:25:35,179 INFO    MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
+2024-04-02 07:25:35,180 INFO    MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
+2024-04-02 07:25:35,186 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 30, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr02_07-24-52_0ecfc053b35f', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
+2024-04-02 09:10:25,449 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:10:25,449 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:10:45,681 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:10:45,704 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:10:45,704 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:24,497 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:11:24,498 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:11:24,498 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:34,304 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:11:34,306 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:11:34,306 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:35,541 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend

wandb/run-20240402_072503-b95w935d/files/conda-environment.yaml ADDED Viewed

	@@ -0,0 +1,1072 @@

+name: base
+channels:
+  - pytorch
+  - file:///tmp/conda
+  - rapidsai
+  - nvidia
+  - conda-forge
+  - defaults
+dependencies:
+  - _libgcc_mutex=0.1=conda_forge
+  - _openmp_mutex=4.5=2_gnu
+  - aiohttp=3.9.1=py310h2372a71_0
+  - aiosignal=1.3.1=pyhd8ed1ab_0
+  - annotated-types=0.6.0=pyhd8ed1ab_0
+  - anyio=4.2.0=pyhd8ed1ab_0
+  - archspec=0.2.2=pyhd8ed1ab_0
+  - argon2-cffi=23.1.0=pyhd8ed1ab_0
+  - argon2-cffi-bindings=21.2.0=py310h2372a71_4
+  - arrow=1.3.0=pyhd8ed1ab_0
+  - arrow-cpp=11.0.0=ha770c72_9_cpu
+  - asttokens=2.4.1=pyhd8ed1ab_0
+  - async-timeout=4.0.3=pyhd8ed1ab_0
+  - atk-1.0=2.38.0=hd4edc92_1
+  - attrs=23.2.0=pyh71513ae_0
+  - aws-c-auth=0.6.26=h987a71b_2
+  - aws-c-cal=0.5.21=h48707d8_2
+  - aws-c-common=0.8.14=h0b41bf4_0
+  - aws-c-compression=0.2.16=h03acc5a_5
+  - aws-c-event-stream=0.2.20=h00877a2_4
+  - aws-c-http=0.7.6=hf342b9f_0
+  - aws-c-io=0.13.19=h5b20300_3
+  - aws-c-mqtt=0.8.6=hc4349f7_12
+  - aws-c-s3=0.2.7=h909e904_1
+  - aws-c-sdkutils=0.1.9=h03acc5a_0
+  - aws-checksums=0.1.14=h03acc5a_5
+  - aws-crt-cpp=0.19.8=hf7fbfca_12
+  - aws-sdk-cpp=1.10.57=h17c43bd_8
+  - beautifulsoup4=4.12.2=pyha770c72_0
+  - bleach=6.1.0=pyhd8ed1ab_0
+  - boltons=23.1.1=pyhd8ed1ab_0
+  - brotli=1.0.9=h166bdaf_9
+  - brotli-bin=1.0.9=h166bdaf_9
+  - brotli-python=1.0.9=py310hd8f1fbe_9
+  - brotlipy=0.7.0=py310h7f8727e_1002
+  - bzip2=1.0.8=h7b6447c_0
+  - c-ares=1.25.0=hd590300_0
+  - ca-certificates=2024.2.2=hbcca054_0
+  - cached-property=1.5.2=hd8ed1ab_1
+  - cached_property=1.5.2=pyha770c72_1
+  - cairo=1.18.0=h3faef2a_0
+  - cartopy=0.22.0=py310hcc13569_1
+  - catalogue=2.0.10=py310hff52083_0
+  - certifi=2024.2.2=pyhd8ed1ab_0
+  - cffi=1.16.0=py310h2fee648_0
+  - charset-normalizer=3.3.2=pyhd8ed1ab_0
+  - click=8.1.7=unix_pyh707e725_0
+  - cloudpathlib=0.16.0=pyhd8ed1ab_0
+  - colorama=0.4.6=pyhd8ed1ab_0
+  - comm=0.2.1=pyhd8ed1ab_0
+  - conda=23.7.4=py310hff52083_0
+  - conda-libmamba-solver=23.7.0=pyhd8ed1ab_0
+  - conda-package-handling=2.2.0=pyh38be061_0
+  - conda-package-streaming=0.9.0=pyhd8ed1ab_0
+  - confection=0.1.4=py310h17c5347_0
+  - contourpy=1.2.0=py310hd41b1e2_0
+  - cryptography=41.0.7=py310hb8475ec_1
+  - cuda-cccl=12.4.99=0
+  - cuda-cudart=12.4.99=0
+  - cuda-cudart-dev=12.4.99=0
+  - cuda-nvcc-dev_linux-64=12.1.105=ha770c72_0
+  - cuda-nvcc-impl=12.1.105=hd3aeb46_0
+  - cuda-nvcc-tools=12.1.105=hd3aeb46_0
+  - cuda-nvrtc=12.1.105=hd3aeb46_0
+  - cuda-profiler-api=12.4.99=0
+  - cuda-python=12.4.0=py310h52dc4f0_0
+  - cuda-version=12.1=h1d6eff3_3
+  - cudf=23.08.00=cuda12_py310_230809_g8150d38e08_0
+  - cuml=23.08.00=cuda12_py310_230809_gd7162cdea_0
+  - cupy=13.0.0=py310h7aad9d2_3
+  - cupy-core=13.0.0=py310had4011e_3
+  - curl=8.6.0=hca28451_0
+  - cycler=0.12.1=pyhd8ed1ab_0
+  - cymem=2.0.8=py310hc6cd4ac_1
+  - cython-blis=0.7.10=py310h1f7b6fc_2
+  - cytoolz=0.12.3=py310h2372a71_0
+  - dask-cuda=23.08.00=py310_230809_gefbd6ca_0
+  - dask-cudf=23.08.00=cuda12_py310_230809_g8150d38e08_0
+  - debugpy=1.8.0=py310hc6cd4ac_1
+  - decorator=5.1.1=pyhd8ed1ab_0
+  - defusedxml=0.7.1=pyhd8ed1ab_0
+  - distributed=2023.7.1=pyhd8ed1ab_0
+  - distro=1.9.0=pyhd8ed1ab_0
+  - dlenv-tf-2-15-gpu=1.0.20240111=py310ha20f8e0_0
+  - dlpack=0.5=h9c3ff4c_0
+  - entrypoints=0.4=pyhd8ed1ab_0
+  - exceptiongroup=1.2.0=pyhd8ed1ab_2
+  - executing=2.0.1=pyhd8ed1ab_0
+  - expat=2.6.2=h59595ed_0
+  - fastrlock=0.8.2=py310hc6cd4ac_2
+  - fftw=3.3.10=nompi_hc118613_108
+  - fmt=9.1.0=h924138e_0
+  - font-ttf-dejavu-sans-mono=2.37=hab24e00_0
+  - font-ttf-inconsolata=3.000=h77eed37_0
+  - font-ttf-source-code-pro=2.038=h77eed37_0
+  - font-ttf-ubuntu=0.83=h77eed37_1
+  - fontconfig=2.14.2=h14ed4e7_0
+  - fonts-conda-ecosystem=1=0
+  - fonts-conda-forge=1=0
+  - fqdn=1.5.1=pyhd8ed1ab_0
+  - freetype=2.12.1=h267a509_2
+  - fribidi=1.0.10=h36c2ea0_0
+  - frozenlist=1.4.1=py310h2372a71_0
+  - fsspec=2024.3.0=pyhca7485f_0
+  - gdk-pixbuf=2.42.10=h829c605_5
+  - geos=3.11.1=h27087fc_0
+  - gettext=0.21.1=h27087fc_0
+  - gflags=2.2.2=he1b5a44_1004
+  - ghostscript=10.03.0=h59595ed_0
+  - giflib=5.2.1=h0b41bf4_3
+  - glog=0.6.0=h6f12383_0
+  - gmock=1.14.0=ha770c72_1
+  - gmp=6.3.0=h59595ed_0
+  - google-api-core-grpc=2.11.1=hd8ed1ab_0
+  - google-auth=2.26.1=pyhca7485f_0
+  - google-cloud-core=2.4.1=pyhd8ed1ab_0
+  - google-cloud-datastore=2.19.0=pyhd8ed1ab_0
+  - googleapis-common-protos=1.62.0=pyhd8ed1ab_0
+  - graphite2=1.3.13=h58526e2_1001
+  - graphviz=9.0.0=h78e8752_1
+  - grpc-cpp=1.51.1=h27aab58_3
+  - gtest=1.14.0=h00ab1b0_1
+  - gtk2=2.24.33=h280cfa0_4
+  - gts=0.7.6=h977cf35_4
+  - harfbuzz=8.3.0=h3d44ed6_0
+  - icu=73.2=h59595ed_0
+  - idna=3.6=pyhd8ed1ab_0
+  - imagemagick=7.1.1_29=pl5321hb90aeea_0
+  - importlib_metadata=7.0.1=hd8ed1ab_0
+  - importlib_resources=6.1.1=pyhd8ed1ab_0
+  - intel-openmp=2023.1.0=hdb19cb5_46306
+  - ipykernel=6.28.0=pyhd33586a_0
+  - ipython=8.20.0=pyh707e725_0
+  - ipython_genutils=0.2.0=py_1
+  - isoduration=20.11.0=pyhd8ed1ab_0
+  - jbig=2.1=h7f98852_2003
+  - jedi=0.19.1=pyhd8ed1ab_0
+  - jinja2=3.1.2=pyhd8ed1ab_1
+  - joblib=1.3.2=pyhd8ed1ab_0
+  - jsonpatch=1.33=pyhd8ed1ab_0
+  - jsonpointer=2.4=py310hff52083_3
+  - jsonschema=4.20.0=pyhd8ed1ab_0
+  - jsonschema-specifications=2023.12.1=pyhd8ed1ab_0
+  - jsonschema-with-format-nongpl=4.20.0=pyhd8ed1ab_0
+  - jupyter_client=8.6.0=pyhd8ed1ab_0
+  - jupyter_core=5.7.1=py310hff52083_0
+  - jupyter_events=0.9.0=pyhd8ed1ab_0
+  - jupyter_server_terminals=0.5.1=pyhd8ed1ab_0
+  - jupyterlab_pygments=0.3.0=pyhd8ed1ab_0
+  - keyutils=1.6.1=h166bdaf_0
+  - kiwisolver=1.4.5=py310hd41b1e2_1
+  - krb5=1.21.2=h659d440_0
+  - langcodes=3.3.0=pyhd8ed1ab_0
+  - lcms2=2.16=hb7c19ff_0
+  - ld_impl_linux-64=2.40=h41732ed_0
+  - lerc=4.0.0=h27087fc_0
+  - libabseil=20230125.0=cxx17_hcb278e6_1
+  - libarchive=3.6.2=h039dbb9_1
+  - libarrow=11.0.0=h33598ff_9_cpu
+  - libblas=3.9.0=21_linux64_openblas
+  - libbrotlicommon=1.0.9=h166bdaf_9
+  - libbrotlidec=1.0.9=h166bdaf_9
+  - libbrotlienc=1.0.9=h166bdaf_9
+  - libcblas=3.9.0=21_linux64_openblas
+  - libcrc32c=1.1.2=h9c3ff4c_0
+  - libcublas=12.1.3.1=hd3aeb46_0
+  - libcublas-dev=12.1.3.1=0
+  - libcudf=23.08.00=cuda12_230809_g8150d38e08_0
+  - libcufft=11.0.2.54=hd3aeb46_0
+  - libcufile=1.9.0.20=0
+  - libcufile-dev=1.9.0.20=0
+  - libcuml=23.08.00=cuda12_230809_gd7162cdea_0
+  - libcumlprims=23.08.00=cuda12_230809_g71c0a86_0
+  - libcurand=10.3.2.106=hd3aeb46_0
+  - libcurand-dev=10.3.2.106=0
+  - libcurl=8.6.0=hca28451_0
+  - libcusolver=11.4.5.107=hd3aeb46_0
+  - libcusolver-dev=11.4.5.107=0
+  - libcusparse=12.1.0.106=hd3aeb46_0
+  - libcusparse-dev=12.1.0.106=0
+  - libdeflate=1.19=hd590300_0
+  - libedit=3.1.20191231=he28a2e2_2
+  - libev=4.33=hd590300_2
+  - libevent=2.1.10=h28343ad_4
+  - libexpat=2.6.2=h59595ed_0
+  - libffi=3.4.2=h7f98852_5
+  - libgcc-ng=13.2.0=h807b86a_3
+  - libgd=2.3.3=h119a65a_9
+  - libgfortran-ng=13.2.0=h69a702a_5
+  - libgfortran5=13.2.0=ha4646dd_5
+  - libglib=2.80.0=hf2295e7_0
+  - libgomp=13.2.0=h807b86a_3
+  - libgoogle-cloud=2.8.0=h3c06191_0
+  - libgrpc=1.51.1=hcf146ea_3
+  - libhwloc=2.9.3=default_h554bfaf_1009
+  - libiconv=1.17=hd590300_2
+  - libjpeg-turbo=3.0.0=hd590300_1
+  - libkvikio=23.08.00=cuda12_230809_g51a9036_0
+  - liblapack=3.9.0=21_linux64_openblas
+  - libllvm14=14.0.6=hcd5def8_4
+  - libmamba=1.5.0=h658169a_0
+  - libmambapy=1.5.0=py310h8aae740_0
+  - libnghttp2=1.58.0=h47da74e_1
+  - libnsl=2.0.1=hd590300_0
+  - libnuma=2.0.18=hd590300_0
+  - libnvjitlink=12.1.105=hd3aeb46_0
+  - libopenblas=0.3.26=pthreads_h413a1c8_0
+  - libpng=1.6.43=h2797004_0
+  - libprotobuf=3.21.12=hfc55251_2
+  - libraft=23.08.00=cuda12_230809_ge588d7b5_0
+  - libraft-headers=23.08.00=cuda12_230809_ge588d7b5_0
+  - libraft-headers-only=23.08.00=cuda12_230809_ge588d7b5_0
+  - librmm=23.08.00=cuda12_230809_gf3af0e8d_0
+  - librsvg=2.56.3=he3f83f7_1
+  - libsodium=1.0.18=h36c2ea0_1
+  - libsolv=0.7.27=hfc55251_0
+  - libsqlite=3.44.2=h2797004_0
+  - libssh2=1.11.0=h0841786_0
+  - libstdcxx-ng=13.2.0=h7e041cc_3
+  - libthrift=0.18.0=h5e4af38_0
+  - libtiff=4.6.0=ha9c0a0a_2
+  - libutf8proc=2.8.0=h166bdaf_0
+  - libuuid=2.38.1=h0b41bf4_0
+  - libuv=1.46.0=hd590300_0
+  - libwebp=1.3.2=h658648e_1
+  - libwebp-base=1.3.2=hd590300_0
+  - libxcb=1.15=h0b41bf4_0
+  - libxcrypt=4.4.36=hd590300_1
+  - libxml2=2.12.6=h232c23b_0
+  - libzlib=1.2.13=hd590300_5
+  - llvm-openmp=8.0.1=hc9558a2_0
+  - locket=1.0.0=pyhd8ed1ab_0
+  - lz4=4.3.3=py310h350c4a5_0
+  - lz4-c=1.9.4=hcb278e6_0
+  - lzo=2.10=h516909a_1000
+  - magma-cuda121=2.6.1=1
+  - mamba=1.5.0=py310h51d5547_0
+  - markdown-it-py=3.0.0=pyhd8ed1ab_0
+  - matplotlib-base=3.8.3=py310h62c0568_0
+  - matplotlib-inline=0.1.6=pyhd8ed1ab_0
+  - mdurl=0.1.2=pyhd8ed1ab_0
+  - menuinst=2.0.1=py310hff52083_0
+  - mkl=2023.1.0=h213fc3f_46344
+  - msgpack-python=1.0.7=py310hd41b1e2_0
+  - multidict=6.0.4=py310h2372a71_1
+  - munkres=1.1.4=pyh9f0ad1d_0
+  - murmurhash=1.0.10=py310hc6cd4ac_1
+  - nb_conda=2.2.1=unix_7
+  - nb_conda_kernels=2.3.1=pyhd8ed1ab_3
+  - nbclassic=1.0.0=pyhb4ecaf3_1
+  - nbconvert-pandoc=7.14.0=pyhd8ed1ab_0
+  - nbformat=5.9.2=pyhd8ed1ab_0
+  - nccl=2.20.5.1=h3a97aeb_0
+  - ncurses=6.4=h59595ed_2
+  - nest-asyncio=1.5.8=pyhd8ed1ab_0
+  - nodejs=20.9.0=hb753e55_0
+  - notebook-shim=0.2.3=pyhd8ed1ab_0
+  - numpy=1.26.4=py310hb13e2d6_0
+  - nvcomp=2.6.1=h10b603f_3
+  - nvtx=0.2.10=py310h2372a71_0
+  - openjpeg=2.5.2=h488ebb8_0
+  - openmp=8.0.1=0
+  - openssl=3.2.1=hd590300_0
+  - orc=1.8.2=hfdbbad2_2
+  - overrides=7.4.0=pyhd8ed1ab_0
+  - pandoc=3.1.3=h32600fe_0
+  - pandocfilters=1.5.0=pyhd8ed1ab_0
+  - pango=1.52.1=ha41ecd1_0
+  - parquet-cpp=1.5.1=2
+  - parso=0.8.3=pyhd8ed1ab_0
+  - partd=1.4.1=pyhd8ed1ab_0
+  - pathy=0.10.3=py310h06a4308_0
+  - pcre2=10.43=hcad00b1_0
+  - perl=5.32.1=7_hd590300_perl5
+  - pickleshare=0.7.5=py_1003
+  - pip=23.3.2=pyhd8ed1ab_0
+  - pixman=0.43.2=h59595ed_0
+  - pkg-config=0.29.2=h36c2ea0_1008
+  - pkgutil-resolve-name=1.3.10=pyhd8ed1ab_1
+  - preshed=3.0.9=py310hc6cd4ac_1
+  - proj=9.3.1=h1d62c97_0
+  - prometheus_client=0.19.0=pyhd8ed1ab_0
+  - proto-plus=1.23.0=pyhd8ed1ab_0
+  - pthread-stubs=0.4=h36c2ea0_1001
+  - ptyprocess=0.7.0=pyhd3deb0d_0
+  - pure_eval=0.2.2=pyhd8ed1ab_0
+  - pyarrow=11.0.0=py310h633f555_9_cpu
+  - pyasn1=0.5.1=pyhd8ed1ab_0
+  - pyasn1-modules=0.3.0=pyhd8ed1ab_0
+  - pybind11-abi=4=hd8ed1ab_3
+  - pycosat=0.6.6=py310h2372a71_0
+  - pygments=2.17.2=pyhd8ed1ab_0
+  - pylibraft=23.08.00=cuda12_py310_230809_ge588d7b5_0
+  - pynvml=11.4.1=pyhd8ed1ab_0
+  - pyopenssl=23.3.0=pyhd8ed1ab_0
+  - pyproj=3.6.1=py310hd5c30f3_5
+  - pyshp=2.3.1=pyhd8ed1ab_0
+  - pysocks=1.7.1=py310h06a4308_0
+  - python=3.10.13=hd12c33a_1_cpython
+  - python-fastjsonschema=2.19.1=pyhd8ed1ab_0
+  - python-json-logger=2.0.7=pyhd8ed1ab_0
+  - python_abi=3.10=4_cp310
+  - pyu2f=0.1.5=pyhd8ed1ab_0
+  - pyyaml=6.0.1=py310h2372a71_1
+  - raft-dask=23.08.00=cuda12_py310_230809_ge588d7b5_0
+  - rdma-core=28.9=h59595ed_1
+  - re2=2023.02.02=hcb278e6_0
+  - readline=8.2=h8228510_1
+  - referencing=0.32.1=pyhd8ed1ab_0
+  - reproc=14.2.4.post0=hd590300_1
+  - reproc-cpp=14.2.4.post0=h59595ed_1
+  - requests=2.31.0=pyhd8ed1ab_0
+  - rfc3339-validator=0.1.4=pyhd8ed1ab_0
+  - rfc3986-validator=0.1.1=pyh9f0ad1d_0
+  - rmm=23.08.00=cuda12_py310_230809_gf3af0e8d_0
+  - rpds-py=0.16.2=py310hcb5633a_0
+  - rsa=4.9=pyhd8ed1ab_0
+  - ruamel.yaml=0.17.40=py310h2372a71_0
+  - ruamel.yaml.clib=0.2.7=py310h2372a71_2
+  - ruamel_yaml=0.15.100=py310h7f8727e_0
+  - s2n=1.3.41=h3358134_0
+  - send2trash=1.8.2=pyh41d4057_0
+  - setuptools=69.0.3=pyhd8ed1ab_0
+  - shellingham=1.5.4=pyhd8ed1ab_0
+  - smart_open=6.4.0=pyhd8ed1ab_0
+  - snappy=1.1.10=h9fff704_0
+  - sniffio=1.3.0=pyhd8ed1ab_0
+  - sortedcontainers=2.4.0=pyhd8ed1ab_0
+  - soupsieve=2.5=pyhd8ed1ab_1
+  - spacy=3.7.2=py310hcb52e73_0
+  - spacy-legacy=3.0.12=pyhd8ed1ab_0
+  - spacy-loggers=1.0.5=pyhd8ed1ab_0
+  - spdlog=1.11.0=h9b3ece8_1
+  - sqlite=3.38.2=hc218d9a_0
+  - srsly=2.4.8=py310hc6cd4ac_1
+  - stack_data=0.6.2=pyhd8ed1ab_0
+  - tblib=3.0.0=pyhd8ed1ab_0
+  - terminado=0.18.0=pyh0d859eb_0
+  - thinc=8.2.2=py310hcb52e73_0
+  - tinycss2=1.2.1=pyhd8ed1ab_0
+  - tk=8.6.13=noxft_h4845f30_101
+  - toolz=0.12.1=pyhd8ed1ab_0
+  - tornado=6.3.3=py310h2372a71_1
+  - tqdm=4.66.1=pyhd8ed1ab_0
+  - traitlets=5.9.0=pyhd8ed1ab_0
+  - treelite=3.2.0=py310h1be96d9_0
+  - truststore=0.8.0=pyhd8ed1ab_0
+  - typer=0.9.0=pyhd8ed1ab_0
+  - types-python-dateutil=2.8.19.20240106=pyhd8ed1ab_0
+  - typing-extensions=4.9.0=hd8ed1ab_0
+  - typing_extensions=4.9.0=pyha770c72_0
+  - typing_utils=0.1.0=pyhd8ed1ab_0
+  - ucx=1.14.1=h195a15c_5
+  - ucx-proc=1.0.0=gpu
+  - ucx-py=0.33.00=py310_230809_gea1eb8f_0
+  - unicodedata2=15.1.0=py310h2372a71_0
+  - uri-template=1.3.0=pyhd8ed1ab_0
+  - wasabi=1.1.2=py310hff52083_0
+  - wcwidth=0.2.13=pyhd8ed1ab_0
+  - weasel=0.3.4=pyhd8ed1ab_0
+  - webcolors=1.13=pyhd8ed1ab_0
+  - webencodings=0.5.1=pyhd8ed1ab_2
+  - websocket-client=1.7.0=pyhd8ed1ab_0
+  - wheel=0.42.0=pyhd8ed1ab_0
+  - xorg-kbproto=1.0.7=h7f98852_1002
+  - xorg-libice=1.1.1=hd590300_0
+  - xorg-libsm=1.2.4=h7391055_0
+  - xorg-libx11=1.8.7=h8ee46fc_0
+  - xorg-libxau=1.0.11=hd590300_0
+  - xorg-libxdmcp=1.1.3=h7f98852_0
+  - xorg-libxext=1.3.4=h0b41bf4_2
+  - xorg-libxrender=0.9.11=hd590300_0
+  - xorg-libxt=1.3.0=hd590300_1
+  - xorg-renderproto=0.11.1=h7f98852_1002
+  - xorg-xextproto=7.3.0=h0b41bf4_1003
+  - xorg-xproto=7.0.31=h7f98852_1007
+  - xyzservices=2023.10.1=pyhd8ed1ab_0
+  - xz=5.2.6=h166bdaf_0
+  - yaml=0.2.5=h7b6447c_0
+  - yaml-cpp=0.7.0=h59595ed_3
+  - zeromq=4.3.5=h59595ed_0
+  - zict=3.0.0=pyhd8ed1ab_0
+  - zipp=3.17.0=pyhd8ed1ab_0
+  - zlib=1.2.13=hd590300_5
+  - zstandard=0.22.0=py310h1275a96_0
+  - zstd=1.5.5=hfc55251_0
+  - pip:
+      - absl-py==1.4.0
+      - accelerate==0.28.0
+      - access==1.1.9
+      - affine==2.4.0
+      - aiobotocore==2.12.1
+      - aiofiles==22.1.0
+      - aiohttp-cors==0.7.0
+      - aioitertools==0.11.0
+      - aiorwlock==1.3.0
+      - aiosqlite==0.19.0
+      - albumentations==1.4.0
+      - alembic==1.13.1
+      - altair==5.2.0
+      - annoy==1.17.3
+      - apache-beam==2.46.0
+      - aplus==0.11.0
+      - appdirs==1.4.4
+      - array-record==0.5.0
+      - arviz==0.17.1
+      - astroid==3.0.3
+      - astropy==6.0.0
+      - astropy-iers-data==0.2024.3.18.0.29.47
+      - astunparse==1.6.3
+      - async-lru==2.0.4
+      - audioread==3.0.1
+      - auto-gptq==0.7.1
+      - autopep8==2.0.4
+      - babel==2.14.0
+      - backoff==2.2.1
+      - bayesian-optimization==1.4.3
+      - beatrix-jupyterlab==2023.128.151533
+      - bidict==0.23.1
+      - bitsandbytes==0.43.0
+      - blake3==0.2.1
+      - blessed==1.20.0
+      - blinker==1.7.0
+      - blosc2==2.5.1
+      - bokeh==3.3.4
+      - boruta==0.3
+      - boto3==1.26.100
+      - botocore==1.34.51
+      - bqplot==0.12.43
+      - branca==0.7.1
+      - brewer2mpl==1.4.1
+      - cachetools==4.2.4
+      - catalyst==22.4
+      - catboost==1.2.3
+      - category-encoders==2.6.3
+      - cesium==0.12.1
+      - chex==0.1.85
+      - cleverhans==4.0.0
+      - click-plugins==1.1.1
+      - cligj==0.7.2
+      - cloud-tpu-client==0.10
+      - cloud-tpu-profiler==2.4.0
+      - cloudpickle==2.2.1
+      - cmdstanpy==1.2.1
+      - cmudict==1.0.21
+      - colorcet==3.1.0
+      - coloredlogs==15.0.1
+      - colorful==0.5.6
+      - colorlog==6.8.2
+      - colorlover==0.3.0
+      - contextily==1.5.2
+      - convertdate==2.4.0
+      - cramjam==2.8.3
+      - crcmod==1.7
+      - cufflinks==0.17.3
+      - cvxcanon==0.1.2
+      - cython==3.0.8
+      - daal==2024.1.0
+      - daal4py==2024.1.0
+      - dacite==1.8.1
+      - dask==2024.3.1
+      - dask-expr==1.0.4
+      - dataclasses-json==0.6.4
+      - dataproc-jupyter-plugin==0.1.66
+      - datasets==2.1.0
+      - datashader==0.16.0
+      - datatile==1.0.3
+      - db-dtypes==1.2.0
+      - deap==1.4.1
+      - deepdiff==6.7.1
+      - deprecated==1.2.14
+      - deprecation==2.1.0
+      - descartes==1.1.0
+      - dill==0.3.8
+      - dipy==1.9.0
+      - distlib==0.3.8
+      - dm-tree==0.1.8
+      - docker==7.0.0
+      - docker-pycreds==0.4.0
+      - docopt==0.6.2
+      - docstring-parser==0.15
+      - docstring-to-markdown==0.15
+      - docutils==0.20.1
+      - earthengine-api==0.1.394
+      - easydict==1.13
+      - easyocr==1.7.1
+      - ecos==2.0.13
+      - eli5==0.13.0
+      - emoji==2.10.1
+      - en-core-web-lg==3.7.1
+      - en-core-web-sm==3.7.1
+      - ephem==4.1.5
+      - esda==2.5.1
+      - essentia==2.1b6.dev1110
+      - et-xmlfile==1.1.0
+      - etils==1.6.0
+      - explainable-ai-sdk==1.3.3
+      - farama-notifications==0.0.4
+      - fastai==2.7.14
+      - fastapi==0.108.0
+      - fastavro==1.9.3
+      - fastcore==1.5.29
+      - fastdownload==0.0.7
+      - fasteners==0.19
+      - fastparquet==2024.2.0
+      - fastprogress==1.0.3
+      - fasttext==0.9.2
+      - feather-format==0.4.1
+      - featuretools==1.30.0
+      - filelock==3.13.1
+      - fiona==1.9.6
+      - fitter==1.7.0
+      - flake8==7.0.0
+      - flashtext==2.7
+      - flask==3.0.2
+      - flatbuffers==23.5.26
+      - flax==0.8.2
+      - folium==0.16.0
+      - fonttools==4.47.0
+      - frozendict==2.4.0
+      - funcy==2.0
+      - fury==0.10.0
+      - future==1.0.0
+      - fuzzywuzzy==0.18.0
+      - gast==0.5.4
+      - gatspy==0.3
+      - gcsfs==2023.12.2.post1
+      - gekko==1.1.0
+      - gensim==4.3.2
+      - geographiclib==2.0
+      - geohash==1.0
+      - geojson==3.1.0
+      - geopandas==0.14.3
+      - geoplot==0.5.1
+      - geopy==2.4.1
+      - geoviews==1.11.1
+      - ggplot==0.11.5
+      - giddy==2.3.5
+      - gitdb==4.0.11
+      - gitpython==3.1.41
+      - google-ai-generativelanguage==0.4.0
+      - google-api-core==2.17.1
+      - google-api-python-client==2.122.0
+      - google-apitools==0.5.31
+      - google-auth-httplib2==0.1.1
+      - google-auth-oauthlib==1.2.0
+      - google-cloud-aiplatform==0.6.0a1
+      - google-cloud-artifact-registry==1.10.0
+      - google-cloud-automl==1.0.1
+      - google-cloud-bigquery==2.34.4
+      - google-cloud-bigtable==1.7.3
+      - google-cloud-dlp==3.14.0
+      - google-cloud-jupyter-config==0.0.5
+      - google-cloud-language==2.13.3
+      - google-cloud-monitoring==2.18.0
+      - google-cloud-pubsub==2.19.0
+      - google-cloud-pubsublite==1.9.0
+      - google-cloud-recommendations-ai==0.7.1
+      - google-cloud-resource-manager==1.11.0
+      - google-cloud-spanner==3.40.1
+      - google-cloud-storage==1.44.0
+      - google-cloud-translate==3.12.1
+      - google-cloud-videointelligence==2.13.3
+      - google-cloud-vision==2.8.0
+      - google-crc32c==1.5.0
+      - google-generativeai==0.4.1
+      - google-pasta==0.2.0
+      - google-resumable-media==2.7.0
+      - gplearn==0.4.2
+      - gpustat==1.0.0
+      - gpxpy==1.6.2
+      - greenlet==3.0.3
+      - grpc-google-iam-v1==0.12.7
+      - grpcio==1.60.0
+      - grpcio-status==1.48.2
+      - gviz-api==1.10.0
+      - gym==0.26.2
+      - gym-notices==0.0.8
+      - gymnasium==0.29.0
+      - h11==0.14.0
+      - h2o==3.46.0.1
+      - h5netcdf==1.3.0
+      - h5py==3.10.0
+      - haversine==2.8.1
+      - hdfs==2.7.3
+      - hep-ml==0.7.2
+      - hijri-converter==2.3.1
+      - hmmlearn==0.3.2
+      - holidays==0.24
+      - holoviews==1.18.3
+      - hpsklearn==0.1.0
+      - html5lib==1.1
+      - htmlmin==0.1.12
+      - httpcore==1.0.4
+      - httplib2==0.21.0
+      - httptools==0.6.1
+      - httpx==0.27.0
+      - huggingface-hub==0.21.4
+      - humanfriendly==10.0
+      - hunspell==0.5.5
+      - husl==4.0.3
+      - hydra-slayer==0.5.0
+      - hyperopt==0.2.7
+      - hypertools==0.8.0
+      - igraph==0.11.4
+      - imagecodecs==2024.1.1
+      - imagehash==4.3.1
+      - imageio==2.33.1
+      - imbalanced-learn==0.12.0
+      - imgaug==0.4.0
+      - importlib-metadata==6.11.0
+      - inequality==1.0.1
+      - iniconfig==2.0.0
+      - ipydatawidgets==4.3.5
+      - ipyleaflet==0.18.2
+      - ipympl==0.7.0
+      - ipython-genutils==0.2.0
+      - ipython-sql==0.5.0
+      - ipyvolume==0.6.3
+      - ipyvue==1.10.2
+      - ipyvuetify==1.9.2
+      - ipywebrtc==0.6.0
+      - ipywidgets==7.7.1
+      - isort==5.13.2
+      - isoweek==1.3.3
+      - itsdangerous==2.1.2
+      - janome==0.5.0
+      - jaraco-classes==3.3.0
+      - jax==0.4.23
+      - jax-jumpy==1.0.0
+      - jaxlib==0.4.23.dev20240116
+      - jeepney==0.8.0
+      - jieba==0.42.1
+      - jmespath==1.0.1
+      - json5==0.9.14
+      - jupyter-client==7.4.9
+      - jupyter-console==6.6.3
+      - jupyter-http-over-ws==0.0.8
+      - jupyter-lsp==1.5.1
+      - jupyter-server==2.13.0
+      - jupyter-server-fileid==0.9.1
+      - jupyter-server-mathjax==0.2.6
+      - jupyter-server-proxy==4.1.0
+      - jupyter-server-ydoc==0.8.0
+      - jupyter-ydoc==0.2.5
+      - jupyterlab==4.1.5
+      - jupyterlab-git==0.44.0
+      - jupyterlab-lsp==5.1.0
+      - jupyterlab-server==2.25.2
+      - jupyterlab-widgets==3.0.9
+      - jupytext==1.16.0
+      - kaggle==1.6.6
+      - kaggle-environments==1.14.3
+      - kagglehub==0.2.0
+      - keras==3.0.5
+      - keras-cv==0.8.2
+      - keras-nlp==0.8.2
+      - keras-tuner==1.4.6
+      - kernels-mixer==0.0.7
+      - keyring==24.3.0
+      - keyrings-google-artifactregistry-auth==1.1.2
+      - kfp==2.5.0
+      - kfp-pipeline-spec==0.2.2
+      - kfp-server-api==2.0.5
+      - kmapper==2.0.1
+      - kmodes==0.12.2
+      - korean-lunar-calendar==0.3.1
+      - kornia==0.7.2
+      - kornia-rs==0.1.2
+      - kt-legacy==1.0.5
+      - kubernetes==26.1.0
+      - langid==1.1.6
+      - lazy-loader==0.3
+      - learntools==0.3.4
+      - leven==1.0.4
+      - levenshtein==0.25.0
+      - libclang==16.0.6
+      - libpysal==4.9.2
+      - librosa==0.10.1
+      - lightgbm==4.2.0
+      - lightning-utilities==0.10.1
+      - lime==0.2.0.1
+      - line-profiler==4.1.2
+      - linkify-it-py==2.0.3
+      - llvmlite==0.41.1
+      - lml==0.1.0
+      - loguru==0.7.2
+      - lunarcalendar==0.0.9
+      - lxml==5.1.0
+      - mako==1.3.2
+      - mapclassify==2.6.1
+      - markdown==3.5.2
+      - markovify==0.9.4
+      - markupsafe==2.1.5
+      - marshmallow==3.21.1
+      - matplotlib==3.7.5
+      - matplotlib-venn==0.11.10
+      - mccabe==0.7.0
+      - mdit-py-plugins==0.4.0
+      - memory-profiler==0.61.0
+      - mercantile==1.2.1
+      - mgwr==2.2.1
+      - missingno==0.5.2
+      - mistune==0.8.4
+      - mizani==0.11.0
+      - ml-dtypes==0.2.0
+      - mlcrate==0.2.0
+      - mlens==0.2.3
+      - mlxtend==0.23.1
+      - mmh3==4.1.0
+      - mne==1.6.1
+      - mnist==0.2.2
+      - mock==5.1.0
+      - momepy==0.7.0
+      - more-itertools==10.2.0
+      - mpld3==0.5.10
+      - mpmath==1.3.0
+      - msgpack-numpy==0.4.8
+      - multimethod==1.10
+      - multipledispatch==1.0.0
+      - multiprocess==0.70.16
+      - mypy-extensions==1.0.0
+      - namex==0.0.7
+      - nbclient==0.5.13
+      - nbconvert==6.4.5
+      - nbdime==3.2.0
+      - ndindex==1.8
+      - networkx==3.2.1
+      - nibabel==5.2.1
+      - nilearn==0.10.3
+      - ninja==1.11.1.1
+      - nltk==3.2.4
+      - nose==1.3.7
+      - notebook==6.5.6
+      - notebook-executor==0.2
+      - numba==0.58.1
+      - numexpr==2.9.0
+      - nvidia-ml-py==11.495.46
+      - oauth2client==4.1.3
+      - oauthlib==3.2.2
+      - objsize==0.6.1
+      - odfpy==1.4.1
+      - olefile==0.47
+      - onnx==1.15.0
+      - opencensus==0.11.4
+      - opencensus-context==0.1.3
+      - opencv-contrib-python==4.9.0.80
+      - opencv-python==4.9.0.80
+      - opencv-python-headless==4.9.0.80
+      - openpyxl==3.1.2
+      - openslide-python==1.3.1
+      - opentelemetry-api==1.22.0
+      - opentelemetry-exporter-otlp==1.22.0
+      - opentelemetry-exporter-otlp-proto-common==1.22.0
+      - opentelemetry-exporter-otlp-proto-grpc==1.22.0
+      - opentelemetry-exporter-otlp-proto-http==1.22.0
+      - opentelemetry-proto==1.22.0
+      - opentelemetry-sdk==1.22.0
+      - opentelemetry-semantic-conventions==0.43b0
+      - opt-einsum==3.3.0
+      - optax==0.2.1
+      - optimum==1.18.0
+      - optuna==3.6.0
+      - orbax-checkpoint==0.5.6
+      - ordered-set==4.1.0
+      - orjson==3.9.10
+      - ortools==9.4.1874
+      - osmnx==1.9.1
+      - packaging==21.3
+      - pandas==2.2.1
+      - pandas-datareader==0.10.0
+      - pandas-profiling==3.6.6
+      - pandas-summary==0.2.0
+      - pandasql==0.7.3
+      - panel==1.3.8
+      - papermill==2.5.0
+      - param==2.0.2
+      - path==16.10.0
+      - path-py==12.5.0
+      - pathos==0.3.2
+      - patsy==0.5.6
+      - pdf2image==1.17.0
+      - peft==0.10.0
+      - pettingzoo==1.24.0
+      - pexpect==4.9.0
+      - phik==0.12.4
+      - pillow==9.5.0
+      - platformdirs==4.2.0
+      - plotly==5.18.0
+      - plotly-express==0.4.1
+      - plotnine==0.13.2
+      - pluggy==1.4.0
+      - pointpats==2.4.0
+      - polars==0.20.15
+      - polyglot==16.7.4
+      - pooch==1.8.1
+      - pox==0.3.4
+      - ppca==0.0.4
+      - ppft==1.7.6.8
+      - preprocessing==0.1.13
+      - prettytable==3.9.0
+      - progressbar2==4.4.2
+      - promise==2.3
+      - prompt-toolkit==3.0.43
+      - pronouncing==0.2.0
+      - prophet==1.1.1
+      - protobuf==3.20.3
+      - psutil==5.9.3
+      - pudb==2024.1
+      - pulp==2.8.0
+      - py-cpuinfo==9.0.0
+      - py-spy==0.3.14
+      - py4j==0.10.9.7
+      - pyaml==23.12.0
+      - pyarabic==0.6.15
+      - pyastronomy==0.21.0
+      - pybind11==2.11.1
+      - pyclipper==1.3.0.post5
+      - pycodestyle==2.11.1
+      - pycparser==2.21
+      - pycryptodome==3.20.0
+      - pyct==0.5.0
+      - pycuda==2024.1
+      - pydantic==2.5.3
+      - pydantic-core==2.14.6
+      - pydegensac==0.1.2
+      - pydicom==2.4.4
+      - pydocstyle==6.3.0
+      - pydot==1.4.2
+      - pydub==0.25.1
+      - pyemd==1.0.0
+      - pyerfa==2.0.1.1
+      - pyexcel-io==0.6.6
+      - pyexcel-ods==0.6.0
+      - pyflakes==3.2.0
+      - pygltflib==1.16.2
+      - pyjwt==2.8.0
+      - pykalman==0.9.5
+      - pyldavis==3.4.1
+      - pylint==3.0.4
+      - pymc3==3.11.4
+      - pymeeus==0.5.12
+      - pymongo==3.13.0
+      - pympler==1.0.1
+      - pynndescent==0.5.11
+      - pynvrtc==9.2
+      - pyocr==0.8.5
+      - pyparsing==3.1.1
+      - pypdf==4.1.0
+      - pysal==24.1
+      - pytesseract==0.3.10
+      - pytest==8.1.1
+      - python-bidi==0.4.2
+      - python-dateutil==2.9.0.post0
+      - python-dotenv==1.0.0
+      - python-graphviz==0.20.2
+      - python-levenshtein==0.25.0
+      - python-louvain==0.16
+      - python-lsp-jsonrpc==1.1.2
+      - python-lsp-server==1.10.1
+      - python-slugify==8.0.4
+      - python-utils==3.8.2
+      - pythreejs==2.4.2
+      - pytoolconfig==1.3.1
+      - pytools==2023.1.1
+      - pytorch-ignite==0.4.13
+      - pytorch-lightning==2.2.1
+      - pytz==2023.3.post1
+      - pyupset==0.1.1.post7
+      - pyviz-comms==3.0.1
+      - pywavelets==1.5.0
+      - pyzmq==24.0.1
+      - qgrid==1.3.1
+      - qtconsole==5.5.1
+      - qtpy==2.4.1
+      - quantecon==0.7.2
+      - quantities==0.15.0
+      - qudida==0.0.4
+      - rapidfuzz==3.6.2
+      - rasterio==1.3.9
+      - rasterstats==0.19.0
+      - ray==2.9.0
+      - ray-cpp==2.9.0
+      - regex==2023.12.25
+      - requests-oauthlib==1.3.1
+      - requests-toolbelt==0.10.1
+      - responses==0.18.0
+      - retrying==1.3.4
+      - rgf-python==3.12.0
+      - rich==13.7.0
+      - rich-click==1.7.4
+      - rope==1.12.0
+      - rouge==1.0.1
+      - rtree==1.2.0
+      - s2sphere==0.2.5
+      - s3fs==2024.3.0
+      - s3transfer==0.6.2
+      - safetensors==0.4.2
+      - scattertext==0.1.19
+      - scikit-image==0.22.0
+      - scikit-learn==1.2.2
+      - scikit-learn-intelex==2024.1.0
+      - scikit-multilearn==0.2.0
+      - scikit-optimize==0.10.1
+      - scikit-plot==0.3.7
+      - scikit-surprise==1.1.3
+      - scipy==1.11.4
+      - seaborn==0.12.2
+      - secretstorage==3.3.3
+      - segment-anything==1.0
+      - segregation==2.5
+      - semver==3.0.2
+      - sentencepiece==0.2.0
+      - sentry-sdk==1.42.0
+      - setproctitle==1.3.3
+      - setuptools-git==1.2
+      - setuptools-scm==8.0.4
+      - shap==0.44.1
+      - shapely==2.0.3
+      - shimmy==1.3.0
+      - simpervisor==1.0.0
+      - simpleitk==2.3.1
+      - simplejson==3.19.2
+      - six==1.16.0
+      - sklearn-pandas==2.2.0
+      - slicer==0.0.7
+      - smmap==5.0.1
+      - snowballstemmer==2.2.0
+      - snuggs==1.4.7
+      - soundfile==0.12.1
+      - soxr==0.3.7
+      - spaghetti==1.7.5.post1
+      - spectral==0.23.1
+      - spglm==1.1.0
+      - sphinx-rtd-theme==0.2.4
+      - spint==1.0.7
+      - splot==1.1.5.post1
+      - spopt==0.6.0
+      - spreg==1.4.2
+      - spvcm==0.3.0
+      - sqlalchemy==2.0.25
+      - sqlparse==0.4.4
+      - squarify==0.4.3
+      - stable-baselines3==2.1.0
+      - stack-data==0.6.3
+      - stanio==0.3.0
+      - starlette==0.32.0.post1
+      - statsmodels==0.14.1
+      - stemming==1.0.1
+      - stop-words==2018.7.23
+      - stopit==1.1.2
+      - stumpy==1.12.0
+      - sympy==1.12
+      - tables==3.9.2
+      - tabulate==0.9.0
+      - tangled-up-in-unicode==0.2.0
+      - tbb==2021.11.0
+      - tenacity==8.2.3
+      - tensorboard==2.15.1
+      - tensorboard-data-server==0.7.2
+      - tensorboard-plugin-profile==2.15.0
+      - tensorboardx==2.6.2.2
+      - tensorflow==2.15.0
+      - tensorflow-cloud==0.1.16
+      - tensorflow-datasets==4.9.4
+      - tensorflow-decision-forests==1.8.1
+      - tensorflow-estimator==2.15.0
+      - tensorflow-hub==0.16.1
+      - tensorflow-io==0.35.0
+      - tensorflow-io-gcs-filesystem==0.35.0
+      - tensorflow-metadata==0.14.0
+      - tensorflow-probability==0.23.0
+      - tensorflow-serving-api==2.14.1
+      - tensorflow-text==2.15.0
+      - tensorflow-transform==0.14.0
+      - tensorpack==0.11
+      - tensorstore==0.1.56
+      - termcolor==2.4.0
+      - testpath==0.6.0
+      - text-unidecode==1.3
+      - textblob==0.18.0.post0
+      - texttable==1.7.0
+      - tf-keras==2.15.1
+      - tfp-nightly==0.24.0.dev0
+      - theano==1.0.5
+      - theano-pymc==1.1.2
+      - threadpoolctl==3.2.0
+      - tifffile==2023.12.9
+      - timm==0.9.16
+      - tobler==0.11.2
+      - tokenizers==0.15.2
+      - toml==0.10.2
+      - tomli==2.0.1
+      - tomlkit==0.12.4
+      - torch==2.1.2
+      - torchaudio==2.1.2
+      - torchdata==0.7.1
+      - torchinfo==1.8.0
+      - torchmetrics==1.3.2
+      - torchtext==0.16.2
+      - torchvision==0.16.2
+      - tpot==0.12.1
+      - traceml==1.0.8
+      - traittypes==0.2.1
+      - transformers==4.38.2
+      - treelite-runtime==3.2.0
+      - trueskill==0.4.5
+      - trx-python==0.2.9
+      - tsfresh==0.20.2
+      - typeguard==4.1.5
+      - typing-inspect==0.9.0
+      - tzdata==2023.4
+      - uc-micro-py==1.0.3
+      - ujson==5.9.0
+      - umap-learn==0.5.5
+      - unidecode==1.3.8
+      - update-checker==0.18.0
+      - uritemplate==3.0.1
+      - urllib3==1.26.18
+      - urwid==2.6.9
+      - urwid-readline==0.14
+      - uvicorn==0.25.0
+      - uvloop==0.19.0
+      - vaex==4.17.0
+      - vaex-astro==0.9.3
+      - vaex-core==4.17.1
+      - vaex-hdf5==0.14.1
+      - vaex-jupyter==0.8.2
+      - vaex-ml==0.18.3
+      - vaex-server==0.9.0
+      - vaex-viz==0.5.4
+      - vec-noise==1.1.4
+      - vecstack==0.4.0
+      - virtualenv==20.21.0
+      - visions==0.7.5
+      - vowpalwabbit==9.9.0
+      - vtk==9.3.0
+      - wand==0.6.13
+      - wandb==0.16.4
+      - watchfiles==0.21.0
+      - wavio==0.0.8
+      - websockets==12.0
+      - werkzeug==3.0.1
+      - wfdb==4.1.2
+      - whatthepatch==1.0.5
+      - widgetsnbextension==3.6.6
+      - witwidget==1.8.1
+      - woodwork==0.29.0
+      - wordcloud==1.9.3
+      - wordsegment==1.3.1
+      - wrapt==1.14.1
+      - xarray==2024.2.0
+      - xarray-einstats==0.7.0
+      - xgboost==2.0.3
+      - xvfbwrapper==0.2.9
+      - xxhash==3.4.1
+      - y-py==0.6.2
+      - yapf==0.40.2
+      - yarl==1.9.4
+      - ydata-profiling==4.6.4
+      - yellowbrick==1.5
+      - ypy-websocket==0.8.4
+prefix: /opt/conda

wandb/run-20240402_072503-b95w935d/files/config.yaml ADDED Viewed

	@@ -0,0 +1,724 @@

+wandb_version: 1
+_wandb:
+  desc: null
+  value:
+    python_version: 3.10.13
+    cli_version: 0.16.4
+    framework: huggingface
+    huggingface_version: 4.38.2
+    is_jupyter_run: true
+    is_kaggle_kernel: true
+    start_time: 1712042703.0
+    t:
+      1:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 99
+      - 105
+      2:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 99
+      - 105
+      3:
+      - 7
+      - 23
+      4: 3.10.13
+      5: 0.16.4
+      6: 4.38.2
+      8:
+      - 1
+      - 2
+      - 5
+      9:
+        1: transformers_trainer
+      13: linux-x86_64
+    m:
+    - 1: train/global_step
+      6:
+      - 3
+    - 1: train/loss
+      5: 1
+      6:
+      - 1
+    - 1: train/grad_norm
+      5: 1
+      6:
+      - 1
+    - 1: train/learning_rate
+      5: 1
+      6:
+      - 1
+    - 1: train/epoch
+      5: 1
+      6:
+      - 1
+    - 1: eval/loss
+      5: 1
+      6:
+      - 1
+    - 1: eval/runtime
+      5: 1
+      6:
+      - 1
+    - 1: eval/samples_per_second
+      5: 1
+      6:
+      - 1
+    - 1: eval/steps_per_second
+      5: 1
+      6:
+      - 1
+    - 1: train/train_runtime
+      5: 1
+      6:
+      - 1
+    - 1: train/train_samples_per_second
+      5: 1
+      6:
+      - 1
+    - 1: train/train_steps_per_second
+      5: 1
+      6:
+      - 1
+    - 1: train/total_flos
+      5: 1
+      6:
+      - 1
+    - 1: train/train_loss
+      5: 1
+      6:
+      - 1
+vocab_size:
+  desc: null
+  value: 32000
+max_position_embeddings:
+  desc: null
+  value: 32768
+hidden_size:
+  desc: null
+  value: 4096
+intermediate_size:
+  desc: null
+  value: 14336
+num_hidden_layers:
+  desc: null
+  value: 32
+num_attention_heads:
+  desc: null
+  value: 32
+sliding_window:
+  desc: null
+  value: null
+num_key_value_heads:
+  desc: null
+  value: 8
+hidden_act:
+  desc: null
+  value: silu
+initializer_range:
+  desc: null
+  value: 0.02
+rms_norm_eps:
+  desc: null
+  value: 1.0e-05
+use_cache:
+  desc: null
+  value: false
+rope_theta:
+  desc: null
+  value: 1000000.0
+attention_dropout:
+  desc: null
+  value: 0.0
+return_dict:
+  desc: null
+  value: true
+output_hidden_states:
+  desc: null
+  value: false
+output_attentions:
+  desc: null
+  value: false
+torchscript:
+  desc: null
+  value: false
+torch_dtype:
+  desc: null
+  value: bfloat16
+use_bfloat16:
+  desc: null
+  value: false
+tf_legacy_loss:
+  desc: null
+  value: false
+pruned_heads:
+  desc: null
+  value: {}
+tie_word_embeddings:
+  desc: null
+  value: false
+chunk_size_feed_forward:
+  desc: null
+  value: 0
+is_encoder_decoder:
+  desc: null
+  value: false
+is_decoder:
+  desc: null
+  value: false
+cross_attention_hidden_size:
+  desc: null
+  value: null
+add_cross_attention:
+  desc: null
+  value: false
+tie_encoder_decoder:
+  desc: null
+  value: false
+max_length:
+  desc: null
+  value: 20
+min_length:
+  desc: null
+  value: 0
+do_sample:
+  desc: null
+  value: false
+early_stopping:
+  desc: null
+  value: false
+num_beams:
+  desc: null
+  value: 1
+num_beam_groups:
+  desc: null
+  value: 1
+diversity_penalty:
+  desc: null
+  value: 0.0
+temperature:
+  desc: null
+  value: 1.0
+top_k:
+  desc: null
+  value: 50
+top_p:
+  desc: null
+  value: 1.0
+typical_p:
+  desc: null
+  value: 1.0
+repetition_penalty:
+  desc: null
+  value: 1.0
+length_penalty:
+  desc: null
+  value: 1.0
+no_repeat_ngram_size:
+  desc: null
+  value: 0
+encoder_no_repeat_ngram_size:
+  desc: null
+  value: 0
+bad_words_ids:
+  desc: null
+  value: null
+num_return_sequences:
+  desc: null
+  value: 1
+output_scores:
+  desc: null
+  value: false
+return_dict_in_generate:
+  desc: null
+  value: false
+forced_bos_token_id:
+  desc: null
+  value: null
+forced_eos_token_id:
+  desc: null
+  value: null
+remove_invalid_values:
+  desc: null
+  value: false
+exponential_decay_length_penalty:
+  desc: null
+  value: null
+suppress_tokens:
+  desc: null
+  value: null
+begin_suppress_tokens:
+  desc: null
+  value: null
+architectures:
+  desc: null
+  value:
+  - MistralForCausalLM
+finetuning_task:
+  desc: null
+  value: null
+id2label:
+  desc: null
+  value:
+    '0': LABEL_0
+    '1': LABEL_1
+label2id:
+  desc: null
+  value:
+    LABEL_0: 0
+    LABEL_1: 1
+tokenizer_class:
+  desc: null
+  value: null
+prefix:
+  desc: null
+  value: null
+bos_token_id:
+  desc: null
+  value: 1
+pad_token_id:
+  desc: null
+  value: 0
+eos_token_id:
+  desc: null
+  value: 2
+sep_token_id:
+  desc: null
+  value: null
+decoder_start_token_id:
+  desc: null
+  value: null
+task_specific_params:
+  desc: null
+  value: null
+problem_type:
+  desc: null
+  value: null
+_name_or_path:
+  desc: null
+  value: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+transformers_version:
+  desc: null
+  value: 4.38.2
+model_type:
+  desc: null
+  value: mistral
+pretraining_tp:
+  desc: null
+  value: 1
+quantization_config:
+  desc: null
+  value:
+    quant_method: QuantizationMethod.GPTQ
+    bits: 4
+    tokenizer: null
+    dataset: null
+    group_size: 128
+    damp_percent: 0.1
+    desc_act: true
+    sym: true
+    true_sequential: true
+    use_cuda_fp16: false
+    model_seqlen: null
+    block_name_to_quantize: null
+    module_name_preceding_first_block: null
+    batch_size: 1
+    pad_token_id: null
+    use_exllama: true
+    max_input_length: null
+    exllama_config:
+      version: ExllamaVersion.ONE
+    cache_block_outputs: true
+    modules_in_block_to_quantize: null
+output_dir:
+  desc: null
+  value: /kaggle/working/
+overwrite_output_dir:
+  desc: null
+  value: false
+do_train:
+  desc: null
+  value: false
+do_eval:
+  desc: null
+  value: true
+do_predict:
+  desc: null
+  value: false
+evaluation_strategy:
+  desc: null
+  value: epoch
+prediction_loss_only:
+  desc: null
+  value: false
+per_device_train_batch_size:
+  desc: null
+  value: 6
+per_device_eval_batch_size:
+  desc: null
+  value: 6
+per_gpu_train_batch_size:
+  desc: null
+  value: null
+per_gpu_eval_batch_size:
+  desc: null
+  value: null
+gradient_accumulation_steps:
+  desc: null
+  value: 4
+eval_accumulation_steps:
+  desc: null
+  value: null
+eval_delay:
+  desc: null
+  value: 0
+learning_rate:
+  desc: null
+  value: 0.0002
+weight_decay:
+  desc: null
+  value: 0.01
+adam_beta1:
+  desc: null
+  value: 0.9
+adam_beta2:
+  desc: null
+  value: 0.999
+adam_epsilon:
+  desc: null
+  value: 1.0e-08
+max_grad_norm:
+  desc: null
+  value: 1.0
+num_train_epochs:
+  desc: null
+  value: 30
+max_steps:
+  desc: null
+  value: -1
+lr_scheduler_type:
+  desc: null
+  value: linear
+lr_scheduler_kwargs:
+  desc: null
+  value: {}
+warmup_ratio:
+  desc: null
+  value: 0.0
+warmup_steps:
+  desc: null
+  value: 2
+log_level:
+  desc: null
+  value: passive
+log_level_replica:
+  desc: null
+  value: warning
+log_on_each_node:
+  desc: null
+  value: true
+logging_dir:
+  desc: null
+  value: /kaggle/working/runs/Apr02_07-24-52_0ecfc053b35f
+logging_strategy:
+  desc: null
+  value: epoch
+logging_first_step:
+  desc: null
+  value: false
+logging_steps:
+  desc: null
+  value: 500
+logging_nan_inf_filter:
+  desc: null
+  value: true
+save_strategy:
+  desc: null
+  value: epoch
+save_steps:
+  desc: null
+  value: 500
+save_total_limit:
+  desc: null
+  value: null
+save_safetensors:
+  desc: null
+  value: true
+save_on_each_node:
+  desc: null
+  value: false
+save_only_model:
+  desc: null
+  value: false
+no_cuda:
+  desc: null
+  value: false
+use_cpu:
+  desc: null
+  value: false
+use_mps_device:
+  desc: null
+  value: false
+seed:
+  desc: null
+  value: 42
+data_seed:
+  desc: null
+  value: null
+jit_mode_eval:
+  desc: null
+  value: false
+use_ipex:
+  desc: null
+  value: false
+bf16:
+  desc: null
+  value: false
+fp16:
+  desc: null
+  value: true
+fp16_opt_level:
+  desc: null
+  value: O1
+half_precision_backend:
+  desc: null
+  value: auto
+bf16_full_eval:
+  desc: null
+  value: false
+fp16_full_eval:
+  desc: null
+  value: false
+tf32:
+  desc: null
+  value: null
+local_rank:
+  desc: null
+  value: 0
+ddp_backend:
+  desc: null
+  value: null
+tpu_num_cores:
+  desc: null
+  value: null
+tpu_metrics_debug:
+  desc: null
+  value: false
+debug:
+  desc: null
+  value: []
+dataloader_drop_last:
+  desc: null
+  value: false
+eval_steps:
+  desc: null
+  value: null
+dataloader_num_workers:
+  desc: null
+  value: 0
+dataloader_prefetch_factor:
+  desc: null
+  value: null
+past_index:
+  desc: null
+  value: -1
+run_name:
+  desc: null
+  value: /kaggle/working/
+disable_tqdm:
+  desc: null
+  value: false
+remove_unused_columns:
+  desc: null
+  value: true
+label_names:
+  desc: null
+  value: null
+load_best_model_at_end:
+  desc: null
+  value: true
+metric_for_best_model:
+  desc: null
+  value: loss
+greater_is_better:
+  desc: null
+  value: false
+ignore_data_skip:
+  desc: null
+  value: false
+fsdp:
+  desc: null
+  value: []
+fsdp_min_num_params:
+  desc: null
+  value: 0
+fsdp_config:
+  desc: null
+  value:
+    min_num_params: 0
+    xla: false
+    xla_fsdp_v2: false
+    xla_fsdp_grad_ckpt: false
+fsdp_transformer_layer_cls_to_wrap:
+  desc: null
+  value: null
+accelerator_config:
+  desc: null
+  value:
+    split_batches: false
+    dispatch_batches: null
+    even_batches: true
+    use_seedable_sampler: true
+deepspeed:
+  desc: null
+  value: null
+label_smoothing_factor:
+  desc: null
+  value: 0.0
+optim:
+  desc: null
+  value: paged_adamw_8bit
+optim_args:
+  desc: null
+  value: null
+adafactor:
+  desc: null
+  value: false
+group_by_length:
+  desc: null
+  value: false
+length_column_name:
+  desc: null
+  value: length
+report_to:
+  desc: null
+  value:
+  - tensorboard
+  - wandb
+ddp_find_unused_parameters:
+  desc: null
+  value: null
+ddp_bucket_cap_mb:
+  desc: null
+  value: null
+ddp_broadcast_buffers:
+  desc: null
+  value: null
+dataloader_pin_memory:
+  desc: null
+  value: true
+dataloader_persistent_workers:
+  desc: null
+  value: false
+skip_memory_metrics:
+  desc: null
+  value: true
+use_legacy_prediction_loop:
+  desc: null
+  value: false
+push_to_hub:
+  desc: null
+  value: false
+resume_from_checkpoint:
+  desc: null
+  value: null
+hub_model_id:
+  desc: null
+  value: null
+hub_strategy:
+  desc: null
+  value: every_save
+hub_token:
+  desc: null
+  value: <HUB_TOKEN>
+hub_private_repo:
+  desc: null
+  value: false
+hub_always_push:
+  desc: null
+  value: false
+gradient_checkpointing:
+  desc: null
+  value: false
+gradient_checkpointing_kwargs:
+  desc: null
+  value: null
+include_inputs_for_metrics:
+  desc: null
+  value: false
+fp16_backend:
+  desc: null
+  value: auto
+push_to_hub_model_id:
+  desc: null
+  value: null
+push_to_hub_organization:
+  desc: null
+  value: null
+push_to_hub_token:
+  desc: null
+  value: <PUSH_TO_HUB_TOKEN>
+mp_parameters:
+  desc: null
+  value: ''
+auto_find_batch_size:
+  desc: null
+  value: false
+full_determinism:
+  desc: null
+  value: false
+torchdynamo:
+  desc: null
+  value: null
+ray_scope:
+  desc: null
+  value: last
+ddp_timeout:
+  desc: null
+  value: 1800
+torch_compile:
+  desc: null
+  value: false
+torch_compile_backend:
+  desc: null
+  value: null
+torch_compile_mode:
+  desc: null
+  value: null
+dispatch_batches:
+  desc: null
+  value: null
+split_batches:
+  desc: null
+  value: null
+include_tokens_per_second:
+  desc: null
+  value: false
+include_num_input_tokens_seen:
+  desc: null
+  value: false
+neftune_noise_alpha:
+  desc: null
+  value: null

wandb/run-20240402_072503-b95w935d/files/output.log ADDED Viewed

	@@ -0,0 +1,57 @@

+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.

wandb/run-20240402_072503-b95w935d/files/requirements.txt ADDED Viewed

	@@ -0,0 +1,882 @@

+Babel==2.14.0
+Boruta==0.3
+Brotli==1.0.9
+CVXcanon==0.1.2
+Cartopy==0.22.0
+Cython==3.0.8
+Deprecated==1.2.14
+Farama-Notifications==0.0.4
+Flask==3.0.2
+Geohash==1.0
+GitPython==3.1.41
+ImageHash==4.3.1
+Janome==0.5.0
+Jinja2==3.1.2
+Levenshtein==0.25.0
+LunarCalendar==0.0.9
+Mako==1.3.2
+Markdown==3.5.2
+MarkupSafe==2.1.3
+MarkupSafe==2.1.5
+Pillow==9.5.0
+PuLP==2.8.0
+PyArabic==0.6.15
+PyAstronomy==0.21.0
+PyJWT==2.8.0
+PyMeeus==0.5.12
+PySocks==1.7.1
+PyUpSet==0.1.1.post7
+PyWavelets==1.5.0
+PyYAML==6.0.1
+Pygments==2.17.2
+Pympler==1.0.1
+QtPy==2.4.1
+Rtree==1.2.0
+SQLAlchemy==2.0.25
+SecretStorage==3.3.3
+Send2Trash==1.8.2
+Shapely==1.8.5.post1
+Shimmy==1.3.0
+SimpleITK==2.3.1
+TPOT==0.12.1
+Theano-PyMC==1.1.2
+Theano==1.0.5
+Unidecode==1.3.8
+Wand==0.6.13
+Werkzeug==3.0.1
+absl-py==1.4.0
+accelerate==0.28.0
+access==1.1.9
+affine==2.4.0
+aiobotocore==2.12.1
+aiofiles==22.1.0
+aiohttp-cors==0.7.0
+aiohttp==3.9.1
+aioitertools==0.11.0
+aiorwlock==1.3.0
+aiosignal==1.3.1
+aiosqlite==0.19.0
+albumentations==1.4.0
+alembic==1.13.1
+altair==5.2.0
+annotated-types==0.6.0
+annoy==1.17.3
+anyio==4.2.0
+apache-beam==2.46.0
+aplus==0.11.0
+appdirs==1.4.4
+archspec==0.2.2
+argon2-cffi-bindings==21.2.0
+argon2-cffi==23.1.0
+array-record==0.5.0
+arrow==1.3.0
+arviz==0.17.1
+astroid==3.0.3
+astropy-iers-data==0.2024.3.18.0.29.47
+astropy==6.0.0
+asttokens==2.4.1
+astunparse==1.6.3
+async-lru==2.0.4
+async-timeout==4.0.3
+attrs==23.2.0
+audioread==3.0.1
+auto_gptq==0.7.1
+autopep8==2.0.4
+backoff==2.2.1
+bayesian-optimization==1.4.3
+beatrix_jupyterlab==2023.128.151533
+beautifulsoup4==4.12.2
+bidict==0.23.1
+bitsandbytes==0.43.0
+blake3==0.2.1
+bleach==6.1.0
+blessed==1.20.0
+blinker==1.7.0
+blis==0.7.10
+blosc2==2.5.1
+bokeh==3.3.4
+boltons==23.1.1
+boto3==1.26.100
+botocore==1.34.51
+bq_helper==0.4.1
+bqplot==0.12.43
+branca==0.7.1
+brewer2mpl==1.4.1
+brotlipy==0.7.0
+cached-property==1.5.2
+cachetools==4.2.4
+cachetools==5.3.2
+catalogue==2.0.10
+catalyst==22.4
+catboost==1.2.3
+category-encoders==2.6.3
+certifi==2024.2.2
+cesium==0.12.1
+cffi==1.16.0
+charset-normalizer==3.3.2
+chex==0.1.85
+cleverhans==4.0.0
+click-plugins==1.1.1
+click==8.1.7
+cligj==0.7.2
+cloud-tpu-client==0.10
+cloud-tpu-profiler==2.4.0
+cloudpathlib==0.16.0
+cloudpickle==2.2.1
+cloudpickle==3.0.0
+cmdstanpy==1.2.1
+cmudict==1.0.21
+colorama==0.4.6
+colorcet==3.1.0
+coloredlogs==15.0.1
+colorful==0.5.6
+colorlog==6.8.2
+colorlover==0.3.0
+comm==0.2.1
+conda-libmamba-solver==23.7.0
+conda-package-handling==2.2.0
+conda==23.7.4
+conda_package_streaming==0.9.0
+confection==0.1.4
+contextily==1.5.2
+contourpy==1.2.0
+convertdate==2.4.0
+cramjam==2.8.3
+crcmod==1.7
+cryptography==41.0.7
+cuda-python==12.4.0
+cudf==23.8.0
+cufflinks==0.17.3
+cuml==23.8.0
+cupy==13.0.0
+cycler==0.12.1
+cymem==2.0.8
+cytoolz==0.12.3
+daal4py==2024.1.0
+daal==2024.1.0
+dacite==1.8.1
+dask-cuda==23.8.0
+dask-cudf==23.8.0
+dask-expr==1.0.4
+dask==2024.3.1
+dataclasses-json==0.6.4
+dataproc_jupyter_plugin==0.1.66
+datasets==2.1.0
+datashader==0.16.0
+datatile==1.0.3
+db-dtypes==1.2.0
+deap==1.4.1
+debugpy==1.8.0
+decorator==5.1.1
+deepdiff==6.7.1
+defusedxml==0.7.1
+deprecation==2.1.0
+descartes==1.1.0
+dill==0.3.8
+dipy==1.9.0
+distlib==0.3.8
+distributed==2023.7.1
+distro==1.9.0
+dm-tree==0.1.8
+docker-pycreds==0.4.0
+docker==7.0.0
+docopt==0.6.2
+docstring-parser==0.15
+docstring-to-markdown==0.15
+docutils==0.20.1
+earthengine-api==0.1.394
+easydict==1.13
+easyocr==1.7.1
+ecos==2.0.13
+eli5==0.13.0
+emoji==2.10.1
+en-core-web-lg==3.7.1
+en-core-web-sm==3.7.1
+entrypoints==0.4
+ephem==4.1.5
+esda==2.5.1
+essentia==2.1b6.dev1110
+et-xmlfile==1.1.0
+etils==1.6.0
+exceptiongroup==1.2.0
+executing==2.0.1
+explainable-ai-sdk==1.3.3
+fastai==2.7.14
+fastapi==0.108.0
+fastavro==1.9.3
+fastcore==1.5.29
+fastdownload==0.0.7
+fasteners==0.19
+fastjsonschema==2.19.1
+fastparquet==2024.2.0
+fastprogress==1.0.3
+fastrlock==0.8.2
+fasttext==0.9.2
+feather-format==0.4.1
+featuretools==1.30.0
+filelock==3.13.1
+fiona==1.9.6
+fitter==1.7.0
+flake8==7.0.0
+flashtext==2.7
+flatbuffers==23.5.26
+flax==0.8.2
+folium==0.16.0
+fonttools==4.47.0
+fonttools==4.49.0
+fqdn==1.5.1
+frozendict==2.4.0
+frozenlist==1.4.1
+fsspec==2024.3.0
+funcy==2.0
+fury==0.10.0
+future==1.0.0
+fuzzywuzzy==0.18.0
+gast==0.5.4
+gatspy==0.3
+gcsfs==2023.12.2.post1
+gekko==1.1.0
+gensim==4.3.2
+geographiclib==2.0
+geojson==3.1.0
+geopandas==0.14.3
+geoplot==0.5.1
+geopy==2.4.1
+geoviews==1.11.1
+ggplot==0.11.5
+giddy==2.3.5
+gitdb==4.0.11
+google-ai-generativelanguage==0.4.0
+google-api-core==2.11.1
+google-api-core==2.17.1
+google-api-python-client==2.122.0
+google-apitools==0.5.31
+google-auth-httplib2==0.1.1
+google-auth-oauthlib==1.2.0
+google-auth==2.26.1
+google-cloud-aiplatform==0.6.0a1
+google-cloud-artifact-registry==1.10.0
+google-cloud-automl==1.0.1
+google-cloud-bigquery==2.34.4
+google-cloud-bigtable==1.7.3
+google-cloud-core==2.4.1
+google-cloud-datastore==2.19.0
+google-cloud-dlp==3.14.0
+google-cloud-jupyter-config==0.0.5
+google-cloud-language==2.13.3
+google-cloud-monitoring==2.18.0
+google-cloud-pubsub==2.19.0
+google-cloud-pubsublite==1.9.0
+google-cloud-recommendations-ai==0.7.1
+google-cloud-resource-manager==1.11.0
+google-cloud-spanner==3.40.1
+google-cloud-storage==1.44.0
+google-cloud-translate==3.12.1
+google-cloud-videointelligence==2.13.3
+google-cloud-vision==2.8.0
+google-crc32c==1.5.0
+google-generativeai==0.4.1
+google-pasta==0.2.0
+google-resumable-media==2.7.0
+googleapis-common-protos==1.62.0
+gplearn==0.4.2
+gpustat==1.0.0
+gpxpy==1.6.2
+graphviz==0.20.2
+greenlet==3.0.3
+grpc-google-iam-v1==0.12.7
+grpcio-status==1.48.1
+grpcio-status==1.48.2
+grpcio==1.51.1
+grpcio==1.60.0
+gviz-api==1.10.0
+gym-notices==0.0.8
+gym==0.26.2
+gymnasium==0.29.0
+h11==0.14.0
+h2o==3.46.0.1
+h5netcdf==1.3.0
+h5py==3.10.0
+haversine==2.8.1
+hdfs==2.7.3
+hep-ml==0.7.2
+hijri-converter==2.3.1
+hmmlearn==0.3.2
+holidays==0.24
+holoviews==1.18.3
+hpsklearn==0.1.0
+html5lib==1.1
+htmlmin==0.1.12
+httpcore==1.0.4
+httplib2==0.21.0
+httptools==0.6.1
+httpx==0.27.0
+huggingface-hub==0.21.4
+humanfriendly==10.0
+hunspell==0.5.5
+husl==4.0.3
+hydra-slayer==0.5.0
+hyperopt==0.2.7
+hypertools==0.8.0
+idna==3.6
+igraph==0.11.4
+imagecodecs==2024.1.1
+imageio==2.33.1
+imbalanced-learn==0.12.0
+imgaug==0.4.0
+importlib-metadata==6.11.0
+importlib-metadata==7.0.1
+importlib-resources==6.1.1
+inequality==1.0.1
+iniconfig==2.0.0
+ipydatawidgets==4.3.5
+ipykernel==6.28.0
+ipyleaflet==0.18.2
+ipympl==0.7.0
+ipython-genutils==0.2.0
+ipython-genutils==0.2.0
+ipython-sql==0.5.0
+ipython==8.20.0
+ipyvolume==0.6.3
+ipyvue==1.10.2
+ipyvuetify==1.9.2
+ipywebrtc==0.6.0
+ipywidgets==7.7.1
+isoduration==20.11.0
+isort==5.13.2
+isoweek==1.3.3
+itsdangerous==2.1.2
+jaraco.classes==3.3.0
+jax-jumpy==1.0.0
+jax==0.4.23
+jaxlib==0.4.23.dev20240116
+jedi==0.19.1
+jeepney==0.8.0
+jieba==0.42.1
+jmespath==1.0.1
+joblib==1.3.2
+json5==0.9.14
+jsonpatch==1.33
+jsonpointer==2.4
+jsonschema-specifications==2023.12.1
+jsonschema==4.20.0
+jupyter-console==6.6.3
+jupyter-events==0.9.0
+jupyter-http-over-ws==0.0.8
+jupyter-lsp==1.5.1
+jupyter-server-mathjax==0.2.6
+jupyter-ydoc==0.2.5
+jupyter_client==7.4.9
+jupyter_client==8.6.0
+jupyter_core==5.7.1
+jupyter_server==2.13.0
+jupyter_server_fileid==0.9.1
+jupyter_server_proxy==4.1.0
+jupyter_server_terminals==0.5.1
+jupyter_server_ydoc==0.8.0
+jupyterlab-lsp==5.1.0
+jupyterlab-widgets==3.0.9
+jupyterlab==4.1.5
+jupyterlab_git==0.44.0
+jupyterlab_pygments==0.3.0
+jupyterlab_server==2.25.2
+jupytext==1.16.0
+kaggle-environments==1.14.3
+kaggle==1.6.6
+kagglehub==0.2.0
+keras-cv==0.8.2
+keras-nlp==0.8.2
+keras-tuner==1.4.6
+keras==3.0.5
+kernels-mixer==0.0.7
+keyring==24.3.0
+keyrings.google-artifactregistry-auth==1.1.2
+kfp-pipeline-spec==0.2.2
+kfp-server-api==2.0.5
+kfp==2.5.0
+kiwisolver==1.4.5
+kmapper==2.0.1
+kmodes==0.12.2
+korean-lunar-calendar==0.3.1
+kornia==0.7.2
+kornia_rs==0.1.2
+kt-legacy==1.0.5
+kubernetes==26.1.0
+langcodes==3.3.0
+langid==1.1.6
+lazy_loader==0.3
+learntools==0.3.4
+leven==1.0.4
+libclang==16.0.6
+libmambapy==1.5.0
+libpysal==4.9.2
+librosa==0.10.1
+lightgbm==4.2.0
+lightning-utilities==0.10.1
+lime==0.2.0.1
+line-profiler==4.1.2
+linkify-it-py==2.0.3
+llvmlite==0.41.1
+llvmlite==0.42.0
+lml==0.1.0
+locket==1.0.0
+loguru==0.7.2
+lxml==5.1.0
+lz4==4.3.3
+mamba==1.5.0
+mapclassify==2.6.1
+markdown-it-py==3.0.0
+markovify==0.9.4
+marshmallow==3.21.1
+matplotlib-inline==0.1.6
+matplotlib-venn==0.11.10
+matplotlib==3.7.5
+matplotlib==3.8.3
+mccabe==0.7.0
+mdit-py-plugins==0.4.0
+mdurl==0.1.2
+memory-profiler==0.61.0
+menuinst==2.0.1
+mercantile==1.2.1
+mgwr==2.2.1
+missingno==0.5.2
+mistune==0.8.4
+mizani==0.11.0
+ml-dtypes==0.2.0
+mlcrate==0.2.0
+mlens==0.2.3
+mlxtend==0.23.1
+mmh3==4.1.0
+mne==1.6.1
+mnist==0.2.2
+mock==5.1.0
+momepy==0.7.0
+more-itertools==10.2.0
+mpld3==0.5.10
+mpmath==1.3.0
+msgpack-numpy==0.4.8
+msgpack==1.0.7
+multidict==6.0.4
+multimethod==1.10
+multipledispatch==1.0.0
+multiprocess==0.70.16
+munkres==1.1.4
+murmurhash==1.0.10
+mypy-extensions==1.0.0
+namex==0.0.7
+nb-conda-kernels==2.3.1
+nb_conda==2.2.1
+nbclassic==1.0.0
+nbclient==0.5.13
+nbconvert==6.4.5
+nbdime==3.2.0
+nbformat==5.9.2
+ndindex==1.8
+nest-asyncio==1.5.8
+networkx==3.2.1
+nibabel==5.2.1
+nilearn==0.10.3
+ninja==1.11.1.1
+nltk==3.2.4
+nose==1.3.7
+notebook==6.5.4
+notebook==6.5.6
+notebook_executor==0.2
+notebook_shim==0.2.3
+numba==0.58.1
+numba==0.59.0
+numexpr==2.9.0
+numpy==1.26.4
+nvidia-ml-py==11.495.46
+nvtx==0.2.10
+oauth2client==4.1.3
+oauthlib==3.2.2
+objsize==0.6.1
+odfpy==1.4.1
+olefile==0.47
+onnx==1.15.0
+opencensus-context==0.1.3
+opencensus==0.11.4
+opencv-contrib-python==4.9.0.80
+opencv-python-headless==4.9.0.80
+opencv-python==4.9.0.80
+openpyxl==3.1.2
+openslide-python==1.3.1
+opentelemetry-api==1.22.0
+opentelemetry-exporter-otlp-proto-common==1.22.0
+opentelemetry-exporter-otlp-proto-grpc==1.22.0
+opentelemetry-exporter-otlp-proto-http==1.22.0
+opentelemetry-exporter-otlp==1.22.0
+opentelemetry-proto==1.22.0
+opentelemetry-sdk==1.22.0
+opentelemetry-semantic-conventions==0.43b0
+opt-einsum==3.3.0
+optax==0.2.1
+optimum==1.18.0
+optuna==3.6.0
+orbax-checkpoint==0.5.6
+ordered-set==4.1.0
+orjson==3.9.10
+ortools==9.4.1874
+osmnx==1.9.1
+overrides==7.4.0
+packaging==21.3
+pandas-datareader==0.10.0
+pandas-profiling==3.6.6
+pandas-summary==0.2.0
+pandas==2.1.4
+pandas==2.2.1
+pandasql==0.7.3
+pandocfilters==1.5.0
+panel==1.3.8
+papermill==2.5.0
+param==2.0.2
+parso==0.8.3
+partd==1.4.1
+path.py==12.5.0
+path==16.10.0
+pathos==0.3.2
+pathy==0.10.3
+patsy==0.5.6
+pdf2image==1.17.0
+peft==0.10.0
+pettingzoo==1.24.0
+pexpect==4.8.0
+pexpect==4.9.0
+phik==0.12.4
+pickleshare==0.7.5
+pip==23.3.2
+pkgutil_resolve_name==1.3.10
+platformdirs==4.2.0
+plotly-express==0.4.1
+plotly==5.18.0
+plotnine==0.13.2
+pluggy==1.4.0
+pointpats==2.4.0
+polars==0.20.15
+polyglot==16.7.4
+pooch==1.8.1
+pox==0.3.4
+ppca==0.0.4
+ppft==1.7.6.8
+preprocessing==0.1.13
+preshed==3.0.9
+prettytable==3.9.0
+progressbar2==4.4.2
+prometheus-client==0.19.0
+promise==2.3
+prompt-toolkit==3.0.42
+prompt-toolkit==3.0.43
+pronouncing==0.2.0
+prophet==1.1.1
+proto-plus==1.23.0
+protobuf==3.20.3
+protobuf==4.21.12
+psutil==5.9.3
+psutil==5.9.7
+ptyprocess==0.7.0
+pudb==2024.1
+pure-eval==0.2.2
+py-cpuinfo==9.0.0
+py-spy==0.3.14
+py4j==0.10.9.7
+pyLDAvis==3.4.1
+pyOpenSSL==23.3.0
+pyaml==23.12.0
+pyarrow==11.0.0
+pyasn1-modules==0.3.0
+pyasn1==0.5.1
+pybind11==2.11.1
+pyclipper==1.3.0.post5
+pycodestyle==2.11.1
+pycosat==0.6.6
+pycparser==2.21
+pycryptodome==3.20.0
+pyct==0.5.0
+pycuda==2024.1
+pydantic==2.5.3
+pydantic==2.6.4
+pydantic_core==2.14.6
+pydantic_core==2.16.3
+pydegensac==0.1.2
+pydicom==2.4.4
+pydocstyle==6.3.0
+pydot==1.4.2
+pydub==0.25.1
+pyemd==1.0.0
+pyerfa==2.0.1.1
+pyexcel-io==0.6.6
+pyexcel-ods==0.6.0
+pyflakes==3.2.0
+pygltflib==1.16.2
+pykalman==0.9.5
+pylibraft==23.8.0
+pylint==3.0.4
+pymc3==3.11.4
+pymongo==3.13.0
+pynndescent==0.5.11
+pynvml==11.4.1
+pynvrtc==9.2
+pyocr==0.8.5
+pyparsing==3.1.1
+pyparsing==3.1.2
+pypdf==4.1.0
+pyproj==3.6.1
+pysal==24.1
+pyshp==2.3.1
+pytesseract==0.3.10
+pytest==8.1.1
+python-Levenshtein==0.25.0
+python-bidi==0.4.2
+python-dateutil==2.9.0.post0
+python-dotenv==1.0.0
+python-json-logger==2.0.7
+python-louvain==0.16
+python-lsp-jsonrpc==1.1.2
+python-lsp-server==1.10.1
+python-slugify==8.0.4
+python-utils==3.8.2
+pythreejs==2.4.2
+pytoolconfig==1.3.1
+pytools==2023.1.1
+pytorch-ignite==0.4.13
+pytorch-lightning==2.2.1
+pytz==2023.3.post1
+pytz==2024.1
+pyu2f==0.1.5
+pyviz_comms==3.0.1
+pyzmq==24.0.1
+pyzmq==25.1.2
+qgrid==1.3.1
+qtconsole==5.5.1
+quantecon==0.7.2
+quantities==0.15.0
+qudida==0.0.4
+raft-dask==23.8.0
+rapidfuzz==3.6.2
+rasterio==1.3.9
+rasterstats==0.19.0
+ray-cpp==2.9.0
+ray==2.9.0
+referencing==0.32.1
+regex==2023.12.25
+requests-oauthlib==1.3.1
+requests-toolbelt==0.10.1
+requests==2.31.0
+responses==0.18.0
+retrying==1.3.3
+retrying==1.3.4
+rfc3339-validator==0.1.4
+rfc3986-validator==0.1.1
+rgf-python==3.12.0
+rich-click==1.7.4
+rich==13.7.0
+rich==13.7.1
+rmm==23.8.0
+rope==1.12.0
+rouge==1.0.1
+rpds-py==0.16.2
+rsa==4.9
+ruamel-yaml-conda==0.15.100
+ruamel.yaml.clib==0.2.7
+ruamel.yaml==0.17.40
+s2sphere==0.2.5
+s3fs==2024.3.0
+s3transfer==0.6.2
+safetensors==0.4.2
+scattertext==0.1.19
+scikit-image==0.22.0
+scikit-learn-intelex==2024.1.0
+scikit-learn==1.2.2
+scikit-multilearn==0.2.0
+scikit-optimize==0.10.1
+scikit-plot==0.3.7
+scikit-surprise==1.1.3
+scipy==1.11.4
+scipy==1.12.0
+seaborn==0.12.2
+segment_anything==1.0
+segregation==2.5
+semver==3.0.2
+sentencepiece==0.2.0
+sentry-sdk==1.42.0
+setproctitle==1.3.3
+setuptools-git==1.2
+setuptools-scm==8.0.4
+setuptools==69.0.3
+shap==0.44.1
+shapely==2.0.3
+shellingham==1.5.4
+simpervisor==1.0.0
+simplejson==3.19.2
+six==1.16.0
+sklearn-pandas==2.2.0
+slicer==0.0.7
+smart-open==6.4.0
+smmap==5.0.1
+sniffio==1.3.0
+snowballstemmer==2.2.0
+snuggs==1.4.7
+sortedcontainers==2.4.0
+soundfile==0.12.1
+soupsieve==2.5
+soxr==0.3.7
+spacy-legacy==3.0.12
+spacy-loggers==1.0.5
+spacy==3.7.2
+spaghetti==1.7.5.post1
+spectral==0.23.1
+spglm==1.1.0
+sphinx-rtd-theme==0.2.4
+spint==1.0.7
+splot==1.1.5.post1
+spopt==0.6.0
+spreg==1.4.2
+spvcm==0.3.0
+sqlparse==0.4.4
+squarify==0.4.3
+srsly==2.4.8
+stable-baselines3==2.1.0
+stack-data==0.6.2
+stack-data==0.6.3
+stanio==0.3.0
+starlette==0.32.0.post1
+statsmodels==0.14.1
+stemming==1.0.1
+stop-words==2018.7.23
+stopit==1.1.2
+stumpy==1.12.0
+sympy==1.12
+tables==3.9.2
+tabulate==0.9.0
+tangled-up-in-unicode==0.2.0
+tbb==2021.11.0
+tblib==3.0.0
+tenacity==8.2.3
+tensorboard-data-server==0.7.2
+tensorboard-plugin-profile==2.15.0
+tensorboard==2.15.1
+tensorboardX==2.6.2.2
+tensorflow-cloud==0.1.16
+tensorflow-datasets==4.9.4
+tensorflow-decision-forests==1.8.1
+tensorflow-estimator==2.15.0
+tensorflow-hub==0.16.1
+tensorflow-io-gcs-filesystem==0.35.0
+tensorflow-io==0.35.0
+tensorflow-metadata==0.14.0
+tensorflow-probability==0.23.0
+tensorflow-serving-api==2.14.1
+tensorflow-text==2.15.0
+tensorflow-transform==0.14.0
+tensorflow==2.15.0
+tensorpack==0.11
+tensorstore==0.1.56
+termcolor==2.4.0
+terminado==0.18.0
+testpath==0.6.0
+text-unidecode==1.3
+textblob==0.18.0.post0
+texttable==1.7.0
+tf_keras==2.15.1
+tfp-nightly==0.24.0.dev0
+thinc==8.2.2
+threadpoolctl==3.2.0
+tifffile==2023.12.9
+timm==0.9.16
+tinycss2==1.2.1
+tobler==0.11.2
+tokenizers==0.15.2
+toml==0.10.2
+tomli==2.0.1
+tomlkit==0.12.4
+toolz==0.12.1
+torch==2.1.2
+torchaudio==2.1.2
+torchdata==0.7.1
+torchinfo==1.8.0
+torchmetrics==1.3.2
+torchtext==0.16.2
+torchvision==0.16.2
+tornado==6.3.3
+tqdm==4.66.1
+traceml==1.0.8
+traitlets==5.9.0
+traittypes==0.2.1
+transformers==4.38.2
+treelite-runtime==3.2.0
+treelite==3.2.0
+trueskill==0.4.5
+truststore==0.8.0
+trx-python==0.2.9
+tsfresh==0.20.2
+typeguard==4.1.5
+typer==0.9.0
+types-python-dateutil==2.8.19.20240106
+typing-inspect==0.9.0
+typing-utils==0.1.0
+typing_extensions==4.9.0
+tzdata==2023.4
+uc-micro-py==1.0.3
+ucx-py==0.33.0
+ujson==5.9.0
+umap-learn==0.5.5
+unicodedata2==15.1.0
+update-checker==0.18.0
+uri-template==1.3.0
+uritemplate==3.0.1
+urllib3==1.26.18
+urllib3==2.1.0
+urwid==2.6.9
+urwid_readline==0.14
+uvicorn==0.25.0
+uvloop==0.19.0
+vaex-astro==0.9.3
+vaex-core==4.17.1
+vaex-hdf5==0.14.1
+vaex-jupyter==0.8.2
+vaex-ml==0.18.3
+vaex-server==0.9.0
+vaex-viz==0.5.4
+vaex==4.17.0
+vec_noise==1.1.4
+vecstack==0.4.0
+virtualenv==20.21.0
+visions==0.7.5
+vowpalwabbit==9.9.0
+vtk==9.3.0
+wandb==0.16.4
+wasabi==1.1.2
+watchfiles==0.21.0
+wavio==0.0.8
+wcwidth==0.2.13
+weasel==0.3.4
+webcolors==1.13
+webencodings==0.5.1
+websocket-client==1.7.0
+websockets==12.0
+wfdb==4.1.2
+whatthepatch==1.0.5
+wheel==0.42.0
+widgetsnbextension==3.6.6
+witwidget==1.8.1
+woodwork==0.29.0
+wordcloud==1.9.3
+wordsegment==1.3.1
+wrapt==1.14.1
+xarray-einstats==0.7.0
+xarray==2024.2.0
+xgboost==2.0.3
+xvfbwrapper==0.2.9
+xxhash==3.4.1
+xyzservices==2023.10.1
+y-py==0.6.2
+yapf==0.40.2
+yarl==1.9.3
+yarl==1.9.4
+ydata-profiling==4.6.4
+yellowbrick==1.5
+ypy-websocket==0.8.4
+zict==3.0.0
+zipp==3.17.0
+zstandard==0.22.0

wandb/run-20240402_072503-b95w935d/files/wandb-metadata.json ADDED Viewed

	@@ -0,0 +1,66 @@

+{
+    "os": "Linux-5.15.133+-x86_64-with-glibc2.31",
+    "python": "3.10.13",
+    "heartbeatAt": "2024-04-02T07:25:04.219238",
+    "startedAt": "2024-04-02T07:25:03.308167",
+    "docker": null,
+    "cuda": null,
+    "args": [],
+    "state": "running",
+    "program": "kaggle.ipynb",
+    "codePathLocal": null,
+    "root": "/kaggle/working",
+    "host": "0ecfc053b35f",
+    "username": "root",
+    "executable": "/opt/conda/bin/python3.10",
+    "cpu_count": 2,
+    "cpu_count_logical": 4,
+    "cpu_freq": {
+        "current": 2000.142,
+        "min": 0.0,
+        "max": 0.0
+    },
+    "cpu_freq_per_core": [
+        {
+            "current": 2000.142,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.142,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.142,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.142,
+            "min": 0.0,
+            "max": 0.0
+        }
+    ],
+    "disk": {
+        "/": {
+            "total": 8062.387607574463,
+            "used": 5524.099773406982
+        }
+    },
+    "gpu": "Tesla T4",
+    "gpu_count": 2,
+    "gpu_devices": [
+        {
+            "name": "Tesla T4",
+            "memory_total": 16106127360
+        },
+        {
+            "name": "Tesla T4",
+            "memory_total": 16106127360
+        }
+    ],
+    "memory": {
+        "total": 31.357559204101562
+    }
+}

wandb/run-20240402_072503-b95w935d/files/wandb-summary.json ADDED Viewed

	@@ -0,0 +1 @@

+ {"train/loss": 0.3094, "train/grad_norm": 0.5635281801223755, "train/learning_rate": 0.0, "train/epoch": 28.8, "train/global_step": 360, "_timestamp": 1712049025.4445002, "_runtime": 6322.128933191299, "_step": 58, "eval/loss": 0.549111545085907, "eval/runtime": 19.991, "eval/samples_per_second": 3.752, "eval/steps_per_second": 0.65, "train/train_runtime": 6328.6782, "train/train_samples_per_second": 1.422, "train/train_steps_per_second": 0.057, "train/total_flos": 861990497353728.0, "train/train_loss": 0.5323640836609734}

wandb/run-20240402_072503-b95w935d/logs/debug-internal.log ADDED Viewed

The diff for this file is too large to render. See raw diff

wandb/run-20240402_072503-b95w935d/logs/debug.log ADDED Viewed

	@@ -0,0 +1,43 @@

+2024-04-02 07:25:03,309 INFO    MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
+2024-04-02 07:25:03,309 INFO    MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240402_072503-b95w935d/logs/debug.log
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240402_072503-b95w935d/logs/debug-internal.log
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7e03f1c07cd0>
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:init():566] calling init triggers
+2024-04-02 07:25:03,310 INFO    MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
+config: {}
+2024-04-02 07:25:03,311 INFO    MainThread:34 [wandb_init.py:init():616] starting backend
+2024-04-02 07:25:03,311 INFO    MainThread:34 [wandb_init.py:init():620] setting up manager
+2024-04-02 07:25:03,312 INFO    MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2024-04-02 07:25:03,315 INFO    MainThread:34 [wandb_init.py:init():628] backend started and connected
+2024-04-02 07:25:03,328 INFO    MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
+2024-04-02 07:25:03,908 INFO    MainThread:34 [wandb_init.py:init():720] updated telemetry
+2024-04-02 07:25:03,911 INFO    MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
+2024-04-02 07:25:04,116 INFO    MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
+2024-04-02 07:25:04,182 INFO    MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.5 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
+2024-04-02 07:25:04,183 INFO    MainThread:34 [wandb_init.py:init():804] starting run threads in backend
+2024-04-02 07:25:35,178 INFO    MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
+2024-04-02 07:25:35,178 INFO    MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
+2024-04-02 07:25:35,179 INFO    MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
+2024-04-02 07:25:35,179 INFO    MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
+2024-04-02 07:25:35,180 INFO    MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
+2024-04-02 07:25:35,186 INFO    MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 32000, 'max_position_embeddings': 32768, 'hidden_size': 4096, 'intermediate_size': 14336, 'num_hidden_layers': 32, 'num_attention_heads': 32, 'sliding_window': None, 'num_key_value_heads': 8, 'hidden_act': 'silu', 'initializer_range': 0.02, 'rms_norm_eps': 1e-05, 'use_cache': False, 'rope_theta': 1000000.0, 'attention_dropout': 0.0, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': False, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['MistralForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 0, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'TheBloke/Mistral-7B-Instruct-v0.2-GPTQ', 'transformers_version': '4.38.2', 'model_type': 'mistral', 'pretraining_tp': 1, 'quantization_config': {'quant_method': 'QuantizationMethod.GPTQ', 'bits': 4, 'tokenizer': None, 'dataset': None, 'group_size': 128, 'damp_percent': 0.1, 'desc_act': True, 'sym': True, 'true_sequential': True, 'use_cuda_fp16': False, 'model_seqlen': None, 'block_name_to_quantize': None, 'module_name_preceding_first_block': None, 'batch_size': 1, 'pad_token_id': None, 'use_exllama': True, 'max_input_length': None, 'exllama_config': {'version': 'ExllamaVersion.ONE'}, 'cache_block_outputs': True, 'modules_in_block_to_quantize': None}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 30, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr02_07-24-52_0ecfc053b35f', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
+2024-04-02 09:10:25,449 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:10:25,449 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:10:45,681 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:10:45,704 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:10:45,704 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:24,497 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:11:24,498 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:11:24,498 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:34,304 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
+2024-04-02 09:11:34,306 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-04-02 09:11:34,306 INFO    MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
+2024-04-02 09:11:35,541 INFO    MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend

wandb/run-20240402_072503-b95w935d/run-b95w935d.wandb ADDED Viewed

Binary file (218 kB). View file