diff --git a/LICENSE b/LICENSE new file mode 100644 index 0000000000000000000000000000000000000000..ac09290cb2947183ab7f19d7477511dad3600a7a --- /dev/null +++ b/LICENSE @@ -0,0 +1,91 @@ +DEEPSEEK LICENSE AGREEMENT + +Version 1.0, 23 October 2023 + +Copyright (c) 2023 DeepSeek + +Section I: PREAMBLE + +Large generative models are being widely adopted and used, and have the potential to transform the way individuals conceive and benefit from AI or ML technologies. + +Notwithstanding the current and potential benefits that these artifacts can bring to society at large, there are also concerns about potential misuses of them, either due to their technical limitations or ethical considerations. + +In short, this license strives for both the open and responsible downstream use of the accompanying model. When it comes to the open character, we took inspiration from open source permissive licenses regarding the grant of IP rights. Referring to the downstream responsible use, we added use-based restrictions not permitting the use of the model in very specific scenarios, in order for the licensor to be able to enforce the license in case potential misuses of the Model may occur. At the same time, we strive to promote open and responsible research on generative models for content generation. + +Even though downstream derivative versions of the model could be released under different licensing terms, the latter will always have to include - at minimum - the same use-based restrictions as the ones in the original license (this license). We believe in the intersection between open and responsible AI development; thus, this agreement aims to strike a balance between both in order to enable responsible open-science in the field of AI. + +This License governs the use of the model (and its derivatives) and is informed by the model card associated with the model. + +NOW THEREFORE, You and DeepSeek agree as follows: + +1. Definitions +"License" means the terms and conditions for use, reproduction, and Distribution as defined in this document. +"Data" means a collection of information and/or content extracted from the dataset used with the Model, including to train, pretrain, or otherwise evaluate the Model. The Data is not licensed under this License. +"Output" means the results of operating a Model as embodied in informational content resulting therefrom. +"Model" means any accompanying machine-learning based assemblies (including checkpoints), consisting of learnt weights, parameters (including optimizer states), corresponding to the model architecture as embodied in the Complementary Material, that have been trained or tuned, in whole or in part on the Data, using the Complementary Material. +"Derivatives of the Model" means all modifications to the Model, works based on the Model, or any other model which is created or initialized by transfer of patterns of the weights, parameters, activations or output of the Model, to the other model, in order to cause the other model to perform similarly to the Model, including - but not limited to - distillation methods entailing the use of intermediate data representations or methods based on the generation of synthetic data by the Model for training the other model. +"Complementary Material" means the accompanying source code and scripts used to define, run, load, benchmark or evaluate the Model, and used to prepare data for training or evaluation, if any. This includes any accompanying documentation, tutorials, examples, etc, if any. +"Distribution" means any transmission, reproduction, publication or other sharing of the Model or Derivatives of the Model to a third party, including providing the Model as a hosted service made available by electronic or other remote means - e.g. API-based or web access. +"DeepSeek" (or "we") means Beijing DeepSeek Artificial Intelligence Fundamental Technology Research Co., Ltd., Hangzhou DeepSeek Artificial Intelligence Fundamental Technology Research Co., Ltd. and/or any of their affiliates. +"You" (or "Your") means an individual or Legal Entity exercising permissions granted by this License and/or making use of the Model for whichever purpose and in any field of use, including usage of the Model in an end-use application - e.g. chatbot, translator, etc. +"Third Parties" means individuals or legal entities that are not under common control with DeepSeek or You. + +Section II: INTELLECTUAL PROPERTY RIGHTS + +Both copyright and patent grants apply to the Model, Derivatives of the Model and Complementary Material. The Model and Derivatives of the Model are subject to additional terms as described in Section III. + +2. Grant of Copyright License. Subject to the terms and conditions of this License, DeepSeek hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable copyright license to reproduce, prepare, publicly display, publicly perform, sublicense, and distribute the Complementary Material, the Model, and Derivatives of the Model. + +3. Grant of Patent License. Subject to the terms and conditions of this License and where and as applicable, DeepSeek hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable (except as stated in this paragraph) patent license to make, have made, use, offer to sell, sell, import, and otherwise transfer the Model and the Complementary Material, where such license applies only to those patent claims licensable by DeepSeek that are necessarily infringed by its contribution(s). If You institute patent litigation against any entity (including a cross-claim or counterclaim in a lawsuit) alleging that the Model and/or Complementary Material constitutes direct or contributory patent infringement, then any patent licenses granted to You under this License for the Model and/or works shall terminate as of the date such litigation is asserted or filed. + + +Section III: CONDITIONS OF USAGE, DISTRIBUTION AND REDISTRIBUTION + +4. Distribution and Redistribution. You may host for Third Party remote access purposes (e.g. software-as-a-service), reproduce and distribute copies of the Model or Derivatives of the Model thereof in any medium, with or without modifications, provided that You meet the following conditions: +a. Use-based restrictions as referenced in paragraph 5 MUST be included as an enforceable provision by You in any type of legal agreement (e.g. a license) governing the use and/or distribution of the Model or Derivatives of the Model, and You shall give notice to subsequent users You Distribute to, that the Model or Derivatives of the Model are subject to paragraph 5. This provision does not apply to the use of Complementary Material. +b. You must give any Third Party recipients of the Model or Derivatives of the Model a copy of this License; +c. You must cause any modified files to carry prominent notices stating that You changed the files; +d. You must retain all copyright, patent, trademark, and attribution notices excluding those notices that do not pertain to any part of the Model, Derivatives of the Model. +e. You may add Your own copyright statement to Your modifications and may provide additional or different license terms and conditions - respecting paragraph 4.a. – for use, reproduction, or Distribution of Your modifications, or for any such Derivatives of the Model as a whole, provided Your use, reproduction, and Distribution of the Model otherwise complies with the conditions stated in this License. + +5. Use-based restrictions. The restrictions set forth in Attachment A are considered Use-based restrictions. Therefore You cannot use the Model and the Derivatives of the Model for the specified restricted uses. You may use the Model subject to this License, including only for lawful purposes and in accordance with the License. Use may include creating any content with, finetuning, updating, running, training, evaluating and/or reparametrizing the Model. You shall require all of Your users who use the Model or a Derivative of the Model to comply with the terms of this paragraph (paragraph 5). + +6. The Output You Generate. Except as set forth herein, DeepSeek claims no rights in the Output You generate using the Model. You are accountable for the Output you generate and its subsequent uses. No use of the output can contravene any provision as stated in the License. + +Section IV: OTHER PROVISIONS + +7. Updates and Runtime Restrictions. To the maximum extent permitted by law, DeepSeek reserves the right to restrict (remotely or otherwise) usage of the Model in violation of this License. + +8. Trademarks and related. Nothing in this License permits You to make use of DeepSeek’ trademarks, trade names, logos or to otherwise suggest endorsement or misrepresent the relationship between the parties; and any rights not expressly granted herein are reserved by DeepSeek. + +9. Personal information, IP rights and related. This Model may contain personal information and works with IP rights. You commit to complying with applicable laws and regulations in the handling of personal information and the use of such works. Please note that DeepSeek's license granted to you to use the Model does not imply that you have obtained a legitimate basis for processing the related information or works. As an independent personal information processor and IP rights user, you need to ensure full compliance with relevant legal and regulatory requirements when handling personal information and works with IP rights that may be contained in the Model, and are willing to assume solely any risks and consequences that may arise from that. + +10. Disclaimer of Warranty. Unless required by applicable law or agreed to in writing, DeepSeek provides the Model and the Complementary Material on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied, including, without limitation, any warranties or conditions of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE. You are solely responsible for determining the appropriateness of using or redistributing the Model, Derivatives of the Model, and the Complementary Material and assume any risks associated with Your exercise of permissions under this License. + +11. Limitation of Liability. In no event and under no legal theory, whether in tort (including negligence), contract, or otherwise, unless required by applicable law (such as deliberate and grossly negligent acts) or agreed to in writing, shall DeepSeek be liable to You for damages, including any direct, indirect, special, incidental, or consequential damages of any character arising as a result of this License or out of the use or inability to use the Model and the Complementary Material (including but not limited to damages for loss of goodwill, work stoppage, computer failure or malfunction, or any and all other commercial damages or losses), even if DeepSeek has been advised of the possibility of such damages. + +12. Accepting Warranty or Additional Liability. While redistributing the Model, Derivatives of the Model and the Complementary Material thereof, You may choose to offer, and charge a fee for, acceptance of support, warranty, indemnity, or other liability obligations and/or rights consistent with this License. However, in accepting such obligations, You may act only on Your own behalf and on Your sole responsibility, not on behalf of DeepSeek, and only if You agree to indemnify, defend, and hold DeepSeek harmless for any liability incurred by, or claims asserted against, DeepSeek by reason of your accepting any such warranty or additional liability. + +13. If any provision of this License is held to be invalid, illegal or unenforceable, the remaining provisions shall be unaffected thereby and remain valid as if such provision had not been set forth herein. + +14. Governing Law and Jurisdiction. This agreement will be governed and construed under PRC laws without regard to choice of law principles, and the UN Convention on Contracts for the International Sale of Goods does not apply to this agreement. The courts located in the domicile of Hangzhou DeepSeek Artificial Intelligence Fundamental Technology Research Co., Ltd. shall have exclusive jurisdiction of any dispute arising out of this agreement. + +END OF TERMS AND CONDITIONS + +Attachment A + +Use Restrictions + +You agree not to use the Model or Derivatives of the Model: + +- In any way that violates any applicable national or international law or regulation or infringes upon the lawful rights and interests of any third party; +- For military use in any way; +- For the purpose of exploiting, harming or attempting to exploit or harm minors in any way; +- To generate or disseminate verifiably false information and/or content with the purpose of harming others; +- To generate or disseminate inappropriate content subject to applicable regulatory requirements; +- To generate or disseminate personal identifiable information without due authorization or for unreasonable use; +- To defame, disparage or otherwise harass others; +- For fully automated decision making that adversely impacts an individual’s legal rights or otherwise creates or modifies a binding, enforceable obligation; +- For any use intended to or which has the effect of discriminating against or harming individuals or groups based on online or offline social behavior or known or predicted personal or personality characteristics; +- To exploit any of the vulnerabilities of a specific group of persons based on their age, social, physical or mental characteristics, in order to materially distort the behavior of a person pertaining to that group in a manner that causes or is likely to cause that person or another person physical or psychological harm; +- For any use intended to or which has the effect of discriminating against individuals or groups based on legally protected characteristics or categories. \ No newline at end of file diff --git a/config.json b/config.json new file mode 100644 index 0000000000000000000000000000000000000000..d1ce5637fa7ac54dd0d78c8fe05f4d73a55a5a1f --- /dev/null +++ b/config.json @@ -0,0 +1,60 @@ +{ + "architectures": [ + "DeepseekV2ForCausalLM" + ], + "attention_bias": false, + "attention_dropout": 0.0, + "auto_map": { + "AutoConfig": "configuration_deepseek.DeepseekV2Config", + "AutoModel": "modeling_deepseek.DeepseekV2Model", + "AutoModelForCausalLM": "modeling_deepseek.DeepseekV2ForCausalLM" + }, + "aux_loss_alpha": 0.001, + "bos_token_id": 100000, + "eos_token_id": 100001, + "ep_size": 1, + "first_k_dense_replace": 1, + "hidden_act": "silu", + "hidden_size": 5120, + "initializer_range": 0.02, + "intermediate_size": 12288, + "kv_lora_rank": 512, + "max_position_embeddings": 163840, + "model_type": "deepseek_v2", + "moe_intermediate_size": 1536, + "moe_layer_freq": 1, + "n_group": 8, + "n_routed_experts": 160, + "n_shared_experts": 2, + "norm_topk_prob": false, + "num_attention_heads": 128, + "num_experts_per_tok": 6, + "num_hidden_layers": 60, + "num_key_value_heads": 128, + "pretraining_tp": 1, + "q_lora_rank": 1536, + "qk_nope_head_dim": 128, + "qk_rope_head_dim": 64, + "rms_norm_eps": 1e-06, + "rope_scaling": { + "beta_fast": 32, + "beta_slow": 1, + "factor": 40, + "mscale": 1.0, + "mscale_all_dim": 1.0, + "original_max_position_embeddings": 4096, + "type": "yarn" + }, + "rope_theta": 10000, + "routed_scaling_factor": 16.0, + "scoring_func": "softmax", + "seq_aux": true, + "tie_word_embeddings": false, + "topk_group": 3, + "topk_method": "group_limited_greedy", + "torch_dtype": "bfloat16", + "transformers_version": "4.39.3", + "use_cache": true, + "v_head_dim": 128, + "vocab_size": 102400 +} diff --git a/configuration_deepseek.py b/configuration_deepseek.py new file mode 100644 index 0000000000000000000000000000000000000000..82e0f5d9d33620a66e328fdeae0b8dc12e2cff7c --- /dev/null +++ b/configuration_deepseek.py @@ -0,0 +1,206 @@ +from transformers.configuration_utils import PretrainedConfig +from transformers.utils import logging + +logger = logging.get_logger(__name__) + +DEEPSEEK_PRETRAINED_CONFIG_ARCHIVE_MAP = {} +class DeepseekV2Config(PretrainedConfig): + r""" + This is the configuration class to store the configuration of a [`DeepseekV2Model`]. It is used to instantiate an DeepSeek + model according to the specified arguments, defining the model architecture. Instantiating a configuration with the + defaults will yield a similar configuration to that of the DeepSeek-V2. + + Configuration objects inherit from [`PretrainedConfig`] and can be used to control the model outputs. Read the + documentation from [`PretrainedConfig`] for more information. + + + Args: + vocab_size (`int`, *optional*, defaults to 102400): + Vocabulary size of the Deep model. Defines the number of different tokens that can be represented by the + `inputs_ids` passed when calling [`DeepseekV2Model`] + hidden_size (`int`, *optional*, defaults to 4096): + Dimension of the hidden representations. + intermediate_size (`int`, *optional*, defaults to 11008): + Dimension of the MLP representations. + moe_intermediate_size (`int`, *optional*, defaults to 1407): + Dimension of the MoE representations. + num_hidden_layers (`int`, *optional*, defaults to 32): + Number of hidden layers in the Transformer decoder. + num_attention_heads (`int`, *optional*, defaults to 32): + Number of attention heads for each attention layer in the Transformer decoder. + n_shared_experts (`int`, *optional*, defaults to None): + Number of shared experts, None means dense model. + n_routed_experts (`int`, *optional*, defaults to None): + Number of routed experts, None means dense model. + routed_scaling_factor (`float`, *optional*, defaults to 1.0): + Scaling factor or routed experts. + topk_method (`str`, *optional*, defaults to `gready`): + Topk method used in routed gate. + n_group (`int`, *optional*, defaults to None): + Number of groups for routed experts. + topk_group (`int`, *optional*, defaults to None): + Number of selected groups for each token(for each token, ensuring the selected experts is only within `topk_group` groups). + num_experts_per_tok (`int`, *optional*, defaults to None): + Number of selected experts, None means dense model. + moe_layer_freq (`int`, *optional*, defaults to 1): + The frequency of the MoE layer: one expert layer for every `moe_layer_freq - 1` dense layers. + first_k_dense_replace (`int`, *optional*, defaults to 0): + Number of dense layers in shallow layers(embed->dense->dense->...->dense->moe->moe...->lm_head). + \--k dense layers--/ + norm_topk_prob (`bool`, *optional*, defaults to False): + Whether to normalize the weights of the routed experts. + scoring_func (`str`, *optional*, defaults to 'softmax'): + Method of computing expert weights. + aux_loss_alpha (`float`, *optional*, defaults to 0.001): + Auxiliary loss weight coefficient. + seq_aux = (`bool`, *optional*, defaults to True): + Whether to compute the auxiliary loss for each individual sample. + num_key_value_heads (`int`, *optional*): + This is the number of key_value heads that should be used to implement Grouped Query Attention. If + `num_key_value_heads=num_attention_heads`, the model will use Multi Head Attention (MHA), if + `num_key_value_heads=1 the model will use Multi Query Attention (MQA) otherwise GQA is used. When + converting a multi-head checkpoint to a GQA checkpoint, each group key and value head should be constructed + by meanpooling all the original heads within that group. For more details checkout [this + paper](https://arxiv.org/pdf/2305.13245.pdf). If it is not specified, will default to + `num_attention_heads`. + hidden_act (`str` or `function`, *optional*, defaults to `"silu"`): + The non-linear activation function (function or string) in the decoder. + max_position_embeddings (`int`, *optional*, defaults to 2048): + The maximum sequence length that this model might ever be used with. + initializer_range (`float`, *optional*, defaults to 0.02): + The standard deviation of the truncated_normal_initializer for initializing all weight matrices. + rms_norm_eps (`float`, *optional*, defaults to 1e-06): + The epsilon used by the rms normalization layers. + use_cache (`bool`, *optional*, defaults to `True`): + Whether or not the model should return the last key/values attentions (not used by all models). Only + relevant if `config.is_decoder=True`. + pad_token_id (`int`, *optional*): + Padding token id. + bos_token_id (`int`, *optional*, defaults to 1): + Beginning of stream token id. + eos_token_id (`int`, *optional*, defaults to 2): + End of stream token id. + pretraining_tp (`int`, *optional*, defaults to 1): + Experimental feature. Tensor parallelism rank used during pretraining. Please refer to [this + document](https://huggingface.co/docs/transformers/parallelism) to understand more about it. This value is + necessary to ensure exact reproducibility of the pretraining results. Please refer to [this + issue](https://github.com/pytorch/pytorch/issues/76232). + tie_word_embeddings (`bool`, *optional*, defaults to `False`): + Whether to tie weight embeddings + rope_theta (`float`, *optional*, defaults to 10000.0): + The base period of the RoPE embeddings. + rope_scaling (`Dict`, *optional*): + Dictionary containing the scaling configuration for the RoPE embeddings. Currently supports two scaling + strategies: linear and dynamic. Their scaling factor must be a float greater than 1. The expected format is + `{"type": strategy name, "factor": scaling factor}`. When using this flag, don't update + `max_position_embeddings` to the expected new maximum. + attention_bias (`bool`, defaults to `False`, *optional*, defaults to `False`): + Whether to use a bias in the query, key, value and output projection layers during self-attention. + attention_dropout (`float`, *optional*, defaults to 0.0): + The dropout ratio for the attention probabilities. + + ```python + >>> from transformers import DeepseekV2Model, DeepseekV2Config + + >>> # Initializing a Deepseek-V2 style configuration + >>> configuration = DeepseekV2Config() + + >>> # Accessing the model configuration + >>> configuration = model.config + ```""" + + model_type = "deepseek_v2" + keys_to_ignore_at_inference = ["past_key_values"] + + def __init__( + self, + vocab_size=102400, + hidden_size=4096, + intermediate_size=11008, + moe_intermediate_size = 1407, + num_hidden_layers=30, + num_attention_heads=32, + num_key_value_heads=32, + n_shared_experts = None, + n_routed_experts = None, + ep_size = 1, + routed_scaling_factor = 1.0, + kv_lora_rank = 512, + q_lora_rank = 1536, + qk_rope_head_dim = 64, + v_head_dim = 128, + qk_nope_head_dim = 128, + topk_method = 'gready', + n_group = None, + topk_group = None, + num_experts_per_tok = None, + moe_layer_freq = 1, + first_k_dense_replace = 0, + norm_topk_prob = False, + scoring_func = 'softmax', + aux_loss_alpha = 0.001, + seq_aux = True, + hidden_act="silu", + max_position_embeddings=2048, + initializer_range=0.02, + rms_norm_eps=1e-6, + use_cache=True, + pad_token_id=None, + bos_token_id=100000, + eos_token_id=100001, + pretraining_tp=1, + tie_word_embeddings=False, + rope_theta=10000.0, + rope_scaling=None, + attention_bias=False, + attention_dropout=0.0, + **kwargs, + ): + self.vocab_size = vocab_size + self.max_position_embeddings = max_position_embeddings + self.hidden_size = hidden_size + self.intermediate_size = intermediate_size + self.moe_intermediate_size = moe_intermediate_size + self.num_hidden_layers = num_hidden_layers + self.num_attention_heads = num_attention_heads + self.n_shared_experts = n_shared_experts + self.n_routed_experts = n_routed_experts + self.ep_size = ep_size + self.routed_scaling_factor = routed_scaling_factor + self.kv_lora_rank = kv_lora_rank + self.q_lora_rank = q_lora_rank + self.qk_rope_head_dim = qk_rope_head_dim + self.v_head_dim = v_head_dim + self.qk_nope_head_dim = qk_nope_head_dim + self.topk_method = topk_method + self.n_group = n_group + self.topk_group = topk_group + self.num_experts_per_tok = num_experts_per_tok + self.moe_layer_freq = moe_layer_freq + self.first_k_dense_replace = first_k_dense_replace + self.norm_topk_prob = norm_topk_prob + self.scoring_func = scoring_func + self.aux_loss_alpha = aux_loss_alpha + self.seq_aux = seq_aux + # for backward compatibility + if num_key_value_heads is None: + num_key_value_heads = num_attention_heads + + self.num_key_value_heads = num_key_value_heads + self.hidden_act = hidden_act + self.initializer_range = initializer_range + self.rms_norm_eps = rms_norm_eps + self.pretraining_tp = pretraining_tp + self.use_cache = use_cache + self.rope_theta = rope_theta + self.rope_scaling = rope_scaling + self.attention_bias = attention_bias + self.attention_dropout = attention_dropout + + super().__init__( + pad_token_id=pad_token_id, + bos_token_id=bos_token_id, + eos_token_id=eos_token_id, + tie_word_embeddings=tie_word_embeddings, + **kwargs, + ) \ No newline at end of file diff --git a/generation_config.json b/generation_config.json new file mode 100644 index 0000000000000000000000000000000000000000..458e1d985ba3fbaaf62a4d1a9dd6ff795a451f7e --- /dev/null +++ b/generation_config.json @@ -0,0 +1,9 @@ +{ + "_from_model_config": true, + "bos_token_id": 100000, + "eos_token_id": 100001, + "do_sample": true, + "temperature": 0.3, + "top_p": 0.95, + "transformers_version": "4.39.3" +} diff --git a/model-00001-of-000055.safetensors b/model-00001-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..ae3a526e316a0afacde04f11b809fbd1e11059d4 --- /dev/null +++ b/model-00001-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4723867c1acc59f4dd8c214120fd709c9f491f5c4c4412ebefb74732ba0c4f0a +size 8594017186 diff --git a/model-00002-of-000055.safetensors b/model-00002-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..9dc4eeb3dcfe503c0cef25e1a7bab26473cb8904 --- /dev/null +++ b/model-00002-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:42e8ac12df89334465bd95c3267607d2a5db3150027f26fa512fa8e77dfa797e +size 8604902394 diff --git a/model-00003-of-000055.safetensors b/model-00003-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..25e7aaa0394b4c341bebfd667c365cdfc77a7b76 --- /dev/null +++ b/model-00003-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5de1b30eb92a59f8fed8250a0950b7b4390b4006c4f4e64a141b0117703f53a8 +size 8604902394 diff --git a/model-00004-of-000055.safetensors b/model-00004-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..e6799bd8841aeb8013527f274a180488837d3512 --- /dev/null +++ b/model-00004-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1864c275b9daebe0c6636acb641eff0c27acbe0da3c71909aa6a744d5c5898c0 +size 8604902402 diff --git a/model-00005-of-000055.safetensors b/model-00005-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..5ee306d22894d3e9baf03552bd4510cdca909a38 --- /dev/null +++ b/model-00005-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0e58072f5daf8a865873e233564f63d67214d7e577c4656e893b380963a32dee +size 8590441833 diff --git a/model-00006-of-000055.safetensors b/model-00006-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..4b8b11b6acb9e340821129942c13f54130161b1a --- /dev/null +++ b/model-00006-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f36cf626f3a6da8192b0be625f4216dd8d89021b338e5116e2d2d1bd4809bae8 +size 8604902329 diff --git a/model-00007-of-000055.safetensors b/model-00007-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..8f3112a976bd618719a670ee1e359b02bfaad5dd --- /dev/null +++ b/model-00007-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1192a6ea17ad9126360d4ac97d4cdcde4a25f4f473d51592ea50e95bc179a2a6 +size 8604902352 diff --git a/model-00008-of-000055.safetensors b/model-00008-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..a255a45cea78b4269d2d8749b7f6586cf122118e --- /dev/null +++ b/model-00008-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0395c3063e0d3e59760ff11ffada1e5679801410d6793f0faacb31454d64d808 +size 8604902352 diff --git a/model-00009-of-000055.safetensors b/model-00009-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..96b842e0201d0464c1e541d68f2562b9331e96f0 --- /dev/null +++ b/model-00009-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:53bc441b10cdd050ffc1b3bfec27323dcd27ff94bfa88cc97b2c7312054e5645 +size 8604902538 diff --git a/model-00010-of-000055.safetensors b/model-00010-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..4881c9d7ad0b97f02edde83990212a5db99ffe2e --- /dev/null +++ b/model-00010-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7002be0ad87c9052e0fbb17d93aedf49ac4849bf1e8fb05c3a97e9207ce365fd +size 8604902887 diff --git a/model-00011-of-000055.safetensors b/model-00011-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..51196213e55dad944cfd1975bd95812deb2f4b83 --- /dev/null +++ b/model-00011-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:adfb20589b117f35708030fdc9b6324a0fedab5fd4c6fd38e14b5c34e46f98e1 +size 8604902887 diff --git a/model-00012-of-000055.safetensors b/model-00012-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..48b4733771904c430658b56eeec1e2ee4107e50e --- /dev/null +++ b/model-00012-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4997e41ad29b0347c3e6ef456c152f2ab9ad303754963cc9d2d0ec9859c76539 +size 8604902888 diff --git a/model-00013-of-000055.safetensors b/model-00013-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..8cce8ae97d785de6f4794f0712ebfc303f9ca01b --- /dev/null +++ b/model-00013-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:532021a45bee567d14486fc17f1cd2c80fb0911394c1cd712c02931ff0301488 +size 8604902929 diff --git a/model-00014-of-000055.safetensors b/model-00014-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..cafdf07790a277a88d148751fbb214da221c3310 --- /dev/null +++ b/model-00014-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:603005cc554d7186d0ee21e25636e1be61a58d1d61c79467d811b8fe41105f9d +size 8604902929 diff --git a/model-00015-of-000055.safetensors b/model-00015-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..aacf2b7767203dddd8f412a15be1f4501e70caf4 --- /dev/null +++ b/model-00015-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c8189a596ebaa1e826354e2144222f7d332ce1be0f9fa9f41053004b6d43ba68 +size 8604902929 diff --git a/model-00016-of-000055.safetensors b/model-00016-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..4c1ad6914cf92276916a84f441c513884c6c9fff --- /dev/null +++ b/model-00016-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8dd27103727281b7d68354eb4af30cd98e5122e7dafe9fde4e2175027c0d00a4 +size 8604902933 diff --git a/model-00017-of-000055.safetensors b/model-00017-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..05477a302ab649605c2b80a3591c2dcb1af1d9bd --- /dev/null +++ b/model-00017-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0a6cb288d9e7cc2cea096c4e0a5af92af088a06fd6c79b1dec18a06b9b2b0960 +size 8590442357 diff --git a/model-00018-of-000055.safetensors b/model-00018-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..8fe8c9db403807ea599ca3ce76083d96f335847b --- /dev/null +++ b/model-00018-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7ff9411dc89507b114f3777cebf46b68ae9115f18d9740242d74d5a56f8ed061 +size 8604902862 diff --git a/model-00019-of-000055.safetensors b/model-00019-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..79558895172caa5dc723eaa3a553a508ac5a5b2c --- /dev/null +++ b/model-00019-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:36028bb9964f50b687eead5183bf9bbcd452eb55e884520e3e6f6cb9c2e363fd +size 8604902887 diff --git a/model-00020-of-000055.safetensors b/model-00020-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..de822b0c72d1b8495d0fba492fd27780b57b7c4a --- /dev/null +++ b/model-00020-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c523866605146433aa42ea87f6575d4f5ba9b24fd281a70a5467927e673d193b +size 8604902887 diff --git a/model-00021-of-000055.safetensors b/model-00021-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..05a610a30593f1e24b2ce2796d87699673821a4b --- /dev/null +++ b/model-00021-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a5f63037bdcf3fab91b7d6578adfa4d962789a4eefbe5c81fddad2d2ff76d221 +size 8604902887 diff --git a/model-00022-of-000055.safetensors b/model-00022-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..4ef3b56e9d7cc352a6fa450599f6c59c9d341533 --- /dev/null +++ b/model-00022-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:66b2bbf34d8378f211749a24d25405679c2c3bd98934900c80faefef7fc2a1a7 +size 8604902887 diff --git a/model-00023-of-000055.safetensors b/model-00023-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..c71ed97fe47577d055b365920a408b96e844151a --- /dev/null +++ b/model-00023-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e3c3cbca08de856f7027f9b8800a29641527460b46fe695049412a2ea8b21a8e +size 8604902887 diff --git a/model-00024-of-000055.safetensors b/model-00024-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..8752a479ab07ec06178b658cefbc22ed08f823e4 --- /dev/null +++ b/model-00024-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb8e2d62cda76c923814c6cfc477eaee41c39bda24fa12b8679ee698b607d062 +size 8604902887 diff --git a/model-00025-of-000055.safetensors b/model-00025-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..b9edacbf83d128881b789d6ad12fdf7e3fd163ac --- /dev/null +++ b/model-00025-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:085f0d106e0bf243e4b4de0321bf543b5f7819721740f8c5f28c1210cae9466e +size 8604902928 diff --git a/model-00026-of-000055.safetensors b/model-00026-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..36eee64403d4298b0b66a3421f5eb0382d9dffca --- /dev/null +++ b/model-00026-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:df0b0a602fbc052c61fdb7f321e5314443daac4045095dd0acd360980ef5d989 +size 8604902929 diff --git a/model-00027-of-000055.safetensors b/model-00027-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..ba86d63eba8f65223a69b8c8df4aa615956b3b1a --- /dev/null +++ b/model-00027-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9d4efcbd2cfa096b6479db22034b7c4b5b6397df2796eb38dc5a911838c996f6 +size 8604902929 diff --git a/model-00028-of-000055.safetensors b/model-00028-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..ef16eaabd3dc7a1ac0f7004f0e9c80bd9065fc70 --- /dev/null +++ b/model-00028-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:41d1eb892403fb13f8f376b63a3aa5d817b5ab682d48c777f1b5f609b422489c +size 8604902929 diff --git a/model-00029-of-000055.safetensors b/model-00029-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..f2088cc3a2c54d30648b89c2085f6a625848a88f --- /dev/null +++ b/model-00029-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cab9f7950c4454c41319fbbbd85f6f82429cd347f22a5508755a6a71379a448c +size 8590442363 diff --git a/model-00030-of-000055.safetensors b/model-00030-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..17cf6002773036400fa67ad70911acb0f2ac1822 --- /dev/null +++ b/model-00030-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4e25fb281c6208ddf09dc203b94f2b7cd3d5723f7521130686aa8cf52b6083b3 +size 8604902860 diff --git a/model-00031-of-000055.safetensors b/model-00031-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..f6b78fff755b015552a176412c0553419d1bfc81 --- /dev/null +++ b/model-00031-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7890c1ef6f2297ebe9f7b12f9d451d6128445dbafea7b2634834bd78d5d51be2 +size 8604902887 diff --git a/model-00032-of-000055.safetensors b/model-00032-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..69635bc41a4874dfd291f7942b40f9339840345f --- /dev/null +++ b/model-00032-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1445276e505591c701102e1b2eef95a6726b7c48314c584b2b1013e29d79bffc +size 8604902887 diff --git a/model-00033-of-000055.safetensors b/model-00033-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..91f9d7c5aa2eeabf602587b89d070993d16e79a4 --- /dev/null +++ b/model-00033-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:68ef5a8e25da48704e2d19613c26540f4cc2b4fc11c038b7f08d5665ac9f6813 +size 8604902887 diff --git a/model-00034-of-000055.safetensors b/model-00034-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..e1566dc73ef50888ef27d94837bc00f221a6a889 --- /dev/null +++ b/model-00034-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fd00974ad9603f8af075dc2dd5ce72e91398953c0f0fc24d4ce5def422d28ad2 +size 8604902887 diff --git a/model-00035-of-000055.safetensors b/model-00035-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..417748ef26b17dd689e59c7aa452881cbab7d320 --- /dev/null +++ b/model-00035-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:67ba0def2b3ca293015a51fb7ddefb5690acaae7227abdb9fb3aa72cee91d783 +size 8604902887 diff --git a/model-00036-of-000055.safetensors b/model-00036-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..62223a64d402ca35cf39d345c58bf91409176283 --- /dev/null +++ b/model-00036-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bcc62dd95f53275d96720ea6da7ecfa05188b86bbb7a18f364cd282c66271bb2 +size 8604902887 diff --git a/model-00037-of-000055.safetensors b/model-00037-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..10957fd571ca97c4fe193882999a86c91ee6757a --- /dev/null +++ b/model-00037-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:31f999e9c5875288e323cf717b0bb3c34ca363e0951024a432c750480ef10621 +size 8604902926 diff --git a/model-00038-of-000055.safetensors b/model-00038-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..09fdd1f8096cba855baac5d63918339464f388b6 --- /dev/null +++ b/model-00038-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:433f65f96f8f13db6522dabb7f0e6425f986427c9d2870585ee840536a911618 +size 8604902929 diff --git a/model-00039-of-000055.safetensors b/model-00039-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..96d16d9555b06568e4bae93cdbd31cb66b8e78fc --- /dev/null +++ b/model-00039-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3161a2b5f22d8422101886d6e1f85e84c5db82d63e3fbd52e5c0f6ab17363486 +size 8604902929 diff --git a/model-00040-of-000055.safetensors b/model-00040-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..ccbf70f7c506e762c45dab8ab8be674bb6aab5a1 --- /dev/null +++ b/model-00040-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a33e07c5f1e8dadd4eb9b7392ad735015c395e7a8c940d7a68927082d3f1368 +size 8604902927 diff --git a/model-00041-of-000055.safetensors b/model-00041-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..e753d586959617ccbef58d582e5c0c19366bf868 --- /dev/null +++ b/model-00041-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:351b6916d5ce36445ef22b30b265913acc7caf2e8fb5a86c28c163d9c12c3932 +size 8590442367 diff --git a/model-00042-of-000055.safetensors b/model-00042-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..63fb04756c2e0327e256e8ed7ebb1c16a161b1e6 --- /dev/null +++ b/model-00042-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bfe70d0667a0f09c8c95860fd95026456fa04f8bf64bfeb889a1fd03ac187268 +size 8604902858 diff --git a/model-00043-of-000055.safetensors b/model-00043-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..3569d0b1764ee5058c0524ae19f1a960a6e6fe83 --- /dev/null +++ b/model-00043-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a0d644951c2f95dd8761552d4ff3ca241c99f302852a300c7371ca8b83d40854 +size 8604902887 diff --git a/model-00044-of-000055.safetensors b/model-00044-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..955550e025a38c7e9d2affb4da56236a27a9dc65 --- /dev/null +++ b/model-00044-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:311c6e91304fd6b25a6a33c4ba9bb1a816e64e898db2d3ed4a2352c80e876119 +size 8604902887 diff --git a/model-00045-of-000055.safetensors b/model-00045-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..944eb0a751d4283e8e7adeb49ca58c68d03727d6 --- /dev/null +++ b/model-00045-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:98c0bd1806482cf024daf8d4ca61c1f2e8b24dec42d47facc8c11be95081bba7 +size 8604902887 diff --git a/model-00046-of-000055.safetensors b/model-00046-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..71ae034997585442afb044cd16767afc1f11d242 --- /dev/null +++ b/model-00046-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ac5758d83ba9972dba89846d37645cb46aff7b6a6d8bc84191643a8bc53fb112 +size 8604902887 diff --git a/model-00047-of-000055.safetensors b/model-00047-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..ef036bf5b019dd13853d3482c4de40caf5283f26 --- /dev/null +++ b/model-00047-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f1d752099b74b5e2a1cd958ca3c5d9c99386c7085a0d24478bd906939712a252 +size 8604902887 diff --git a/model-00048-of-000055.safetensors b/model-00048-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..17cfa3583a5f6cd6bcba7a1ae9d21f807ba33ab5 --- /dev/null +++ b/model-00048-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cac460b11885c5a6a179c19787a5aeb430398af6317e5bc938d2285e53027cb3 +size 8604902887 diff --git a/model-00049-of-000055.safetensors b/model-00049-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..efa9ab73d07ca597646e05ffe6e5fd11dfffbb5c --- /dev/null +++ b/model-00049-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:faad9a717022b39be1da069f528fc14e363a4a1c88153c9fbe660cd689a39bc7 +size 8604902924 diff --git a/model-00050-of-000055.safetensors b/model-00050-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..389d37da4d9dd5db20d26a05b7ff43f3cecc1f26 --- /dev/null +++ b/model-00050-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:296b7bb240e8c398f39252729c563e7192b21b5db32cdd2c35abc3b8d1875664 +size 8604902929 diff --git a/model-00051-of-000055.safetensors b/model-00051-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..a15bfb1d025c5d88d74a64deeae68695bb07d023 --- /dev/null +++ b/model-00051-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7213ae9ca463ef5ad7b7dfee2fa9a060b54895563c3789a8350816c9194838c9 +size 8604902929 diff --git a/model-00052-of-000055.safetensors b/model-00052-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..a923c17807ad82a5dbeeb0b486863296c6b8a645 --- /dev/null +++ b/model-00052-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d05571212b08d2083dd5e32db28b81a36c49ab81ac5fc574e7f87ce74b684394 +size 8604902929 diff --git a/model-00053-of-000055.safetensors b/model-00053-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..0f8cc45a73cec432f27f9933c9759139dbf433a2 --- /dev/null +++ b/model-00053-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cc1f71cffe4f46c7c55d6040572941157c913a0d622376169a538801c10ea7dd +size 8606171134 diff --git a/model-00054-of-000055.safetensors b/model-00054-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..d6da5c8f75b07551cd439ebdb5ca6aed2f5c8d81 --- /dev/null +++ b/model-00054-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2870aa0867f9d0275e0e72e30ab58d340435ab10dbc0a8c67fbe0655e6291259 +size 8604902857 diff --git a/model-00055-of-000055.safetensors b/model-00055-of-000055.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..84238aef151a52f90144e619b8e2f3fe8503a903 --- /dev/null +++ b/model-00055-of-000055.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c24c913f51454e4a984ed1c4717a191f02d87621300934bae6881973e436685d +size 6889220002 diff --git a/model.safetensors.index.json b/model.safetensors.index.json new file mode 100644 index 0000000000000000000000000000000000000000..2472ece0ff488ed893bd4cf4507d484a1a363983 --- /dev/null +++ b/model.safetensors.index.json @@ -0,0 +1,29109 @@ +{ + "metadata": { + "total_size": 471482869760 + }, + "weight_map": { + "model.embed_tokens.weight": "model-00001-of-000055.safetensors", + "model.norm.weight": "model-00001-of-000055.safetensors", + "lm_head.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.q_a_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.q_a_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.q_b_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.kv_a_proj_with_mqa.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.kv_a_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.kv_b_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.self_attn.o_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.mlp.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.mlp.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.mlp.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.0.input_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.0.post_attention_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.q_a_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.q_a_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.q_b_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.kv_a_proj_with_mqa.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.kv_a_layernorm.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.kv_b_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.self_attn.o_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.gate.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.shared_experts.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.shared_experts.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.shared_experts.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.0.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.0.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.0.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.1.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.1.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.1.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.2.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.2.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.2.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.3.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.3.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.3.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.4.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.4.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.4.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.5.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.5.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.5.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.6.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.6.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.6.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.7.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.7.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.7.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.8.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.8.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.8.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.9.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.9.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.9.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.10.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.10.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.10.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.11.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.11.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.11.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.12.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.12.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.12.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.13.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.13.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.13.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.14.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.14.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.14.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.15.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.15.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.15.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.16.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.16.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.16.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.17.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.17.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.17.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.18.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.18.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.18.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.19.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.19.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.19.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.20.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.20.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.20.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.21.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.21.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.21.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.22.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.22.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.22.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.23.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.23.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.23.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.24.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.24.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.24.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.25.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.25.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.25.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.26.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.26.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.26.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.27.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.27.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.27.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.28.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.28.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.28.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.29.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.29.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.29.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.30.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.30.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.30.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.31.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.31.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.31.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.32.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.32.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.32.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.33.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.33.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.33.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.34.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.34.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.34.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.35.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.35.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.35.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.36.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.36.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.36.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.37.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.37.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.37.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.38.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.38.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.38.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.39.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.39.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.39.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.40.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.40.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.40.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.41.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.41.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.41.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.42.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.42.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.42.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.43.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.43.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.43.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.44.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.44.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.44.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.45.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.45.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.45.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.46.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.46.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.46.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.47.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.47.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.47.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.48.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.48.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.48.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.49.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.49.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.49.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.50.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.50.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.50.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.51.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.51.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.51.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.52.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.52.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.52.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.53.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.53.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.53.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.54.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.54.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.54.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.55.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.55.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.55.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.56.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.56.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.56.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.57.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.57.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.57.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.58.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.58.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.58.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.59.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.59.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.59.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.60.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.60.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.60.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.61.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.61.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.61.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.62.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.62.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.62.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.63.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.63.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.63.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.64.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.64.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.64.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.65.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.65.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.65.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.66.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.66.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.66.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.67.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.67.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.67.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.68.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.68.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.68.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.69.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.69.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.69.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.70.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.70.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.70.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.71.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.71.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.71.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.72.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.72.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.72.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.73.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.73.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.73.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.74.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.74.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.74.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.75.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.75.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.75.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.76.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.76.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.76.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.77.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.77.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.77.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.78.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.78.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.78.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.79.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.79.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.79.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.80.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.80.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.80.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.81.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.81.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.81.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.82.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.82.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.82.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.83.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.83.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.83.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.84.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.84.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.84.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.85.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.85.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.85.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.86.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.86.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.86.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.87.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.87.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.87.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.88.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.88.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.88.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.89.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.89.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.89.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.90.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.90.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.90.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.91.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.91.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.91.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.92.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.92.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.92.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.93.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.93.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.93.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.94.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.94.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.94.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.95.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.95.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.95.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.96.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.96.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.96.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.97.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.97.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.97.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.98.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.98.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.98.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.99.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.99.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.99.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.100.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.100.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.100.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.101.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.101.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.101.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.102.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.102.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.102.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.103.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.103.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.103.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.104.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.104.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.104.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.105.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.105.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.105.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.106.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.106.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.106.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.107.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.107.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.107.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.108.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.108.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.108.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.109.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.109.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.109.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.110.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.110.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.110.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.111.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.111.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.111.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.112.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.112.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.112.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.113.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.113.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.113.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.114.gate_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.114.up_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.114.down_proj.weight": "model-00001-of-000055.safetensors", + "model.layers.1.mlp.experts.115.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.115.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.115.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.116.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.116.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.116.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.117.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.117.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.117.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.118.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.118.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.118.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.119.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.119.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.119.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.120.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.120.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.120.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.121.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.121.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.121.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.122.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.122.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.122.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.123.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.123.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.123.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.124.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.124.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.124.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.125.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.125.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.125.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.126.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.126.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.126.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.127.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.127.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.127.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.128.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.128.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.128.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.129.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.129.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.129.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.130.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.130.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.130.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.131.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.131.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.131.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.132.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.132.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.132.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.133.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.133.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.133.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.134.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.134.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.134.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.135.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.135.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.135.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.136.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.136.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.136.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.137.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.137.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.137.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.138.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.138.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.138.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.139.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.139.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.139.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.140.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.140.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.140.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.141.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.141.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.141.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.142.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.142.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.142.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.143.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.143.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.143.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.144.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.144.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.144.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.145.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.145.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.145.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.146.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.146.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.146.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.147.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.147.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.147.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.148.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.148.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.148.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.149.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.149.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.149.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.150.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.150.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.150.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.151.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.151.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.151.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.152.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.152.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.152.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.153.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.153.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.153.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.154.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.154.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.154.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.155.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.155.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.155.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.156.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.156.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.156.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.157.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.157.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.157.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.158.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.158.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.158.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.159.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.159.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.mlp.experts.159.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.1.input_layernorm.weight": "model-00002-of-000055.safetensors", + "model.layers.1.post_attention_layernorm.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.q_a_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.q_a_layernorm.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.q_b_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.kv_a_proj_with_mqa.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.kv_a_layernorm.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.kv_b_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.self_attn.o_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.gate.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.shared_experts.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.shared_experts.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.shared_experts.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.0.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.0.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.0.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.1.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.1.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.1.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.2.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.2.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.2.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.3.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.3.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.3.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.4.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.4.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.4.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.5.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.5.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.5.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.6.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.6.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.6.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.7.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.7.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.7.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.8.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.8.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.8.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.9.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.9.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.9.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.10.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.10.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.10.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.11.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.11.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.11.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.12.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.12.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.12.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.13.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.13.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.13.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.14.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.14.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.14.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.15.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.15.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.15.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.16.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.16.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.16.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.17.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.17.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.17.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.18.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.18.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.18.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.19.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.19.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.19.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.20.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.20.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.20.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.21.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.21.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.21.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.22.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.22.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.22.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.23.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.23.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.23.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.24.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.24.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.24.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.25.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.25.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.25.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.26.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.26.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.26.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.27.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.27.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.27.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.28.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.28.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.28.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.29.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.29.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.29.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.30.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.30.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.30.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.31.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.31.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.31.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.32.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.32.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.32.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.33.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.33.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.33.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.34.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.34.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.34.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.35.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.35.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.35.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.36.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.36.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.36.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.37.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.37.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.37.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.38.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.38.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.38.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.39.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.39.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.39.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.40.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.40.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.40.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.41.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.41.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.41.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.42.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.42.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.42.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.43.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.43.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.43.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.44.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.44.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.44.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.45.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.45.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.45.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.46.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.46.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.46.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.47.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.47.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.47.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.48.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.48.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.48.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.49.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.49.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.49.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.50.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.50.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.50.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.51.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.51.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.51.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.52.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.52.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.52.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.53.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.53.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.53.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.54.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.54.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.54.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.55.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.55.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.55.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.56.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.56.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.56.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.57.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.57.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.57.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.58.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.58.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.58.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.59.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.59.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.59.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.60.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.60.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.60.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.61.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.61.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.61.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.62.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.62.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.62.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.63.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.63.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.63.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.64.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.64.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.64.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.65.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.65.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.65.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.66.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.66.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.66.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.67.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.67.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.67.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.68.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.68.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.68.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.69.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.69.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.69.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.70.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.70.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.70.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.71.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.71.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.71.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.72.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.72.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.72.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.73.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.73.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.73.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.74.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.74.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.74.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.75.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.75.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.75.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.76.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.76.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.76.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.77.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.77.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.77.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.78.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.78.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.78.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.79.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.79.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.79.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.80.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.80.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.80.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.81.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.81.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.81.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.82.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.82.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.82.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.83.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.83.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.83.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.84.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.84.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.84.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.85.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.85.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.85.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.86.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.86.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.86.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.87.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.87.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.87.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.88.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.88.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.88.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.89.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.89.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.89.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.90.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.90.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.90.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.91.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.91.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.91.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.92.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.92.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.92.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.93.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.93.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.93.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.94.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.94.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.94.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.95.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.95.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.95.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.96.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.96.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.96.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.97.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.97.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.97.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.98.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.98.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.98.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.99.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.99.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.99.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.100.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.100.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.100.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.101.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.101.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.101.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.102.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.102.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.102.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.103.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.103.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.103.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.104.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.104.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.104.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.105.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.105.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.105.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.106.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.106.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.106.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.107.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.107.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.107.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.108.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.108.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.108.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.109.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.109.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.109.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.110.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.110.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.110.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.111.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.111.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.111.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.112.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.112.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.112.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.113.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.113.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.113.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.114.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.114.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.114.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.115.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.115.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.115.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.116.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.116.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.116.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.117.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.117.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.117.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.118.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.118.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.118.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.119.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.119.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.119.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.120.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.120.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.120.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.121.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.121.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.121.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.122.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.122.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.122.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.123.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.123.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.123.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.124.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.124.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.124.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.125.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.125.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.125.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.126.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.126.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.126.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.127.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.127.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.127.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.128.gate_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.128.up_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.128.down_proj.weight": "model-00002-of-000055.safetensors", + "model.layers.2.mlp.experts.129.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.129.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.129.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.130.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.130.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.130.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.131.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.131.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.131.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.132.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.132.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.132.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.133.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.133.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.133.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.134.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.134.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.134.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.135.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.135.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.135.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.136.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.136.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.136.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.137.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.137.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.137.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.138.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.138.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.138.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.139.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.139.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.139.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.140.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.140.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.140.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.141.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.141.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.141.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.142.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.142.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.142.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.143.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.143.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.143.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.144.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.144.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.144.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.145.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.145.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.145.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.146.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.146.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.146.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.147.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.147.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.147.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.148.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.148.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.148.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.149.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.149.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.149.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.150.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.150.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.150.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.151.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.151.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.151.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.152.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.152.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.152.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.153.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.153.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.153.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.154.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.154.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.154.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.155.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.155.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.155.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.156.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.156.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.156.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.157.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.157.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.157.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.158.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.158.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.158.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.159.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.159.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.mlp.experts.159.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.2.input_layernorm.weight": "model-00003-of-000055.safetensors", + "model.layers.2.post_attention_layernorm.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.q_a_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.q_a_layernorm.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.q_b_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.kv_a_proj_with_mqa.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.kv_a_layernorm.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.kv_b_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.self_attn.o_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.gate.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.shared_experts.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.shared_experts.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.shared_experts.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.0.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.0.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.0.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.1.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.1.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.1.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.2.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.2.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.2.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.3.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.3.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.3.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.4.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.4.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.4.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.5.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.5.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.5.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.6.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.6.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.6.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.7.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.7.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.7.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.8.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.8.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.8.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.9.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.9.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.9.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.10.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.10.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.10.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.11.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.11.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.11.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.12.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.12.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.12.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.13.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.13.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.13.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.14.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.14.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.14.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.15.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.15.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.15.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.16.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.16.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.16.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.17.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.17.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.17.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.18.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.18.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.18.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.19.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.19.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.19.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.20.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.20.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.20.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.21.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.21.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.21.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.22.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.22.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.22.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.23.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.23.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.23.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.24.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.24.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.24.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.25.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.25.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.25.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.26.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.26.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.26.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.27.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.27.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.27.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.28.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.28.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.28.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.29.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.29.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.29.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.30.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.30.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.30.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.31.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.31.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.31.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.32.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.32.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.32.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.33.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.33.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.33.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.34.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.34.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.34.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.35.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.35.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.35.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.36.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.36.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.36.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.37.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.37.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.37.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.38.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.38.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.38.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.39.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.39.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.39.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.40.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.40.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.40.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.41.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.41.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.41.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.42.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.42.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.42.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.43.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.43.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.43.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.44.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.44.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.44.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.45.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.45.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.45.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.46.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.46.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.46.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.47.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.47.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.47.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.48.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.48.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.48.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.49.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.49.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.49.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.50.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.50.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.50.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.51.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.51.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.51.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.52.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.52.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.52.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.53.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.53.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.53.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.54.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.54.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.54.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.55.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.55.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.55.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.56.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.56.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.56.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.57.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.57.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.57.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.58.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.58.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.58.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.59.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.59.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.59.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.60.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.60.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.60.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.61.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.61.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.61.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.62.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.62.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.62.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.63.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.63.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.63.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.64.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.64.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.64.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.65.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.65.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.65.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.66.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.66.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.66.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.67.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.67.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.67.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.68.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.68.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.68.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.69.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.69.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.69.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.70.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.70.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.70.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.71.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.71.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.71.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.72.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.72.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.72.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.73.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.73.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.73.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.74.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.74.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.74.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.75.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.75.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.75.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.76.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.76.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.76.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.77.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.77.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.77.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.78.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.78.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.78.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.79.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.79.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.79.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.80.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.80.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.80.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.81.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.81.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.81.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.82.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.82.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.82.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.83.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.83.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.83.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.84.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.84.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.84.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.85.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.85.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.85.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.86.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.86.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.86.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.87.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.87.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.87.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.88.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.88.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.88.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.89.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.89.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.89.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.90.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.90.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.90.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.91.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.91.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.91.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.92.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.92.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.92.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.93.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.93.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.93.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.94.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.94.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.94.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.95.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.95.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.95.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.96.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.96.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.96.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.97.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.97.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.97.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.98.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.98.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.98.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.99.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.99.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.99.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.100.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.100.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.100.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.101.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.101.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.101.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.102.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.102.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.102.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.103.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.103.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.103.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.104.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.104.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.104.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.105.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.105.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.105.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.106.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.106.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.106.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.107.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.107.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.107.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.108.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.108.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.108.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.109.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.109.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.109.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.110.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.110.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.110.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.111.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.111.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.111.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.112.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.112.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.112.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.113.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.113.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.113.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.114.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.114.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.114.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.115.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.115.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.115.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.116.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.116.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.116.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.117.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.117.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.117.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.118.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.118.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.118.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.119.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.119.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.119.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.120.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.120.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.120.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.121.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.121.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.121.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.122.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.122.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.122.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.123.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.123.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.123.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.124.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.124.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.124.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.125.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.125.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.125.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.126.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.126.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.126.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.127.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.127.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.127.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.128.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.128.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.128.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.129.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.129.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.129.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.130.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.130.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.130.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.131.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.131.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.131.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.132.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.132.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.132.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.133.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.133.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.133.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.134.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.134.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.134.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.135.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.135.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.135.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.136.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.136.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.136.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.137.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.137.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.137.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.138.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.138.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.138.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.139.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.139.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.139.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.140.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.140.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.140.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.141.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.141.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.141.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.142.gate_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.142.up_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.142.down_proj.weight": "model-00003-of-000055.safetensors", + "model.layers.3.mlp.experts.143.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.143.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.143.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.144.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.144.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.144.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.145.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.145.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.145.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.146.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.146.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.146.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.147.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.147.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.147.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.148.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.148.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.148.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.149.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.149.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.149.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.150.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.150.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.150.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.151.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.151.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.151.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.152.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.152.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.152.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.153.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.153.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.153.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.154.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.154.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.154.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.155.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.155.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.155.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.156.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.156.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.156.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.157.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.157.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.157.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.158.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.158.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.158.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.159.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.159.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.mlp.experts.159.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.3.input_layernorm.weight": "model-00004-of-000055.safetensors", + "model.layers.3.post_attention_layernorm.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.q_a_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.q_a_layernorm.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.q_b_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.kv_a_proj_with_mqa.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.kv_a_layernorm.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.kv_b_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.self_attn.o_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.gate.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.shared_experts.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.shared_experts.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.shared_experts.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.0.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.0.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.0.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.1.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.1.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.1.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.2.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.2.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.2.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.3.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.3.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.3.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.4.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.4.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.4.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.5.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.5.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.5.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.6.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.6.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.6.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.7.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.7.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.7.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.8.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.8.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.8.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.9.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.9.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.9.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.10.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.10.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.10.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.11.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.11.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.11.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.12.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.12.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.12.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.13.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.13.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.13.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.14.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.14.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.14.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.15.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.15.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.15.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.16.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.16.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.16.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.17.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.17.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.17.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.18.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.18.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.18.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.19.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.19.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.19.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.20.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.20.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.20.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.21.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.21.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.21.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.22.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.22.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.22.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.23.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.23.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.23.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.24.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.24.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.24.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.25.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.25.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.25.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.26.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.26.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.26.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.27.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.27.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.27.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.28.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.28.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.28.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.29.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.29.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.29.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.30.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.30.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.30.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.31.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.31.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.31.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.32.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.32.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.32.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.33.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.33.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.33.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.34.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.34.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.34.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.35.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.35.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.35.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.36.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.36.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.36.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.37.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.37.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.37.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.38.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.38.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.38.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.39.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.39.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.39.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.40.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.40.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.40.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.41.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.41.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.41.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.42.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.42.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.42.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.43.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.43.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.43.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.44.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.44.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.44.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.45.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.45.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.45.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.46.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.46.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.46.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.47.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.47.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.47.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.48.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.48.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.48.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.49.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.49.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.49.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.50.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.50.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.50.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.51.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.51.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.51.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.52.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.52.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.52.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.53.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.53.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.53.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.54.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.54.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.54.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.55.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.55.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.55.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.56.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.56.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.56.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.57.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.57.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.57.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.58.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.58.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.58.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.59.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.59.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.59.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.60.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.60.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.60.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.61.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.61.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.61.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.62.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.62.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.62.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.63.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.63.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.63.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.64.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.64.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.64.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.65.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.65.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.65.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.66.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.66.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.66.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.67.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.67.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.67.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.68.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.68.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.68.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.69.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.69.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.69.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.70.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.70.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.70.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.71.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.71.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.71.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.72.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.72.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.72.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.73.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.73.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.73.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.74.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.74.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.74.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.75.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.75.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.75.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.76.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.76.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.76.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.77.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.77.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.77.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.78.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.78.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.78.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.79.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.79.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.79.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.80.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.80.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.80.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.81.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.81.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.81.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.82.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.82.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.82.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.83.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.83.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.83.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.84.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.84.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.84.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.85.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.85.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.85.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.86.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.86.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.86.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.87.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.87.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.87.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.88.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.88.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.88.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.89.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.89.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.89.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.90.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.90.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.90.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.91.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.91.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.91.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.92.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.92.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.92.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.93.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.93.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.93.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.94.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.94.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.94.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.95.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.95.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.95.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.96.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.96.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.96.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.97.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.97.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.97.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.98.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.98.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.98.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.99.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.99.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.99.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.100.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.100.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.100.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.101.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.101.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.101.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.102.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.102.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.102.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.103.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.103.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.103.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.104.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.104.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.104.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.105.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.105.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.105.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.106.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.106.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.106.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.107.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.107.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.107.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.108.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.108.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.108.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.109.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.109.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.109.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.110.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.110.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.110.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.111.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.111.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.111.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.112.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.112.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.112.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.113.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.113.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.113.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.114.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.114.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.114.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.115.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.115.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.115.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.116.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.116.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.116.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.117.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.117.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.117.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.118.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.118.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.118.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.119.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.119.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.119.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.120.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.120.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.120.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.121.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.121.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.121.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.122.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.122.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.122.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.123.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.123.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.123.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.124.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.124.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.124.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.125.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.125.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.125.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.126.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.126.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.126.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.127.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.127.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.127.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.128.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.128.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.128.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.129.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.129.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.129.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.130.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.130.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.130.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.131.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.131.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.131.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.132.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.132.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.132.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.133.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.133.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.133.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.134.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.134.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.134.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.135.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.135.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.135.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.136.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.136.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.136.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.137.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.137.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.137.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.138.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.138.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.138.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.139.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.139.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.139.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.140.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.140.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.140.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.141.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.141.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.141.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.142.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.142.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.142.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.143.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.143.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.143.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.144.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.144.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.144.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.145.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.145.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.145.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.146.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.146.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.146.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.147.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.147.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.147.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.148.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.148.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.148.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.149.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.149.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.149.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.150.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.150.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.150.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.151.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.151.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.151.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.152.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.152.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.152.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.153.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.153.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.153.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.154.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.154.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.154.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.155.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.155.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.155.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.156.gate_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.156.up_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.156.down_proj.weight": "model-00004-of-000055.safetensors", + "model.layers.4.mlp.experts.157.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.157.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.157.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.158.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.158.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.158.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.159.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.159.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.mlp.experts.159.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.4.input_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.4.post_attention_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.q_a_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.q_a_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.q_b_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.kv_a_proj_with_mqa.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.kv_a_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.kv_b_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.self_attn.o_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.gate.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.shared_experts.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.shared_experts.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.shared_experts.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.0.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.0.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.0.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.1.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.1.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.1.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.2.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.2.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.2.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.3.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.3.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.3.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.4.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.4.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.4.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.5.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.5.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.5.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.6.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.6.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.6.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.7.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.7.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.7.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.8.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.8.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.8.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.9.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.9.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.9.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.10.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.10.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.10.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.11.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.11.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.11.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.12.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.12.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.12.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.13.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.13.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.13.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.14.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.14.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.14.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.15.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.15.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.15.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.16.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.16.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.16.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.17.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.17.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.17.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.18.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.18.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.18.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.19.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.19.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.19.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.20.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.20.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.20.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.21.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.21.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.21.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.22.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.22.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.22.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.23.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.23.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.23.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.24.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.24.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.24.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.25.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.25.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.25.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.26.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.26.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.26.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.27.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.27.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.27.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.28.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.28.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.28.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.29.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.29.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.29.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.30.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.30.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.30.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.31.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.31.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.31.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.32.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.32.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.32.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.33.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.33.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.33.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.34.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.34.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.34.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.35.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.35.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.35.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.36.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.36.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.36.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.37.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.37.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.37.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.38.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.38.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.38.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.39.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.39.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.39.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.40.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.40.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.40.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.41.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.41.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.41.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.42.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.42.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.42.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.43.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.43.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.43.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.44.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.44.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.44.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.45.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.45.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.45.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.46.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.46.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.46.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.47.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.47.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.47.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.48.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.48.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.48.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.49.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.49.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.49.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.50.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.50.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.50.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.51.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.51.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.51.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.52.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.52.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.52.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.53.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.53.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.53.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.54.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.54.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.54.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.55.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.55.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.55.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.56.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.56.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.56.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.57.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.57.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.57.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.58.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.58.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.58.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.59.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.59.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.59.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.60.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.60.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.60.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.61.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.61.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.61.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.62.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.62.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.62.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.63.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.63.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.63.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.64.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.64.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.64.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.65.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.65.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.65.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.66.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.66.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.66.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.67.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.67.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.67.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.68.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.68.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.68.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.69.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.69.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.69.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.70.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.70.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.70.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.71.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.71.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.71.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.72.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.72.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.72.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.73.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.73.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.73.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.74.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.74.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.74.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.75.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.75.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.75.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.76.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.76.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.76.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.77.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.77.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.77.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.78.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.78.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.78.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.79.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.79.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.79.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.80.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.80.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.80.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.81.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.81.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.81.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.82.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.82.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.82.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.83.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.83.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.83.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.84.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.84.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.84.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.85.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.85.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.85.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.86.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.86.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.86.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.87.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.87.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.87.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.88.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.88.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.88.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.89.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.89.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.89.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.90.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.90.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.90.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.91.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.91.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.91.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.92.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.92.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.92.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.93.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.93.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.93.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.94.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.94.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.94.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.95.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.95.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.95.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.96.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.96.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.96.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.97.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.97.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.97.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.98.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.98.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.98.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.99.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.99.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.99.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.100.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.100.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.100.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.101.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.101.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.101.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.102.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.102.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.102.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.103.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.103.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.103.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.104.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.104.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.104.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.105.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.105.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.105.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.106.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.106.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.106.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.107.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.107.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.107.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.108.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.108.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.108.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.109.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.109.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.109.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.110.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.110.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.110.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.111.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.111.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.111.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.112.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.112.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.112.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.113.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.113.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.113.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.114.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.114.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.114.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.115.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.115.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.115.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.116.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.116.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.116.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.117.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.117.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.117.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.118.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.118.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.118.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.119.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.119.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.119.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.120.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.120.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.120.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.121.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.121.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.121.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.122.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.122.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.122.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.123.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.123.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.123.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.124.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.124.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.124.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.125.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.125.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.125.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.126.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.126.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.126.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.127.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.127.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.127.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.128.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.128.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.128.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.129.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.129.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.129.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.130.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.130.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.130.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.131.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.131.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.131.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.132.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.132.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.132.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.133.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.133.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.133.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.134.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.134.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.134.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.135.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.135.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.135.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.136.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.136.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.136.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.137.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.137.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.137.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.138.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.138.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.138.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.139.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.139.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.139.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.140.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.140.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.140.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.141.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.141.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.141.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.142.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.142.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.142.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.143.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.143.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.143.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.144.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.144.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.144.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.145.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.145.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.145.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.146.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.146.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.146.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.147.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.147.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.147.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.148.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.148.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.148.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.149.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.149.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.149.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.150.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.150.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.150.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.151.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.151.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.151.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.152.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.152.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.152.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.153.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.153.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.153.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.154.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.154.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.154.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.155.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.155.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.155.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.156.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.156.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.156.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.157.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.157.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.157.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.158.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.158.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.158.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.159.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.159.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.mlp.experts.159.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.5.input_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.5.post_attention_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.q_a_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.q_a_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.q_b_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.kv_a_proj_with_mqa.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.kv_a_layernorm.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.kv_b_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.self_attn.o_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.gate.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.shared_experts.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.shared_experts.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.shared_experts.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.0.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.0.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.0.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.1.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.1.up_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.1.down_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.2.gate_proj.weight": "model-00005-of-000055.safetensors", + "model.layers.6.mlp.experts.2.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.2.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.3.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.3.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.3.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.4.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.4.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.4.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.5.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.5.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.5.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.6.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.6.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.6.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.7.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.7.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.7.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.8.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.8.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.8.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.9.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.9.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.9.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.10.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.10.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.10.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.11.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.11.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.11.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.12.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.12.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.12.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.13.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.13.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.13.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.14.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.14.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.14.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.15.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.15.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.15.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.16.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.16.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.16.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.17.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.17.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.17.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.18.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.18.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.18.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.19.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.19.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.19.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.20.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.20.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.20.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.21.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.21.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.21.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.22.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.22.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.22.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.23.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.23.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.23.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.24.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.24.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.24.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.25.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.25.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.25.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.26.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.26.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.26.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.27.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.27.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.27.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.28.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.28.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.28.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.29.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.29.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.29.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.30.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.30.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.30.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.31.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.31.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.31.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.32.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.32.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.32.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.33.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.33.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.33.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.34.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.34.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.34.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.35.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.35.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.35.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.36.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.36.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.36.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.37.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.37.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.37.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.38.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.38.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.38.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.39.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.39.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.39.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.40.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.40.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.40.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.41.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.41.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.41.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.42.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.42.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.42.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.43.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.43.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.43.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.44.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.44.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.44.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.45.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.45.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.45.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.46.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.46.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.46.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.47.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.47.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.47.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.48.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.48.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.48.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.49.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.49.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.49.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.50.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.50.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.50.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.51.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.51.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.51.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.52.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.52.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.52.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.53.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.53.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.53.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.54.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.54.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.54.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.55.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.55.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.55.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.56.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.56.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.56.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.57.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.57.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.57.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.58.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.58.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.58.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.59.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.59.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.59.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.60.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.60.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.60.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.61.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.61.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.61.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.62.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.62.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.62.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.63.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.63.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.63.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.64.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.64.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.64.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.65.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.65.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.65.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.66.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.66.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.66.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.67.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.67.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.67.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.68.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.68.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.68.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.69.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.69.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.69.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.70.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.70.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.70.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.71.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.71.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.71.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.72.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.72.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.72.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.73.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.73.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.73.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.74.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.74.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.74.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.75.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.75.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.75.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.76.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.76.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.76.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.77.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.77.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.77.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.78.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.78.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.78.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.79.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.79.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.79.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.80.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.80.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.80.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.81.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.81.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.81.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.82.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.82.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.82.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.83.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.83.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.83.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.84.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.84.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.84.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.85.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.85.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.85.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.86.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.86.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.86.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.87.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.87.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.87.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.88.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.88.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.88.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.89.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.89.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.89.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.90.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.90.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.90.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.91.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.91.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.91.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.92.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.92.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.92.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.93.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.93.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.93.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.94.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.94.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.94.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.95.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.95.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.95.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.96.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.96.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.96.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.97.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.97.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.97.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.98.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.98.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.98.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.99.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.99.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.99.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.100.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.100.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.100.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.101.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.101.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.101.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.102.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.102.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.102.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.103.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.103.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.103.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.104.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.104.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.104.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.105.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.105.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.105.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.106.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.106.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.106.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.107.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.107.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.107.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.108.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.108.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.108.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.109.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.109.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.109.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.110.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.110.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.110.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.111.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.111.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.111.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.112.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.112.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.112.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.113.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.113.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.113.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.114.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.114.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.114.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.115.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.115.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.115.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.116.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.116.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.116.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.117.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.117.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.117.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.118.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.118.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.118.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.119.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.119.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.119.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.120.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.120.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.120.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.121.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.121.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.121.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.122.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.122.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.122.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.123.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.123.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.123.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.124.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.124.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.124.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.125.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.125.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.125.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.126.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.126.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.126.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.127.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.127.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.127.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.128.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.128.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.128.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.129.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.129.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.129.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.130.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.130.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.130.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.131.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.131.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.131.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.132.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.132.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.132.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.133.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.133.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.133.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.134.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.134.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.134.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.135.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.135.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.135.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.136.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.136.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.136.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.137.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.137.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.137.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.138.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.138.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.138.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.139.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.139.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.139.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.140.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.140.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.140.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.141.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.141.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.141.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.142.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.142.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.142.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.143.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.143.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.143.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.144.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.144.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.144.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.145.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.145.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.145.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.146.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.146.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.146.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.147.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.147.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.147.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.148.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.148.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.148.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.149.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.149.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.149.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.150.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.150.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.150.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.151.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.151.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.151.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.152.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.152.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.152.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.153.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.153.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.153.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.154.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.154.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.154.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.155.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.155.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.155.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.156.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.156.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.156.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.157.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.157.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.157.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.158.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.158.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.158.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.159.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.159.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.mlp.experts.159.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.6.input_layernorm.weight": "model-00006-of-000055.safetensors", + "model.layers.6.post_attention_layernorm.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.q_a_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.q_a_layernorm.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.q_b_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.kv_a_proj_with_mqa.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.kv_a_layernorm.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.kv_b_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.self_attn.o_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.gate.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.shared_experts.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.shared_experts.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.shared_experts.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.0.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.0.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.0.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.1.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.1.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.1.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.2.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.2.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.2.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.3.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.3.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.3.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.4.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.4.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.4.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.5.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.5.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.5.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.6.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.6.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.6.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.7.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.7.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.7.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.8.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.8.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.8.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.9.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.9.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.9.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.10.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.10.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.10.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.11.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.11.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.11.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.12.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.12.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.12.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.13.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.13.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.13.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.14.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.14.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.14.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.15.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.15.up_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.15.down_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.16.gate_proj.weight": "model-00006-of-000055.safetensors", + "model.layers.7.mlp.experts.16.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.16.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.17.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.17.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.17.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.18.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.18.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.18.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.19.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.19.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.19.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.20.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.20.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.20.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.21.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.21.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.21.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.22.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.22.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.22.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.23.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.23.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.23.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.24.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.24.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.24.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.25.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.25.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.25.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.26.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.26.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.26.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.27.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.27.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.27.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.28.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.28.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.28.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.29.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.29.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.29.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.30.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.30.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.30.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.31.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.31.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.31.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.32.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.32.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.32.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.33.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.33.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.33.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.34.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.34.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.34.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.35.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.35.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.35.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.36.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.36.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.36.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.37.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.37.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.37.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.38.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.38.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.38.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.39.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.39.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.39.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.40.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.40.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.40.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.41.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.41.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.41.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.42.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.42.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.42.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.43.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.43.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.43.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.44.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.44.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.44.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.45.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.45.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.45.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.46.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.46.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.46.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.47.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.47.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.47.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.48.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.48.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.48.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.49.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.49.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.49.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.50.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.50.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.50.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.51.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.51.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.51.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.52.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.52.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.52.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.53.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.53.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.53.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.54.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.54.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.54.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.55.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.55.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.55.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.56.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.56.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.56.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.57.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.57.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.57.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.58.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.58.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.58.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.59.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.59.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.59.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.60.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.60.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.60.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.61.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.61.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.61.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.62.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.62.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.62.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.63.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.63.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.63.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.64.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.64.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.64.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.65.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.65.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.65.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.66.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.66.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.66.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.67.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.67.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.67.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.68.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.68.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.68.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.69.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.69.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.69.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.70.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.70.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.70.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.71.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.71.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.71.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.72.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.72.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.72.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.73.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.73.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.73.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.74.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.74.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.74.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.75.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.75.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.75.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.76.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.76.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.76.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.77.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.77.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.77.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.78.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.78.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.78.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.79.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.79.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.79.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.80.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.80.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.80.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.81.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.81.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.81.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.82.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.82.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.82.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.83.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.83.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.83.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.84.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.84.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.84.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.85.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.85.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.85.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.86.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.86.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.86.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.87.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.87.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.87.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.88.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.88.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.88.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.89.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.89.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.89.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.90.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.90.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.90.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.91.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.91.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.91.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.92.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.92.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.92.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.93.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.93.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.93.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.94.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.94.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.94.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.95.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.95.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.95.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.96.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.96.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.96.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.97.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.97.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.97.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.98.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.98.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.98.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.99.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.99.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.99.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.100.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.100.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.100.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.101.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.101.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.101.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.102.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.102.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.102.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.103.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.103.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.103.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.104.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.104.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.104.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.105.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.105.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.105.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.106.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.106.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.106.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.107.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.107.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.107.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.108.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.108.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.108.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.109.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.109.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.109.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.110.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.110.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.110.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.111.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.111.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.111.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.112.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.112.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.112.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.113.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.113.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.113.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.114.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.114.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.114.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.115.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.115.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.115.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.116.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.116.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.116.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.117.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.117.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.117.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.118.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.118.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.118.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.119.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.119.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.119.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.120.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.120.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.120.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.121.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.121.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.121.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.122.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.122.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.122.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.123.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.123.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.123.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.124.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.124.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.124.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.125.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.125.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.125.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.126.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.126.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.126.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.127.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.127.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.127.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.128.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.128.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.128.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.129.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.129.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.129.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.130.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.130.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.130.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.131.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.131.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.131.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.132.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.132.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.132.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.133.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.133.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.133.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.134.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.134.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.134.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.135.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.135.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.135.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.136.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.136.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.136.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.137.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.137.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.137.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.138.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.138.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.138.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.139.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.139.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.139.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.140.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.140.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.140.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.141.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.141.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.141.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.142.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.142.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.142.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.143.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.143.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.143.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.144.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.144.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.144.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.145.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.145.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.145.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.146.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.146.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.146.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.147.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.147.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.147.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.148.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.148.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.148.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.149.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.149.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.149.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.150.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.150.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.150.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.151.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.151.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.151.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.152.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.152.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.152.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.153.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.153.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.153.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.154.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.154.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.154.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.155.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.155.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.155.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.156.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.156.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.156.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.157.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.157.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.157.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.158.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.158.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.158.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.159.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.159.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.mlp.experts.159.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.7.input_layernorm.weight": "model-00007-of-000055.safetensors", + "model.layers.7.post_attention_layernorm.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.q_a_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.q_a_layernorm.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.q_b_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.kv_a_proj_with_mqa.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.kv_a_layernorm.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.kv_b_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.self_attn.o_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.gate.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.shared_experts.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.shared_experts.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.shared_experts.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.0.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.0.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.0.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.1.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.1.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.1.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.2.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.2.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.2.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.3.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.3.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.3.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.4.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.4.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.4.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.5.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.5.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.5.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.6.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.6.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.6.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.7.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.7.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.7.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.8.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.8.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.8.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.9.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.9.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.9.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.10.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.10.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.10.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.11.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.11.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.11.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.12.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.12.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.12.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.13.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.13.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.13.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.14.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.14.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.14.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.15.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.15.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.15.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.16.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.16.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.16.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.17.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.17.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.17.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.18.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.18.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.18.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.19.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.19.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.19.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.20.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.20.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.20.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.21.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.21.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.21.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.22.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.22.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.22.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.23.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.23.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.23.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.24.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.24.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.24.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.25.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.25.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.25.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.26.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.26.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.26.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.27.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.27.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.27.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.28.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.28.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.28.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.29.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.29.up_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.29.down_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.30.gate_proj.weight": "model-00007-of-000055.safetensors", + "model.layers.8.mlp.experts.30.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.30.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.31.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.31.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.31.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.32.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.32.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.32.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.33.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.33.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.33.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.34.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.34.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.34.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.35.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.35.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.35.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.36.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.36.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.36.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.37.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.37.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.37.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.38.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.38.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.38.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.39.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.39.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.39.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.40.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.40.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.40.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.41.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.41.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.41.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.42.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.42.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.42.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.43.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.43.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.43.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.44.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.44.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.44.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.45.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.45.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.45.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.46.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.46.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.46.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.47.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.47.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.47.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.48.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.48.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.48.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.49.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.49.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.49.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.50.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.50.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.50.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.51.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.51.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.51.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.52.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.52.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.52.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.53.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.53.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.53.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.54.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.54.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.54.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.55.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.55.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.55.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.56.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.56.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.56.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.57.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.57.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.57.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.58.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.58.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.58.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.59.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.59.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.59.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.60.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.60.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.60.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.61.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.61.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.61.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.62.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.62.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.62.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.63.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.63.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.63.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.64.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.64.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.64.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.65.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.65.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.65.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.66.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.66.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.66.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.67.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.67.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.67.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.68.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.68.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.68.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.69.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.69.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.69.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.70.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.70.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.70.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.71.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.71.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.71.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.72.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.72.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.72.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.73.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.73.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.73.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.74.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.74.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.74.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.75.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.75.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.75.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.76.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.76.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.76.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.77.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.77.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.77.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.78.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.78.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.78.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.79.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.79.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.79.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.80.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.80.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.80.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.81.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.81.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.81.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.82.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.82.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.82.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.83.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.83.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.83.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.84.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.84.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.84.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.85.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.85.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.85.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.86.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.86.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.86.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.87.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.87.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.87.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.88.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.88.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.88.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.89.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.89.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.89.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.90.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.90.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.90.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.91.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.91.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.91.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.92.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.92.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.92.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.93.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.93.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.93.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.94.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.94.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.94.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.95.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.95.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.95.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.96.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.96.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.96.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.97.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.97.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.97.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.98.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.98.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.98.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.99.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.99.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.99.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.100.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.100.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.100.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.101.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.101.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.101.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.102.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.102.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.102.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.103.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.103.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.103.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.104.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.104.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.104.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.105.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.105.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.105.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.106.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.106.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.106.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.107.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.107.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.107.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.108.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.108.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.108.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.109.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.109.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.109.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.110.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.110.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.110.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.111.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.111.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.111.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.112.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.112.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.112.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.113.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.113.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.113.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.114.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.114.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.114.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.115.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.115.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.115.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.116.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.116.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.116.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.117.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.117.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.117.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.118.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.118.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.118.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.119.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.119.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.119.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.120.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.120.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.120.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.121.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.121.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.121.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.122.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.122.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.122.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.123.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.123.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.123.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.124.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.124.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.124.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.125.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.125.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.125.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.126.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.126.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.126.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.127.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.127.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.127.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.128.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.128.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.128.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.129.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.129.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.129.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.130.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.130.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.130.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.131.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.131.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.131.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.132.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.132.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.132.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.133.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.133.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.133.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.134.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.134.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.134.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.135.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.135.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.135.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.136.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.136.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.136.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.137.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.137.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.137.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.138.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.138.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.138.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.139.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.139.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.139.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.140.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.140.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.140.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.141.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.141.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.141.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.142.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.142.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.142.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.143.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.143.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.143.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.144.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.144.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.144.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.145.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.145.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.145.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.146.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.146.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.146.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.147.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.147.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.147.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.148.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.148.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.148.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.149.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.149.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.149.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.150.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.150.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.150.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.151.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.151.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.151.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.152.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.152.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.152.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.153.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.153.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.153.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.154.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.154.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.154.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.155.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.155.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.155.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.156.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.156.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.156.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.157.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.157.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.157.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.158.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.158.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.158.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.159.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.159.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.mlp.experts.159.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.8.input_layernorm.weight": "model-00008-of-000055.safetensors", + "model.layers.8.post_attention_layernorm.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.q_a_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.q_a_layernorm.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.q_b_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.kv_a_proj_with_mqa.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.kv_a_layernorm.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.kv_b_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.self_attn.o_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.gate.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.shared_experts.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.shared_experts.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.shared_experts.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.0.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.0.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.0.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.1.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.1.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.1.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.2.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.2.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.2.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.3.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.3.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.3.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.4.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.4.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.4.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.5.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.5.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.5.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.6.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.6.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.6.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.7.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.7.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.7.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.8.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.8.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.8.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.9.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.9.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.9.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.10.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.10.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.10.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.11.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.11.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.11.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.12.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.12.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.12.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.13.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.13.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.13.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.14.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.14.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.14.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.15.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.15.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.15.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.16.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.16.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.16.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.17.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.17.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.17.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.18.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.18.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.18.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.19.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.19.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.19.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.20.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.20.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.20.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.21.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.21.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.21.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.22.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.22.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.22.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.23.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.23.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.23.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.24.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.24.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.24.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.25.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.25.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.25.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.26.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.26.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.26.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.27.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.27.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.27.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.28.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.28.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.28.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.29.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.29.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.29.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.30.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.30.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.30.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.31.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.31.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.31.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.32.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.32.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.32.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.33.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.33.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.33.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.34.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.34.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.34.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.35.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.35.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.35.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.36.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.36.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.36.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.37.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.37.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.37.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.38.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.38.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.38.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.39.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.39.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.39.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.40.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.40.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.40.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.41.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.41.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.41.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.42.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.42.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.42.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.43.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.43.up_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.43.down_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.44.gate_proj.weight": "model-00008-of-000055.safetensors", + "model.layers.9.mlp.experts.44.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.44.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.45.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.45.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.45.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.46.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.46.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.46.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.47.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.47.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.47.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.48.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.48.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.48.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.49.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.49.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.49.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.50.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.50.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.50.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.51.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.51.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.51.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.52.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.52.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.52.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.53.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.53.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.53.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.54.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.54.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.54.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.55.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.55.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.55.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.56.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.56.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.56.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.57.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.57.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.57.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.58.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.58.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.58.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.59.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.59.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.59.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.60.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.60.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.60.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.61.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.61.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.61.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.62.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.62.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.62.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.63.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.63.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.63.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.64.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.64.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.64.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.65.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.65.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.65.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.66.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.66.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.66.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.67.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.67.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.67.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.68.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.68.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.68.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.69.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.69.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.69.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.70.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.70.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.70.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.71.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.71.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.71.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.72.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.72.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.72.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.73.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.73.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.73.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.74.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.74.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.74.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.75.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.75.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.75.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.76.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.76.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.76.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.77.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.77.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.77.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.78.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.78.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.78.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.79.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.79.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.79.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.80.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.80.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.80.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.81.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.81.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.81.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.82.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.82.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.82.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.83.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.83.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.83.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.84.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.84.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.84.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.85.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.85.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.85.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.86.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.86.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.86.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.87.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.87.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.87.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.88.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.88.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.88.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.89.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.89.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.89.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.90.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.90.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.90.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.91.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.91.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.91.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.92.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.92.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.92.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.93.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.93.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.93.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.94.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.94.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.94.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.95.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.95.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.95.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.96.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.96.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.96.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.97.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.97.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.97.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.98.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.98.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.98.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.99.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.99.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.99.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.100.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.100.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.100.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.101.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.101.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.101.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.102.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.102.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.102.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.103.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.103.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.103.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.104.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.104.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.104.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.105.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.105.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.105.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.106.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.106.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.106.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.107.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.107.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.107.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.108.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.108.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.108.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.109.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.109.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.109.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.110.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.110.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.110.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.111.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.111.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.111.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.112.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.112.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.112.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.113.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.113.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.113.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.114.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.114.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.114.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.115.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.115.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.115.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.116.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.116.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.116.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.117.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.117.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.117.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.118.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.118.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.118.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.119.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.119.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.119.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.120.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.120.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.120.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.121.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.121.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.121.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.122.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.122.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.122.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.123.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.123.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.123.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.124.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.124.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.124.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.125.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.125.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.125.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.126.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.126.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.126.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.127.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.127.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.127.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.128.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.128.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.128.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.129.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.129.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.129.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.130.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.130.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.130.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.131.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.131.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.131.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.132.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.132.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.132.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.133.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.133.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.133.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.134.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.134.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.134.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.135.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.135.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.135.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.136.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.136.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.136.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.137.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.137.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.137.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.138.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.138.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.138.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.139.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.139.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.139.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.140.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.140.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.140.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.141.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.141.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.141.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.142.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.142.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.142.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.143.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.143.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.143.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.144.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.144.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.144.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.145.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.145.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.145.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.146.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.146.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.146.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.147.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.147.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.147.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.148.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.148.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.148.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.149.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.149.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.149.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.150.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.150.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.150.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.151.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.151.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.151.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.152.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.152.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.152.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.153.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.153.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.153.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.154.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.154.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.154.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.155.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.155.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.155.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.156.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.156.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.156.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.157.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.157.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.157.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.158.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.158.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.158.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.159.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.159.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.mlp.experts.159.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.9.input_layernorm.weight": "model-00009-of-000055.safetensors", + "model.layers.9.post_attention_layernorm.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.q_a_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.q_a_layernorm.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.q_b_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.kv_a_proj_with_mqa.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.kv_a_layernorm.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.kv_b_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.self_attn.o_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.gate.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.shared_experts.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.shared_experts.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.shared_experts.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.0.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.0.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.0.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.1.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.1.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.1.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.2.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.2.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.2.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.3.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.3.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.3.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.4.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.4.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.4.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.5.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.5.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.5.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.6.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.6.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.6.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.7.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.7.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.7.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.8.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.8.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.8.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.9.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.9.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.9.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.10.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.10.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.10.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.11.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.11.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.11.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.12.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.12.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.12.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.13.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.13.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.13.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.14.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.14.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.14.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.15.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.15.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.15.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.16.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.16.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.16.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.17.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.17.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.17.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.18.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.18.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.18.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.19.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.19.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.19.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.20.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.20.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.20.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.21.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.21.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.21.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.22.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.22.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.22.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.23.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.23.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.23.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.24.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.24.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.24.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.25.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.25.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.25.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.26.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.26.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.26.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.27.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.27.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.27.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.28.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.28.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.28.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.29.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.29.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.29.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.30.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.30.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.30.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.31.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.31.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.31.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.32.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.32.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.32.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.33.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.33.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.33.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.34.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.34.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.34.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.35.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.35.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.35.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.36.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.36.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.36.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.37.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.37.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.37.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.38.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.38.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.38.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.39.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.39.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.39.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.40.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.40.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.40.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.41.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.41.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.41.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.42.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.42.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.42.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.43.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.43.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.43.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.44.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.44.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.44.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.45.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.45.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.45.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.46.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.46.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.46.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.47.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.47.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.47.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.48.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.48.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.48.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.49.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.49.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.49.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.50.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.50.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.50.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.51.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.51.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.51.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.52.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.52.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.52.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.53.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.53.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.53.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.54.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.54.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.54.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.55.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.55.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.55.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.56.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.56.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.56.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.57.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.57.up_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.57.down_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.58.gate_proj.weight": "model-00009-of-000055.safetensors", + "model.layers.10.mlp.experts.58.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.58.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.59.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.59.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.59.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.60.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.60.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.60.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.61.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.61.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.61.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.62.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.62.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.62.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.63.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.63.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.63.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.64.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.64.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.64.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.65.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.65.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.65.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.66.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.66.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.66.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.67.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.67.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.67.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.68.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.68.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.68.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.69.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.69.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.69.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.70.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.70.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.70.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.71.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.71.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.71.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.72.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.72.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.72.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.73.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.73.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.73.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.74.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.74.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.74.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.75.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.75.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.75.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.76.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.76.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.76.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.77.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.77.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.77.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.78.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.78.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.78.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.79.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.79.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.79.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.80.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.80.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.80.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.81.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.81.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.81.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.82.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.82.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.82.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.83.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.83.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.83.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.84.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.84.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.84.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.85.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.85.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.85.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.86.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.86.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.86.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.87.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.87.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.87.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.88.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.88.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.88.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.89.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.89.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.89.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.90.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.90.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.90.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.91.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.91.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.91.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.92.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.92.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.92.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.93.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.93.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.93.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.94.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.94.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.94.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.95.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.95.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.95.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.96.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.96.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.96.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.97.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.97.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.97.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.98.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.98.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.98.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.99.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.99.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.99.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.100.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.100.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.100.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.101.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.101.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.101.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.102.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.102.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.102.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.103.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.103.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.103.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.104.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.104.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.104.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.105.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.105.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.105.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.106.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.106.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.106.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.107.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.107.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.107.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.108.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.108.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.108.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.109.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.109.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.109.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.110.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.110.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.110.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.111.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.111.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.111.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.112.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.112.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.112.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.113.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.113.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.113.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.114.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.114.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.114.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.115.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.115.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.115.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.116.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.116.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.116.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.117.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.117.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.117.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.118.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.118.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.118.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.119.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.119.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.119.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.120.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.120.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.120.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.121.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.121.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.121.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.122.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.122.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.122.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.123.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.123.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.123.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.124.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.124.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.124.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.125.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.125.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.125.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.126.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.126.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.126.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.127.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.127.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.127.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.128.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.128.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.128.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.129.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.129.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.129.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.130.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.130.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.130.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.131.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.131.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.131.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.132.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.132.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.132.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.133.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.133.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.133.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.134.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.134.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.134.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.135.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.135.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.135.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.136.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.136.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.136.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.137.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.137.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.137.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.138.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.138.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.138.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.139.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.139.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.139.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.140.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.140.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.140.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.141.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.141.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.141.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.142.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.142.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.142.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.143.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.143.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.143.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.144.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.144.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.144.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.145.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.145.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.145.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.146.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.146.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.146.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.147.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.147.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.147.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.148.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.148.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.148.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.149.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.149.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.149.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.150.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.150.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.150.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.151.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.151.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.151.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.152.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.152.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.152.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.153.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.153.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.153.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.154.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.154.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.154.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.155.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.155.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.155.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.156.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.156.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.156.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.157.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.157.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.157.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.158.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.158.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.158.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.159.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.159.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.mlp.experts.159.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.10.input_layernorm.weight": "model-00010-of-000055.safetensors", + "model.layers.10.post_attention_layernorm.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.q_a_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.q_a_layernorm.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.q_b_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.kv_a_proj_with_mqa.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.kv_a_layernorm.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.kv_b_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.self_attn.o_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.gate.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.shared_experts.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.shared_experts.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.shared_experts.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.0.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.0.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.0.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.1.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.1.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.1.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.2.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.2.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.2.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.3.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.3.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.3.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.4.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.4.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.4.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.5.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.5.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.5.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.6.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.6.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.6.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.7.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.7.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.7.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.8.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.8.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.8.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.9.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.9.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.9.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.10.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.10.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.10.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.11.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.11.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.11.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.12.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.12.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.12.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.13.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.13.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.13.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.14.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.14.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.14.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.15.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.15.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.15.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.16.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.16.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.16.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.17.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.17.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.17.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.18.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.18.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.18.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.19.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.19.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.19.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.20.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.20.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.20.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.21.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.21.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.21.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.22.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.22.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.22.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.23.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.23.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.23.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.24.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.24.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.24.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.25.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.25.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.25.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.26.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.26.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.26.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.27.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.27.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.27.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.28.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.28.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.28.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.29.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.29.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.29.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.30.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.30.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.30.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.31.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.31.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.31.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.32.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.32.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.32.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.33.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.33.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.33.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.34.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.34.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.34.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.35.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.35.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.35.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.36.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.36.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.36.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.37.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.37.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.37.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.38.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.38.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.38.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.39.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.39.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.39.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.40.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.40.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.40.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.41.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.41.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.41.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.42.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.42.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.42.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.43.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.43.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.43.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.44.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.44.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.44.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.45.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.45.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.45.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.46.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.46.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.46.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.47.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.47.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.47.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.48.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.48.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.48.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.49.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.49.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.49.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.50.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.50.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.50.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.51.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.51.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.51.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.52.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.52.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.52.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.53.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.53.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.53.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.54.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.54.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.54.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.55.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.55.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.55.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.56.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.56.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.56.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.57.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.57.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.57.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.58.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.58.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.58.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.59.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.59.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.59.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.60.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.60.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.60.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.61.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.61.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.61.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.62.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.62.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.62.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.63.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.63.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.63.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.64.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.64.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.64.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.65.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.65.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.65.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.66.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.66.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.66.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.67.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.67.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.67.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.68.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.68.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.68.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.69.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.69.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.69.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.70.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.70.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.70.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.71.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.71.up_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.71.down_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.72.gate_proj.weight": "model-00010-of-000055.safetensors", + "model.layers.11.mlp.experts.72.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.72.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.73.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.73.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.73.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.74.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.74.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.74.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.75.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.75.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.75.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.76.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.76.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.76.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.77.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.77.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.77.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.78.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.78.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.78.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.79.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.79.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.79.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.80.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.80.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.80.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.81.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.81.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.81.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.82.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.82.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.82.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.83.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.83.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.83.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.84.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.84.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.84.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.85.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.85.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.85.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.86.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.86.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.86.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.87.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.87.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.87.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.88.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.88.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.88.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.89.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.89.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.89.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.90.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.90.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.90.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.91.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.91.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.91.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.92.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.92.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.92.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.93.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.93.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.93.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.94.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.94.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.94.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.95.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.95.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.95.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.96.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.96.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.96.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.97.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.97.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.97.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.98.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.98.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.98.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.99.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.99.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.99.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.100.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.100.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.100.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.101.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.101.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.101.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.102.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.102.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.102.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.103.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.103.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.103.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.104.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.104.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.104.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.105.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.105.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.105.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.106.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.106.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.106.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.107.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.107.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.107.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.108.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.108.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.108.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.109.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.109.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.109.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.110.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.110.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.110.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.111.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.111.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.111.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.112.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.112.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.112.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.113.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.113.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.113.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.114.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.114.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.114.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.115.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.115.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.115.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.116.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.116.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.116.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.117.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.117.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.117.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.118.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.118.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.118.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.119.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.119.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.119.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.120.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.120.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.120.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.121.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.121.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.121.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.122.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.122.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.122.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.123.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.123.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.123.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.124.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.124.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.124.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.125.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.125.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.125.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.126.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.126.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.126.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.127.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.127.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.127.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.128.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.128.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.128.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.129.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.129.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.129.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.130.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.130.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.130.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.131.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.131.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.131.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.132.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.132.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.132.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.133.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.133.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.133.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.134.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.134.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.134.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.135.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.135.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.135.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.136.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.136.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.136.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.137.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.137.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.137.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.138.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.138.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.138.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.139.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.139.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.139.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.140.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.140.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.140.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.141.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.141.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.141.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.142.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.142.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.142.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.143.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.143.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.143.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.144.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.144.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.144.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.145.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.145.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.145.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.146.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.146.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.146.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.147.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.147.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.147.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.148.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.148.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.148.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.149.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.149.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.149.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.150.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.150.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.150.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.151.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.151.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.151.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.152.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.152.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.152.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.153.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.153.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.153.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.154.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.154.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.154.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.155.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.155.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.155.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.156.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.156.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.156.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.157.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.157.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.157.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.158.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.158.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.158.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.159.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.159.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.mlp.experts.159.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.11.input_layernorm.weight": "model-00011-of-000055.safetensors", + "model.layers.11.post_attention_layernorm.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.q_a_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.q_a_layernorm.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.q_b_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.kv_a_proj_with_mqa.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.kv_a_layernorm.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.kv_b_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.self_attn.o_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.gate.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.shared_experts.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.shared_experts.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.shared_experts.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.0.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.0.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.0.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.1.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.1.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.1.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.2.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.2.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.2.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.3.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.3.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.3.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.4.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.4.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.4.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.5.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.5.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.5.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.6.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.6.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.6.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.7.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.7.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.7.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.8.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.8.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.8.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.9.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.9.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.9.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.10.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.10.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.10.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.11.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.11.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.11.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.12.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.12.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.12.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.13.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.13.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.13.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.14.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.14.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.14.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.15.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.15.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.15.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.16.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.16.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.16.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.17.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.17.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.17.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.18.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.18.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.18.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.19.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.19.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.19.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.20.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.20.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.20.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.21.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.21.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.21.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.22.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.22.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.22.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.23.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.23.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.23.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.24.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.24.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.24.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.25.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.25.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.25.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.26.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.26.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.26.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.27.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.27.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.27.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.28.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.28.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.28.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.29.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.29.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.29.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.30.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.30.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.30.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.31.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.31.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.31.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.32.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.32.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.32.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.33.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.33.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.33.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.34.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.34.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.34.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.35.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.35.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.35.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.36.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.36.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.36.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.37.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.37.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.37.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.38.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.38.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.38.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.39.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.39.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.39.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.40.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.40.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.40.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.41.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.41.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.41.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.42.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.42.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.42.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.43.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.43.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.43.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.44.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.44.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.44.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.45.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.45.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.45.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.46.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.46.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.46.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.47.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.47.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.47.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.48.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.48.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.48.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.49.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.49.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.49.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.50.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.50.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.50.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.51.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.51.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.51.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.52.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.52.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.52.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.53.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.53.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.53.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.54.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.54.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.54.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.55.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.55.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.55.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.56.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.56.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.56.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.57.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.57.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.57.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.58.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.58.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.58.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.59.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.59.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.59.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.60.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.60.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.60.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.61.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.61.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.61.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.62.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.62.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.62.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.63.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.63.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.63.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.64.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.64.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.64.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.65.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.65.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.65.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.66.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.66.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.66.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.67.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.67.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.67.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.68.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.68.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.68.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.69.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.69.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.69.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.70.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.70.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.70.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.71.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.71.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.71.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.72.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.72.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.72.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.73.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.73.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.73.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.74.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.74.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.74.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.75.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.75.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.75.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.76.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.76.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.76.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.77.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.77.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.77.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.78.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.78.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.78.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.79.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.79.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.79.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.80.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.80.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.80.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.81.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.81.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.81.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.82.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.82.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.82.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.83.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.83.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.83.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.84.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.84.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.84.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.85.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.85.up_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.85.down_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.86.gate_proj.weight": "model-00011-of-000055.safetensors", + "model.layers.12.mlp.experts.86.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.86.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.87.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.87.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.87.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.88.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.88.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.88.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.89.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.89.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.89.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.90.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.90.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.90.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.91.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.91.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.91.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.92.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.92.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.92.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.93.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.93.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.93.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.94.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.94.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.94.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.95.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.95.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.95.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.96.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.96.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.96.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.97.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.97.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.97.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.98.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.98.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.98.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.99.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.99.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.99.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.100.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.100.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.100.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.101.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.101.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.101.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.102.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.102.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.102.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.103.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.103.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.103.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.104.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.104.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.104.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.105.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.105.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.105.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.106.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.106.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.106.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.107.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.107.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.107.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.108.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.108.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.108.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.109.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.109.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.109.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.110.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.110.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.110.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.111.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.111.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.111.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.112.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.112.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.112.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.113.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.113.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.113.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.114.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.114.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.114.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.115.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.115.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.115.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.116.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.116.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.116.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.117.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.117.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.117.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.118.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.118.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.118.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.119.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.119.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.119.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.120.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.120.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.120.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.121.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.121.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.121.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.122.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.122.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.122.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.123.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.123.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.123.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.124.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.124.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.124.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.125.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.125.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.125.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.126.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.126.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.126.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.127.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.127.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.127.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.128.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.128.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.128.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.129.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.129.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.129.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.130.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.130.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.130.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.131.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.131.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.131.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.132.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.132.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.132.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.133.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.133.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.133.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.134.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.134.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.134.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.135.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.135.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.135.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.136.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.136.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.136.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.137.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.137.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.137.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.138.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.138.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.138.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.139.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.139.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.139.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.140.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.140.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.140.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.141.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.141.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.141.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.142.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.142.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.142.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.143.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.143.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.143.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.144.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.144.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.144.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.145.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.145.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.145.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.146.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.146.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.146.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.147.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.147.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.147.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.148.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.148.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.148.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.149.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.149.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.149.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.150.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.150.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.150.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.151.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.151.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.151.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.152.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.152.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.152.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.153.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.153.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.153.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.154.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.154.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.154.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.155.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.155.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.155.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.156.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.156.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.156.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.157.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.157.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.157.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.158.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.158.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.158.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.159.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.159.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.mlp.experts.159.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.12.input_layernorm.weight": "model-00012-of-000055.safetensors", + "model.layers.12.post_attention_layernorm.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.q_a_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.q_a_layernorm.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.q_b_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.kv_a_proj_with_mqa.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.kv_a_layernorm.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.kv_b_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.self_attn.o_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.gate.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.shared_experts.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.shared_experts.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.shared_experts.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.0.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.0.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.0.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.1.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.1.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.1.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.2.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.2.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.2.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.3.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.3.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.3.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.4.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.4.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.4.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.5.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.5.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.5.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.6.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.6.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.6.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.7.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.7.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.7.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.8.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.8.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.8.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.9.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.9.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.9.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.10.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.10.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.10.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.11.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.11.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.11.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.12.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.12.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.12.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.13.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.13.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.13.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.14.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.14.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.14.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.15.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.15.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.15.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.16.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.16.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.16.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.17.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.17.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.17.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.18.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.18.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.18.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.19.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.19.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.19.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.20.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.20.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.20.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.21.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.21.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.21.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.22.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.22.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.22.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.23.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.23.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.23.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.24.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.24.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.24.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.25.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.25.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.25.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.26.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.26.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.26.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.27.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.27.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.27.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.28.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.28.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.28.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.29.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.29.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.29.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.30.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.30.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.30.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.31.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.31.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.31.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.32.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.32.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.32.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.33.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.33.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.33.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.34.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.34.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.34.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.35.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.35.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.35.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.36.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.36.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.36.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.37.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.37.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.37.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.38.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.38.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.38.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.39.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.39.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.39.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.40.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.40.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.40.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.41.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.41.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.41.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.42.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.42.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.42.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.43.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.43.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.43.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.44.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.44.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.44.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.45.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.45.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.45.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.46.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.46.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.46.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.47.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.47.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.47.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.48.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.48.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.48.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.49.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.49.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.49.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.50.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.50.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.50.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.51.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.51.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.51.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.52.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.52.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.52.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.53.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.53.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.53.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.54.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.54.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.54.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.55.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.55.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.55.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.56.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.56.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.56.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.57.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.57.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.57.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.58.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.58.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.58.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.59.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.59.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.59.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.60.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.60.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.60.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.61.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.61.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.61.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.62.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.62.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.62.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.63.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.63.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.63.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.64.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.64.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.64.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.65.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.65.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.65.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.66.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.66.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.66.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.67.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.67.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.67.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.68.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.68.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.68.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.69.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.69.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.69.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.70.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.70.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.70.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.71.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.71.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.71.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.72.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.72.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.72.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.73.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.73.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.73.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.74.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.74.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.74.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.75.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.75.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.75.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.76.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.76.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.76.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.77.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.77.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.77.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.78.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.78.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.78.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.79.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.79.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.79.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.80.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.80.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.80.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.81.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.81.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.81.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.82.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.82.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.82.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.83.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.83.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.83.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.84.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.84.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.84.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.85.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.85.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.85.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.86.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.86.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.86.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.87.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.87.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.87.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.88.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.88.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.88.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.89.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.89.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.89.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.90.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.90.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.90.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.91.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.91.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.91.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.92.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.92.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.92.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.93.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.93.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.93.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.94.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.94.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.94.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.95.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.95.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.95.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.96.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.96.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.96.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.97.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.97.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.97.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.98.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.98.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.98.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.99.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.99.up_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.99.down_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.100.gate_proj.weight": "model-00012-of-000055.safetensors", + "model.layers.13.mlp.experts.100.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.100.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.101.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.101.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.101.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.102.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.102.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.102.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.103.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.103.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.103.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.104.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.104.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.104.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.105.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.105.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.105.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.106.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.106.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.106.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.107.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.107.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.107.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.108.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.108.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.108.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.109.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.109.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.109.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.110.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.110.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.110.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.111.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.111.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.111.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.112.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.112.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.112.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.113.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.113.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.113.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.114.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.114.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.114.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.115.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.115.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.115.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.116.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.116.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.116.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.117.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.117.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.117.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.118.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.118.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.118.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.119.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.119.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.119.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.120.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.120.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.120.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.121.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.121.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.121.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.122.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.122.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.122.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.123.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.123.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.123.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.124.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.124.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.124.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.125.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.125.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.125.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.126.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.126.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.126.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.127.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.127.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.127.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.128.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.128.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.128.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.129.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.129.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.129.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.130.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.130.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.130.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.131.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.131.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.131.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.132.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.132.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.132.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.133.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.133.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.133.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.134.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.134.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.134.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.135.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.135.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.135.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.136.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.136.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.136.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.137.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.137.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.137.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.138.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.138.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.138.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.139.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.139.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.139.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.140.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.140.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.140.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.141.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.141.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.141.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.142.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.142.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.142.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.143.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.143.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.143.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.144.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.144.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.144.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.145.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.145.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.145.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.146.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.146.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.146.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.147.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.147.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.147.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.148.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.148.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.148.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.149.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.149.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.149.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.150.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.150.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.150.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.151.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.151.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.151.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.152.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.152.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.152.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.153.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.153.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.153.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.154.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.154.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.154.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.155.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.155.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.155.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.156.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.156.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.156.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.157.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.157.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.157.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.158.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.158.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.158.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.159.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.159.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.mlp.experts.159.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.13.input_layernorm.weight": "model-00013-of-000055.safetensors", + "model.layers.13.post_attention_layernorm.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.q_a_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.q_a_layernorm.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.q_b_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.kv_a_proj_with_mqa.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.kv_a_layernorm.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.kv_b_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.self_attn.o_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.gate.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.shared_experts.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.shared_experts.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.shared_experts.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.0.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.0.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.0.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.1.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.1.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.1.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.2.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.2.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.2.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.3.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.3.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.3.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.4.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.4.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.4.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.5.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.5.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.5.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.6.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.6.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.6.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.7.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.7.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.7.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.8.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.8.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.8.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.9.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.9.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.9.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.10.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.10.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.10.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.11.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.11.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.11.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.12.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.12.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.12.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.13.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.13.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.13.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.14.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.14.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.14.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.15.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.15.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.15.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.16.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.16.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.16.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.17.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.17.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.17.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.18.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.18.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.18.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.19.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.19.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.19.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.20.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.20.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.20.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.21.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.21.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.21.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.22.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.22.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.22.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.23.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.23.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.23.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.24.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.24.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.24.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.25.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.25.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.25.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.26.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.26.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.26.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.27.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.27.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.27.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.28.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.28.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.28.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.29.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.29.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.29.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.30.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.30.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.30.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.31.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.31.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.31.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.32.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.32.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.32.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.33.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.33.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.33.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.34.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.34.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.34.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.35.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.35.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.35.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.36.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.36.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.36.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.37.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.37.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.37.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.38.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.38.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.38.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.39.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.39.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.39.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.40.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.40.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.40.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.41.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.41.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.41.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.42.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.42.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.42.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.43.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.43.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.43.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.44.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.44.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.44.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.45.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.45.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.45.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.46.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.46.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.46.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.47.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.47.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.47.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.48.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.48.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.48.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.49.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.49.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.49.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.50.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.50.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.50.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.51.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.51.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.51.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.52.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.52.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.52.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.53.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.53.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.53.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.54.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.54.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.54.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.55.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.55.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.55.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.56.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.56.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.56.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.57.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.57.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.57.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.58.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.58.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.58.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.59.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.59.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.59.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.60.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.60.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.60.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.61.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.61.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.61.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.62.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.62.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.62.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.63.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.63.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.63.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.64.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.64.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.64.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.65.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.65.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.65.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.66.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.66.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.66.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.67.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.67.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.67.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.68.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.68.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.68.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.69.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.69.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.69.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.70.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.70.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.70.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.71.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.71.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.71.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.72.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.72.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.72.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.73.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.73.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.73.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.74.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.74.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.74.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.75.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.75.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.75.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.76.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.76.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.76.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.77.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.77.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.77.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.78.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.78.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.78.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.79.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.79.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.79.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.80.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.80.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.80.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.81.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.81.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.81.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.82.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.82.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.82.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.83.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.83.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.83.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.84.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.84.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.84.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.85.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.85.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.85.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.86.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.86.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.86.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.87.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.87.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.87.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.88.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.88.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.88.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.89.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.89.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.89.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.90.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.90.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.90.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.91.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.91.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.91.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.92.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.92.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.92.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.93.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.93.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.93.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.94.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.94.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.94.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.95.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.95.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.95.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.96.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.96.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.96.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.97.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.97.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.97.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.98.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.98.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.98.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.99.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.99.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.99.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.100.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.100.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.100.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.101.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.101.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.101.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.102.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.102.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.102.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.103.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.103.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.103.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.104.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.104.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.104.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.105.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.105.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.105.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.106.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.106.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.106.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.107.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.107.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.107.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.108.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.108.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.108.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.109.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.109.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.109.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.110.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.110.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.110.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.111.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.111.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.111.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.112.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.112.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.112.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.113.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.113.up_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.113.down_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.114.gate_proj.weight": "model-00013-of-000055.safetensors", + "model.layers.14.mlp.experts.114.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.114.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.115.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.115.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.115.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.116.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.116.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.116.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.117.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.117.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.117.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.118.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.118.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.118.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.119.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.119.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.119.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.120.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.120.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.120.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.121.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.121.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.121.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.122.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.122.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.122.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.123.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.123.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.123.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.124.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.124.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.124.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.125.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.125.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.125.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.126.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.126.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.126.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.127.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.127.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.127.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.128.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.128.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.128.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.129.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.129.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.129.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.130.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.130.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.130.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.131.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.131.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.131.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.132.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.132.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.132.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.133.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.133.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.133.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.134.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.134.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.134.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.135.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.135.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.135.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.136.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.136.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.136.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.137.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.137.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.137.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.138.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.138.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.138.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.139.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.139.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.139.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.140.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.140.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.140.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.141.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.141.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.141.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.142.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.142.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.142.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.143.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.143.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.143.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.144.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.144.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.144.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.145.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.145.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.145.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.146.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.146.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.146.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.147.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.147.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.147.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.148.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.148.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.148.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.149.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.149.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.149.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.150.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.150.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.150.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.151.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.151.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.151.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.152.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.152.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.152.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.153.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.153.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.153.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.154.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.154.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.154.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.155.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.155.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.155.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.156.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.156.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.156.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.157.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.157.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.157.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.158.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.158.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.158.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.159.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.159.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.mlp.experts.159.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.14.input_layernorm.weight": "model-00014-of-000055.safetensors", + "model.layers.14.post_attention_layernorm.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.q_a_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.q_a_layernorm.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.q_b_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.kv_a_proj_with_mqa.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.kv_a_layernorm.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.kv_b_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.self_attn.o_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.gate.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.shared_experts.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.shared_experts.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.shared_experts.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.0.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.0.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.0.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.1.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.1.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.1.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.2.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.2.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.2.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.3.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.3.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.3.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.4.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.4.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.4.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.5.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.5.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.5.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.6.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.6.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.6.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.7.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.7.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.7.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.8.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.8.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.8.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.9.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.9.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.9.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.10.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.10.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.10.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.11.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.11.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.11.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.12.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.12.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.12.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.13.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.13.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.13.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.14.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.14.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.14.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.15.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.15.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.15.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.16.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.16.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.16.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.17.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.17.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.17.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.18.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.18.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.18.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.19.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.19.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.19.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.20.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.20.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.20.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.21.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.21.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.21.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.22.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.22.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.22.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.23.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.23.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.23.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.24.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.24.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.24.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.25.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.25.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.25.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.26.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.26.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.26.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.27.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.27.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.27.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.28.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.28.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.28.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.29.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.29.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.29.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.30.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.30.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.30.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.31.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.31.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.31.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.32.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.32.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.32.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.33.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.33.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.33.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.34.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.34.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.34.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.35.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.35.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.35.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.36.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.36.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.36.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.37.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.37.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.37.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.38.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.38.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.38.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.39.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.39.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.39.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.40.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.40.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.40.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.41.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.41.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.41.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.42.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.42.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.42.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.43.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.43.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.43.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.44.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.44.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.44.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.45.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.45.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.45.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.46.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.46.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.46.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.47.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.47.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.47.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.48.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.48.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.48.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.49.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.49.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.49.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.50.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.50.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.50.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.51.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.51.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.51.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.52.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.52.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.52.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.53.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.53.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.53.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.54.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.54.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.54.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.55.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.55.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.55.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.56.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.56.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.56.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.57.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.57.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.57.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.58.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.58.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.58.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.59.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.59.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.59.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.60.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.60.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.60.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.61.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.61.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.61.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.62.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.62.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.62.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.63.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.63.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.63.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.64.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.64.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.64.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.65.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.65.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.65.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.66.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.66.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.66.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.67.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.67.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.67.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.68.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.68.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.68.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.69.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.69.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.69.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.70.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.70.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.70.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.71.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.71.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.71.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.72.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.72.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.72.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.73.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.73.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.73.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.74.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.74.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.74.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.75.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.75.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.75.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.76.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.76.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.76.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.77.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.77.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.77.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.78.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.78.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.78.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.79.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.79.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.79.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.80.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.80.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.80.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.81.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.81.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.81.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.82.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.82.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.82.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.83.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.83.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.83.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.84.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.84.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.84.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.85.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.85.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.85.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.86.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.86.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.86.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.87.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.87.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.87.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.88.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.88.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.88.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.89.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.89.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.89.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.90.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.90.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.90.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.91.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.91.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.91.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.92.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.92.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.92.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.93.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.93.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.93.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.94.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.94.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.94.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.95.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.95.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.95.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.96.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.96.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.96.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.97.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.97.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.97.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.98.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.98.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.98.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.99.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.99.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.99.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.100.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.100.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.100.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.101.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.101.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.101.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.102.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.102.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.102.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.103.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.103.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.103.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.104.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.104.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.104.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.105.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.105.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.105.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.106.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.106.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.106.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.107.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.107.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.107.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.108.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.108.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.108.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.109.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.109.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.109.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.110.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.110.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.110.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.111.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.111.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.111.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.112.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.112.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.112.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.113.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.113.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.113.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.114.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.114.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.114.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.115.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.115.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.115.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.116.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.116.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.116.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.117.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.117.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.117.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.118.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.118.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.118.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.119.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.119.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.119.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.120.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.120.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.120.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.121.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.121.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.121.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.122.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.122.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.122.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.123.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.123.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.123.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.124.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.124.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.124.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.125.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.125.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.125.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.126.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.126.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.126.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.127.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.127.up_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.127.down_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.128.gate_proj.weight": "model-00014-of-000055.safetensors", + "model.layers.15.mlp.experts.128.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.128.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.129.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.129.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.129.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.130.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.130.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.130.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.131.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.131.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.131.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.132.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.132.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.132.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.133.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.133.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.133.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.134.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.134.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.134.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.135.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.135.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.135.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.136.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.136.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.136.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.137.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.137.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.137.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.138.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.138.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.138.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.139.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.139.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.139.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.140.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.140.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.140.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.141.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.141.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.141.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.142.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.142.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.142.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.143.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.143.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.143.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.144.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.144.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.144.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.145.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.145.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.145.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.146.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.146.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.146.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.147.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.147.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.147.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.148.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.148.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.148.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.149.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.149.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.149.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.150.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.150.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.150.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.151.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.151.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.151.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.152.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.152.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.152.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.153.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.153.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.153.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.154.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.154.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.154.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.155.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.155.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.155.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.156.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.156.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.156.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.157.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.157.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.157.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.158.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.158.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.158.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.159.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.159.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.mlp.experts.159.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.15.input_layernorm.weight": "model-00015-of-000055.safetensors", + "model.layers.15.post_attention_layernorm.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.q_a_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.q_a_layernorm.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.q_b_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.kv_a_proj_with_mqa.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.kv_a_layernorm.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.kv_b_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.self_attn.o_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.gate.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.shared_experts.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.shared_experts.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.shared_experts.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.0.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.0.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.0.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.1.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.1.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.1.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.2.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.2.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.2.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.3.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.3.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.3.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.4.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.4.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.4.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.5.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.5.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.5.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.6.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.6.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.6.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.7.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.7.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.7.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.8.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.8.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.8.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.9.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.9.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.9.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.10.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.10.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.10.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.11.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.11.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.11.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.12.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.12.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.12.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.13.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.13.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.13.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.14.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.14.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.14.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.15.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.15.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.15.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.16.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.16.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.16.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.17.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.17.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.17.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.18.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.18.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.18.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.19.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.19.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.19.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.20.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.20.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.20.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.21.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.21.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.21.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.22.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.22.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.22.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.23.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.23.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.23.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.24.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.24.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.24.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.25.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.25.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.25.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.26.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.26.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.26.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.27.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.27.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.27.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.28.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.28.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.28.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.29.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.29.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.29.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.30.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.30.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.30.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.31.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.31.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.31.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.32.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.32.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.32.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.33.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.33.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.33.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.34.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.34.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.34.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.35.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.35.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.35.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.36.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.36.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.36.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.37.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.37.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.37.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.38.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.38.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.38.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.39.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.39.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.39.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.40.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.40.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.40.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.41.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.41.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.41.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.42.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.42.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.42.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.43.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.43.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.43.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.44.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.44.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.44.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.45.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.45.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.45.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.46.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.46.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.46.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.47.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.47.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.47.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.48.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.48.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.48.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.49.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.49.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.49.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.50.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.50.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.50.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.51.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.51.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.51.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.52.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.52.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.52.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.53.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.53.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.53.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.54.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.54.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.54.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.55.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.55.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.55.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.56.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.56.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.56.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.57.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.57.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.57.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.58.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.58.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.58.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.59.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.59.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.59.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.60.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.60.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.60.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.61.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.61.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.61.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.62.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.62.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.62.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.63.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.63.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.63.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.64.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.64.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.64.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.65.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.65.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.65.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.66.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.66.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.66.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.67.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.67.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.67.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.68.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.68.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.68.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.69.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.69.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.69.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.70.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.70.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.70.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.71.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.71.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.71.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.72.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.72.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.72.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.73.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.73.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.73.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.74.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.74.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.74.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.75.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.75.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.75.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.76.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.76.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.76.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.77.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.77.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.77.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.78.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.78.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.78.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.79.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.79.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.79.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.80.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.80.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.80.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.81.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.81.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.81.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.82.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.82.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.82.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.83.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.83.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.83.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.84.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.84.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.84.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.85.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.85.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.85.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.86.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.86.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.86.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.87.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.87.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.87.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.88.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.88.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.88.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.89.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.89.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.89.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.90.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.90.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.90.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.91.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.91.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.91.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.92.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.92.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.92.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.93.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.93.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.93.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.94.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.94.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.94.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.95.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.95.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.95.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.96.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.96.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.96.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.97.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.97.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.97.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.98.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.98.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.98.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.99.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.99.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.99.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.100.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.100.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.100.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.101.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.101.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.101.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.102.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.102.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.102.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.103.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.103.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.103.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.104.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.104.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.104.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.105.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.105.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.105.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.106.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.106.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.106.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.107.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.107.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.107.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.108.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.108.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.108.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.109.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.109.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.109.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.110.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.110.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.110.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.111.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.111.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.111.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.112.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.112.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.112.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.113.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.113.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.113.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.114.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.114.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.114.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.115.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.115.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.115.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.116.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.116.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.116.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.117.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.117.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.117.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.118.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.118.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.118.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.119.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.119.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.119.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.120.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.120.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.120.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.121.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.121.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.121.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.122.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.122.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.122.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.123.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.123.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.123.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.124.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.124.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.124.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.125.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.125.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.125.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.126.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.126.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.126.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.127.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.127.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.127.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.128.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.128.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.128.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.129.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.129.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.129.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.130.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.130.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.130.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.131.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.131.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.131.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.132.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.132.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.132.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.133.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.133.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.133.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.134.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.134.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.134.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.135.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.135.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.135.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.136.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.136.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.136.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.137.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.137.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.137.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.138.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.138.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.138.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.139.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.139.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.139.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.140.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.140.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.140.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.141.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.141.up_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.141.down_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.142.gate_proj.weight": "model-00015-of-000055.safetensors", + "model.layers.16.mlp.experts.142.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.142.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.143.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.143.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.143.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.144.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.144.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.144.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.145.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.145.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.145.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.146.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.146.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.146.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.147.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.147.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.147.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.148.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.148.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.148.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.149.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.149.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.149.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.150.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.150.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.150.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.151.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.151.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.151.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.152.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.152.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.152.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.153.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.153.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.153.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.154.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.154.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.154.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.155.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.155.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.155.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.156.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.156.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.156.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.157.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.157.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.157.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.158.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.158.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.158.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.159.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.159.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.mlp.experts.159.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.16.input_layernorm.weight": "model-00016-of-000055.safetensors", + "model.layers.16.post_attention_layernorm.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.q_a_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.q_a_layernorm.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.q_b_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.kv_a_proj_with_mqa.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.kv_a_layernorm.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.kv_b_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.self_attn.o_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.gate.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.shared_experts.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.shared_experts.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.shared_experts.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.0.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.0.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.0.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.1.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.1.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.1.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.2.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.2.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.2.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.3.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.3.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.3.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.4.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.4.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.4.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.5.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.5.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.5.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.6.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.6.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.6.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.7.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.7.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.7.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.8.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.8.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.8.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.9.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.9.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.9.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.10.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.10.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.10.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.11.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.11.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.11.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.12.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.12.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.12.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.13.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.13.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.13.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.14.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.14.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.14.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.15.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.15.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.15.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.16.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.16.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.16.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.17.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.17.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.17.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.18.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.18.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.18.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.19.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.19.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.19.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.20.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.20.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.20.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.21.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.21.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.21.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.22.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.22.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.22.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.23.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.23.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.23.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.24.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.24.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.24.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.25.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.25.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.25.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.26.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.26.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.26.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.27.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.27.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.27.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.28.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.28.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.28.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.29.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.29.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.29.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.30.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.30.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.30.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.31.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.31.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.31.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.32.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.32.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.32.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.33.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.33.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.33.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.34.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.34.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.34.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.35.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.35.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.35.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.36.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.36.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.36.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.37.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.37.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.37.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.38.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.38.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.38.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.39.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.39.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.39.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.40.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.40.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.40.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.41.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.41.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.41.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.42.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.42.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.42.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.43.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.43.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.43.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.44.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.44.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.44.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.45.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.45.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.45.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.46.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.46.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.46.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.47.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.47.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.47.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.48.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.48.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.48.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.49.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.49.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.49.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.50.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.50.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.50.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.51.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.51.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.51.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.52.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.52.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.52.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.53.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.53.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.53.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.54.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.54.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.54.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.55.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.55.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.55.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.56.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.56.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.56.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.57.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.57.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.57.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.58.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.58.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.58.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.59.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.59.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.59.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.60.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.60.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.60.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.61.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.61.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.61.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.62.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.62.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.62.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.63.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.63.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.63.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.64.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.64.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.64.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.65.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.65.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.65.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.66.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.66.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.66.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.67.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.67.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.67.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.68.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.68.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.68.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.69.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.69.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.69.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.70.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.70.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.70.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.71.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.71.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.71.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.72.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.72.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.72.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.73.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.73.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.73.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.74.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.74.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.74.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.75.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.75.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.75.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.76.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.76.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.76.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.77.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.77.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.77.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.78.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.78.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.78.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.79.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.79.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.79.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.80.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.80.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.80.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.81.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.81.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.81.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.82.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.82.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.82.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.83.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.83.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.83.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.84.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.84.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.84.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.85.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.85.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.85.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.86.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.86.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.86.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.87.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.87.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.87.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.88.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.88.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.88.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.89.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.89.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.89.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.90.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.90.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.90.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.91.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.91.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.91.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.92.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.92.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.92.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.93.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.93.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.93.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.94.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.94.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.94.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.95.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.95.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.95.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.96.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.96.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.96.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.97.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.97.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.97.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.98.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.98.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.98.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.99.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.99.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.99.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.100.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.100.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.100.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.101.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.101.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.101.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.102.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.102.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.102.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.103.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.103.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.103.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.104.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.104.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.104.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.105.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.105.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.105.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.106.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.106.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.106.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.107.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.107.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.107.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.108.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.108.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.108.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.109.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.109.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.109.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.110.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.110.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.110.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.111.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.111.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.111.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.112.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.112.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.112.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.113.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.113.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.113.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.114.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.114.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.114.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.115.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.115.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.115.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.116.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.116.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.116.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.117.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.117.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.117.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.118.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.118.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.118.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.119.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.119.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.119.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.120.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.120.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.120.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.121.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.121.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.121.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.122.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.122.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.122.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.123.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.123.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.123.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.124.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.124.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.124.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.125.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.125.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.125.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.126.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.126.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.126.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.127.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.127.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.127.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.128.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.128.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.128.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.129.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.129.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.129.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.130.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.130.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.130.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.131.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.131.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.131.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.132.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.132.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.132.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.133.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.133.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.133.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.134.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.134.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.134.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.135.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.135.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.135.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.136.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.136.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.136.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.137.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.137.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.137.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.138.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.138.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.138.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.139.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.139.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.139.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.140.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.140.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.140.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.141.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.141.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.141.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.142.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.142.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.142.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.143.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.143.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.143.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.144.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.144.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.144.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.145.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.145.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.145.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.146.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.146.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.146.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.147.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.147.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.147.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.148.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.148.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.148.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.149.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.149.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.149.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.150.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.150.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.150.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.151.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.151.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.151.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.152.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.152.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.152.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.153.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.153.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.153.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.154.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.154.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.154.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.155.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.155.up_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.155.down_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.156.gate_proj.weight": "model-00016-of-000055.safetensors", + "model.layers.17.mlp.experts.156.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.156.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.157.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.157.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.157.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.158.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.158.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.158.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.159.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.159.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.mlp.experts.159.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.17.input_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.17.post_attention_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.q_a_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.q_a_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.q_b_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.kv_a_proj_with_mqa.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.kv_a_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.kv_b_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.self_attn.o_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.gate.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.shared_experts.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.shared_experts.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.shared_experts.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.0.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.0.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.0.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.1.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.1.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.1.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.2.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.2.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.2.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.3.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.3.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.3.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.4.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.4.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.4.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.5.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.5.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.5.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.6.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.6.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.6.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.7.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.7.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.7.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.8.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.8.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.8.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.9.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.9.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.9.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.10.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.10.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.10.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.11.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.11.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.11.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.12.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.12.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.12.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.13.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.13.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.13.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.14.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.14.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.14.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.15.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.15.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.15.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.16.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.16.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.16.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.17.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.17.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.17.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.18.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.18.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.18.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.19.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.19.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.19.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.20.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.20.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.20.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.21.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.21.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.21.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.22.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.22.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.22.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.23.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.23.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.23.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.24.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.24.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.24.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.25.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.25.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.25.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.26.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.26.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.26.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.27.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.27.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.27.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.28.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.28.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.28.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.29.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.29.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.29.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.30.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.30.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.30.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.31.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.31.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.31.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.32.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.32.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.32.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.33.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.33.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.33.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.34.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.34.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.34.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.35.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.35.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.35.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.36.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.36.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.36.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.37.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.37.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.37.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.38.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.38.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.38.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.39.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.39.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.39.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.40.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.40.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.40.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.41.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.41.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.41.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.42.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.42.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.42.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.43.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.43.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.43.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.44.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.44.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.44.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.45.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.45.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.45.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.46.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.46.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.46.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.47.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.47.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.47.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.48.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.48.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.48.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.49.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.49.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.49.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.50.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.50.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.50.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.51.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.51.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.51.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.52.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.52.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.52.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.53.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.53.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.53.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.54.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.54.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.54.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.55.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.55.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.55.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.56.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.56.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.56.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.57.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.57.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.57.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.58.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.58.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.58.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.59.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.59.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.59.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.60.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.60.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.60.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.61.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.61.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.61.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.62.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.62.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.62.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.63.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.63.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.63.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.64.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.64.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.64.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.65.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.65.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.65.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.66.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.66.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.66.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.67.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.67.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.67.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.68.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.68.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.68.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.69.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.69.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.69.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.70.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.70.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.70.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.71.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.71.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.71.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.72.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.72.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.72.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.73.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.73.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.73.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.74.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.74.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.74.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.75.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.75.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.75.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.76.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.76.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.76.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.77.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.77.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.77.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.78.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.78.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.78.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.79.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.79.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.79.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.80.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.80.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.80.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.81.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.81.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.81.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.82.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.82.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.82.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.83.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.83.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.83.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.84.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.84.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.84.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.85.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.85.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.85.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.86.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.86.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.86.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.87.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.87.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.87.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.88.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.88.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.88.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.89.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.89.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.89.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.90.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.90.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.90.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.91.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.91.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.91.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.92.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.92.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.92.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.93.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.93.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.93.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.94.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.94.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.94.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.95.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.95.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.95.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.96.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.96.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.96.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.97.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.97.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.97.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.98.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.98.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.98.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.99.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.99.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.99.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.100.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.100.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.100.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.101.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.101.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.101.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.102.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.102.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.102.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.103.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.103.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.103.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.104.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.104.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.104.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.105.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.105.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.105.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.106.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.106.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.106.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.107.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.107.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.107.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.108.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.108.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.108.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.109.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.109.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.109.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.110.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.110.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.110.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.111.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.111.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.111.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.112.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.112.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.112.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.113.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.113.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.113.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.114.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.114.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.114.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.115.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.115.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.115.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.116.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.116.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.116.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.117.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.117.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.117.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.118.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.118.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.118.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.119.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.119.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.119.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.120.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.120.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.120.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.121.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.121.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.121.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.122.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.122.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.122.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.123.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.123.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.123.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.124.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.124.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.124.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.125.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.125.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.125.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.126.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.126.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.126.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.127.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.127.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.127.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.128.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.128.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.128.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.129.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.129.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.129.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.130.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.130.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.130.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.131.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.131.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.131.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.132.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.132.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.132.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.133.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.133.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.133.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.134.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.134.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.134.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.135.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.135.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.135.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.136.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.136.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.136.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.137.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.137.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.137.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.138.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.138.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.138.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.139.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.139.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.139.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.140.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.140.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.140.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.141.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.141.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.141.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.142.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.142.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.142.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.143.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.143.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.143.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.144.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.144.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.144.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.145.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.145.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.145.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.146.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.146.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.146.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.147.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.147.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.147.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.148.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.148.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.148.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.149.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.149.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.149.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.150.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.150.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.150.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.151.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.151.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.151.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.152.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.152.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.152.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.153.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.153.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.153.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.154.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.154.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.154.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.155.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.155.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.155.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.156.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.156.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.156.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.157.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.157.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.157.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.158.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.158.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.158.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.159.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.159.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.mlp.experts.159.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.18.input_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.18.post_attention_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.q_a_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.q_a_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.q_b_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.kv_a_proj_with_mqa.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.kv_a_layernorm.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.kv_b_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.self_attn.o_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.gate.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.shared_experts.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.shared_experts.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.shared_experts.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.0.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.0.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.0.down_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.1.gate_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.1.up_proj.weight": "model-00017-of-000055.safetensors", + "model.layers.19.mlp.experts.1.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.2.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.2.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.2.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.3.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.3.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.3.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.4.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.4.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.4.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.5.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.5.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.5.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.6.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.6.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.6.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.7.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.7.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.7.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.8.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.8.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.8.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.9.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.9.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.9.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.10.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.10.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.10.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.11.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.11.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.11.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.12.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.12.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.12.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.13.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.13.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.13.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.14.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.14.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.14.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.15.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.15.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.15.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.16.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.16.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.16.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.17.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.17.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.17.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.18.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.18.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.18.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.19.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.19.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.19.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.20.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.20.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.20.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.21.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.21.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.21.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.22.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.22.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.22.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.23.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.23.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.23.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.24.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.24.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.24.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.25.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.25.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.25.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.26.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.26.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.26.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.27.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.27.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.27.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.28.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.28.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.28.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.29.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.29.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.29.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.30.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.30.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.30.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.31.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.31.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.31.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.32.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.32.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.32.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.33.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.33.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.33.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.34.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.34.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.34.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.35.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.35.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.35.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.36.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.36.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.36.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.37.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.37.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.37.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.38.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.38.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.38.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.39.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.39.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.39.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.40.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.40.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.40.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.41.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.41.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.41.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.42.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.42.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.42.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.43.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.43.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.43.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.44.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.44.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.44.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.45.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.45.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.45.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.46.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.46.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.46.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.47.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.47.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.47.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.48.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.48.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.48.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.49.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.49.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.49.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.50.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.50.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.50.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.51.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.51.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.51.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.52.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.52.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.52.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.53.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.53.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.53.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.54.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.54.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.54.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.55.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.55.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.55.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.56.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.56.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.56.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.57.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.57.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.57.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.58.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.58.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.58.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.59.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.59.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.59.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.60.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.60.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.60.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.61.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.61.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.61.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.62.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.62.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.62.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.63.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.63.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.63.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.64.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.64.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.64.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.65.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.65.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.65.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.66.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.66.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.66.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.67.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.67.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.67.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.68.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.68.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.68.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.69.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.69.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.69.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.70.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.70.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.70.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.71.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.71.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.71.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.72.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.72.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.72.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.73.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.73.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.73.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.74.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.74.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.74.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.75.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.75.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.75.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.76.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.76.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.76.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.77.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.77.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.77.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.78.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.78.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.78.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.79.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.79.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.79.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.80.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.80.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.80.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.81.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.81.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.81.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.82.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.82.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.82.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.83.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.83.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.83.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.84.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.84.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.84.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.85.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.85.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.85.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.86.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.86.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.86.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.87.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.87.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.87.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.88.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.88.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.88.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.89.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.89.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.89.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.90.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.90.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.90.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.91.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.91.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.91.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.92.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.92.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.92.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.93.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.93.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.93.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.94.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.94.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.94.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.95.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.95.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.95.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.96.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.96.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.96.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.97.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.97.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.97.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.98.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.98.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.98.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.99.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.99.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.99.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.100.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.100.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.100.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.101.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.101.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.101.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.102.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.102.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.102.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.103.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.103.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.103.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.104.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.104.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.104.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.105.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.105.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.105.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.106.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.106.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.106.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.107.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.107.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.107.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.108.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.108.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.108.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.109.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.109.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.109.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.110.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.110.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.110.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.111.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.111.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.111.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.112.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.112.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.112.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.113.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.113.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.113.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.114.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.114.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.114.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.115.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.115.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.115.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.116.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.116.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.116.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.117.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.117.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.117.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.118.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.118.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.118.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.119.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.119.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.119.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.120.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.120.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.120.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.121.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.121.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.121.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.122.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.122.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.122.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.123.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.123.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.123.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.124.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.124.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.124.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.125.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.125.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.125.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.126.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.126.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.126.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.127.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.127.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.127.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.128.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.128.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.128.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.129.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.129.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.129.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.130.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.130.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.130.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.131.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.131.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.131.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.132.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.132.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.132.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.133.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.133.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.133.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.134.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.134.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.134.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.135.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.135.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.135.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.136.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.136.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.136.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.137.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.137.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.137.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.138.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.138.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.138.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.139.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.139.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.139.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.140.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.140.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.140.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.141.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.141.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.141.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.142.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.142.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.142.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.143.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.143.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.143.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.144.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.144.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.144.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.145.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.145.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.145.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.146.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.146.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.146.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.147.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.147.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.147.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.148.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.148.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.148.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.149.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.149.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.149.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.150.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.150.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.150.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.151.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.151.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.151.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.152.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.152.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.152.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.153.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.153.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.153.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.154.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.154.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.154.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.155.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.155.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.155.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.156.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.156.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.156.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.157.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.157.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.157.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.158.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.158.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.158.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.159.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.159.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.mlp.experts.159.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.19.input_layernorm.weight": "model-00018-of-000055.safetensors", + "model.layers.19.post_attention_layernorm.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.q_a_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.q_a_layernorm.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.q_b_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.kv_a_proj_with_mqa.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.kv_a_layernorm.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.kv_b_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.self_attn.o_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.gate.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.shared_experts.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.shared_experts.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.shared_experts.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.0.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.0.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.0.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.1.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.1.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.1.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.2.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.2.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.2.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.3.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.3.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.3.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.4.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.4.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.4.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.5.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.5.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.5.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.6.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.6.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.6.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.7.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.7.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.7.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.8.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.8.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.8.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.9.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.9.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.9.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.10.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.10.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.10.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.11.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.11.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.11.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.12.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.12.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.12.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.13.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.13.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.13.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.14.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.14.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.14.down_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.15.gate_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.15.up_proj.weight": "model-00018-of-000055.safetensors", + "model.layers.20.mlp.experts.15.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.16.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.16.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.16.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.17.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.17.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.17.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.18.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.18.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.18.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.19.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.19.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.19.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.20.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.20.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.20.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.21.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.21.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.21.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.22.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.22.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.22.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.23.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.23.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.23.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.24.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.24.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.24.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.25.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.25.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.25.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.26.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.26.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.26.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.27.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.27.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.27.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.28.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.28.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.28.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.29.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.29.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.29.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.30.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.30.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.30.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.31.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.31.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.31.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.32.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.32.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.32.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.33.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.33.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.33.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.34.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.34.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.34.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.35.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.35.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.35.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.36.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.36.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.36.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.37.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.37.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.37.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.38.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.38.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.38.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.39.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.39.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.39.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.40.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.40.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.40.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.41.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.41.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.41.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.42.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.42.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.42.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.43.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.43.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.43.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.44.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.44.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.44.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.45.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.45.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.45.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.46.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.46.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.46.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.47.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.47.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.47.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.48.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.48.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.48.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.49.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.49.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.49.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.50.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.50.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.50.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.51.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.51.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.51.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.52.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.52.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.52.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.53.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.53.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.53.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.54.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.54.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.54.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.55.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.55.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.55.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.56.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.56.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.56.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.57.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.57.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.57.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.58.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.58.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.58.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.59.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.59.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.59.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.60.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.60.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.60.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.61.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.61.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.61.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.62.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.62.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.62.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.63.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.63.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.63.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.64.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.64.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.64.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.65.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.65.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.65.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.66.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.66.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.66.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.67.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.67.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.67.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.68.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.68.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.68.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.69.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.69.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.69.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.70.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.70.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.70.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.71.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.71.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.71.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.72.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.72.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.72.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.73.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.73.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.73.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.74.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.74.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.74.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.75.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.75.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.75.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.76.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.76.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.76.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.77.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.77.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.77.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.78.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.78.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.78.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.79.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.79.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.79.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.80.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.80.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.80.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.81.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.81.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.81.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.82.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.82.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.82.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.83.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.83.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.83.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.84.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.84.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.84.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.85.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.85.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.85.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.86.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.86.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.86.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.87.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.87.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.87.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.88.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.88.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.88.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.89.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.89.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.89.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.90.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.90.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.90.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.91.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.91.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.91.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.92.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.92.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.92.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.93.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.93.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.93.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.94.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.94.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.94.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.95.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.95.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.95.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.96.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.96.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.96.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.97.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.97.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.97.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.98.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.98.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.98.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.99.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.99.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.99.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.100.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.100.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.100.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.101.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.101.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.101.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.102.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.102.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.102.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.103.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.103.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.103.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.104.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.104.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.104.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.105.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.105.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.105.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.106.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.106.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.106.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.107.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.107.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.107.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.108.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.108.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.108.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.109.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.109.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.109.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.110.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.110.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.110.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.111.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.111.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.111.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.112.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.112.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.112.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.113.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.113.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.113.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.114.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.114.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.114.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.115.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.115.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.115.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.116.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.116.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.116.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.117.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.117.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.117.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.118.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.118.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.118.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.119.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.119.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.119.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.120.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.120.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.120.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.121.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.121.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.121.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.122.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.122.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.122.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.123.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.123.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.123.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.124.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.124.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.124.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.125.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.125.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.125.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.126.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.126.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.126.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.127.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.127.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.127.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.128.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.128.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.128.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.129.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.129.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.129.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.130.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.130.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.130.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.131.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.131.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.131.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.132.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.132.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.132.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.133.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.133.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.133.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.134.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.134.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.134.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.135.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.135.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.135.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.136.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.136.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.136.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.137.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.137.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.137.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.138.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.138.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.138.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.139.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.139.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.139.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.140.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.140.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.140.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.141.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.141.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.141.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.142.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.142.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.142.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.143.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.143.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.143.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.144.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.144.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.144.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.145.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.145.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.145.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.146.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.146.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.146.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.147.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.147.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.147.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.148.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.148.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.148.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.149.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.149.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.149.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.150.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.150.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.150.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.151.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.151.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.151.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.152.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.152.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.152.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.153.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.153.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.153.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.154.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.154.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.154.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.155.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.155.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.155.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.156.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.156.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.156.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.157.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.157.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.157.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.158.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.158.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.158.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.159.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.159.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.mlp.experts.159.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.20.input_layernorm.weight": "model-00019-of-000055.safetensors", + "model.layers.20.post_attention_layernorm.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.q_a_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.q_a_layernorm.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.q_b_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.kv_a_proj_with_mqa.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.kv_a_layernorm.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.kv_b_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.self_attn.o_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.gate.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.shared_experts.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.shared_experts.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.shared_experts.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.0.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.0.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.0.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.1.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.1.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.1.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.2.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.2.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.2.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.3.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.3.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.3.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.4.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.4.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.4.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.5.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.5.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.5.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.6.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.6.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.6.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.7.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.7.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.7.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.8.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.8.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.8.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.9.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.9.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.9.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.10.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.10.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.10.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.11.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.11.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.11.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.12.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.12.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.12.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.13.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.13.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.13.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.14.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.14.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.14.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.15.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.15.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.15.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.16.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.16.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.16.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.17.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.17.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.17.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.18.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.18.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.18.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.19.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.19.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.19.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.20.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.20.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.20.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.21.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.21.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.21.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.22.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.22.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.22.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.23.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.23.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.23.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.24.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.24.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.24.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.25.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.25.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.25.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.26.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.26.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.26.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.27.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.27.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.27.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.28.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.28.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.28.down_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.29.gate_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.29.up_proj.weight": "model-00019-of-000055.safetensors", + "model.layers.21.mlp.experts.29.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.30.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.30.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.30.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.31.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.31.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.31.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.32.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.32.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.32.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.33.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.33.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.33.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.34.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.34.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.34.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.35.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.35.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.35.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.36.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.36.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.36.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.37.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.37.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.37.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.38.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.38.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.38.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.39.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.39.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.39.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.40.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.40.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.40.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.41.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.41.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.41.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.42.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.42.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.42.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.43.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.43.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.43.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.44.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.44.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.44.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.45.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.45.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.45.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.46.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.46.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.46.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.47.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.47.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.47.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.48.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.48.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.48.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.49.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.49.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.49.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.50.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.50.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.50.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.51.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.51.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.51.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.52.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.52.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.52.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.53.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.53.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.53.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.54.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.54.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.54.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.55.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.55.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.55.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.56.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.56.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.56.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.57.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.57.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.57.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.58.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.58.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.58.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.59.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.59.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.59.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.60.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.60.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.60.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.61.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.61.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.61.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.62.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.62.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.62.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.63.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.63.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.63.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.64.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.64.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.64.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.65.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.65.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.65.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.66.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.66.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.66.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.67.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.67.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.67.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.68.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.68.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.68.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.69.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.69.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.69.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.70.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.70.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.70.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.71.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.71.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.71.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.72.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.72.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.72.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.73.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.73.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.73.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.74.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.74.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.74.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.75.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.75.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.75.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.76.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.76.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.76.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.77.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.77.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.77.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.78.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.78.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.78.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.79.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.79.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.79.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.80.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.80.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.80.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.81.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.81.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.81.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.82.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.82.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.82.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.83.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.83.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.83.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.84.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.84.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.84.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.85.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.85.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.85.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.86.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.86.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.86.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.87.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.87.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.87.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.88.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.88.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.88.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.89.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.89.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.89.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.90.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.90.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.90.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.91.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.91.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.91.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.92.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.92.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.92.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.93.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.93.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.93.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.94.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.94.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.94.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.95.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.95.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.95.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.96.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.96.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.96.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.97.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.97.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.97.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.98.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.98.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.98.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.99.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.99.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.99.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.100.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.100.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.100.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.101.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.101.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.101.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.102.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.102.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.102.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.103.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.103.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.103.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.104.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.104.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.104.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.105.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.105.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.105.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.106.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.106.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.106.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.107.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.107.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.107.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.108.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.108.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.108.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.109.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.109.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.109.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.110.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.110.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.110.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.111.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.111.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.111.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.112.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.112.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.112.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.113.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.113.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.113.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.114.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.114.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.114.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.115.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.115.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.115.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.116.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.116.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.116.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.117.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.117.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.117.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.118.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.118.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.118.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.119.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.119.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.119.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.120.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.120.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.120.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.121.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.121.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.121.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.122.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.122.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.122.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.123.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.123.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.123.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.124.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.124.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.124.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.125.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.125.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.125.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.126.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.126.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.126.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.127.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.127.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.127.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.128.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.128.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.128.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.129.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.129.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.129.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.130.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.130.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.130.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.131.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.131.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.131.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.132.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.132.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.132.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.133.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.133.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.133.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.134.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.134.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.134.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.135.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.135.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.135.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.136.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.136.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.136.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.137.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.137.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.137.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.138.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.138.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.138.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.139.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.139.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.139.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.140.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.140.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.140.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.141.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.141.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.141.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.142.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.142.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.142.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.143.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.143.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.143.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.144.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.144.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.144.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.145.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.145.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.145.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.146.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.146.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.146.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.147.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.147.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.147.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.148.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.148.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.148.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.149.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.149.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.149.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.150.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.150.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.150.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.151.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.151.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.151.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.152.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.152.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.152.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.153.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.153.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.153.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.154.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.154.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.154.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.155.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.155.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.155.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.156.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.156.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.156.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.157.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.157.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.157.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.158.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.158.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.158.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.159.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.159.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.mlp.experts.159.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.21.input_layernorm.weight": "model-00020-of-000055.safetensors", + "model.layers.21.post_attention_layernorm.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.q_a_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.q_a_layernorm.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.q_b_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.kv_a_proj_with_mqa.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.kv_a_layernorm.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.kv_b_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.self_attn.o_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.gate.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.shared_experts.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.shared_experts.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.shared_experts.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.0.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.0.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.0.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.1.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.1.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.1.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.2.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.2.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.2.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.3.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.3.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.3.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.4.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.4.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.4.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.5.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.5.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.5.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.6.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.6.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.6.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.7.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.7.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.7.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.8.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.8.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.8.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.9.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.9.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.9.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.10.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.10.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.10.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.11.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.11.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.11.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.12.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.12.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.12.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.13.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.13.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.13.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.14.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.14.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.14.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.15.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.15.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.15.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.16.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.16.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.16.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.17.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.17.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.17.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.18.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.18.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.18.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.19.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.19.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.19.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.20.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.20.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.20.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.21.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.21.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.21.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.22.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.22.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.22.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.23.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.23.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.23.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.24.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.24.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.24.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.25.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.25.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.25.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.26.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.26.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.26.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.27.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.27.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.27.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.28.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.28.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.28.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.29.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.29.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.29.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.30.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.30.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.30.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.31.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.31.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.31.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.32.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.32.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.32.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.33.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.33.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.33.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.34.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.34.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.34.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.35.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.35.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.35.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.36.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.36.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.36.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.37.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.37.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.37.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.38.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.38.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.38.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.39.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.39.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.39.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.40.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.40.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.40.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.41.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.41.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.41.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.42.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.42.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.42.down_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.43.gate_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.43.up_proj.weight": "model-00020-of-000055.safetensors", + "model.layers.22.mlp.experts.43.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.44.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.44.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.44.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.45.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.45.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.45.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.46.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.46.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.46.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.47.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.47.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.47.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.48.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.48.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.48.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.49.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.49.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.49.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.50.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.50.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.50.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.51.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.51.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.51.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.52.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.52.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.52.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.53.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.53.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.53.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.54.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.54.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.54.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.55.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.55.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.55.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.56.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.56.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.56.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.57.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.57.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.57.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.58.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.58.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.58.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.59.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.59.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.59.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.60.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.60.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.60.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.61.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.61.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.61.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.62.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.62.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.62.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.63.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.63.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.63.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.64.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.64.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.64.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.65.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.65.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.65.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.66.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.66.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.66.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.67.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.67.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.67.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.68.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.68.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.68.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.69.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.69.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.69.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.70.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.70.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.70.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.71.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.71.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.71.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.72.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.72.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.72.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.73.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.73.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.73.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.74.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.74.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.74.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.75.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.75.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.75.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.76.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.76.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.76.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.77.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.77.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.77.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.78.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.78.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.78.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.79.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.79.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.79.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.80.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.80.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.80.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.81.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.81.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.81.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.82.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.82.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.82.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.83.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.83.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.83.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.84.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.84.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.84.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.85.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.85.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.85.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.86.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.86.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.86.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.87.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.87.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.87.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.88.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.88.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.88.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.89.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.89.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.89.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.90.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.90.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.90.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.91.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.91.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.91.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.92.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.92.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.92.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.93.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.93.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.93.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.94.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.94.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.94.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.95.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.95.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.95.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.96.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.96.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.96.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.97.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.97.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.97.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.98.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.98.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.98.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.99.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.99.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.99.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.100.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.100.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.100.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.101.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.101.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.101.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.102.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.102.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.102.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.103.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.103.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.103.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.104.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.104.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.104.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.105.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.105.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.105.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.106.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.106.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.106.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.107.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.107.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.107.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.108.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.108.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.108.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.109.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.109.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.109.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.110.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.110.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.110.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.111.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.111.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.111.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.112.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.112.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.112.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.113.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.113.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.113.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.114.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.114.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.114.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.115.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.115.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.115.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.116.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.116.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.116.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.117.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.117.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.117.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.118.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.118.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.118.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.119.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.119.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.119.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.120.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.120.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.120.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.121.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.121.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.121.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.122.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.122.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.122.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.123.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.123.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.123.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.124.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.124.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.124.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.125.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.125.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.125.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.126.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.126.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.126.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.127.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.127.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.127.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.128.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.128.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.128.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.129.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.129.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.129.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.130.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.130.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.130.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.131.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.131.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.131.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.132.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.132.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.132.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.133.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.133.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.133.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.134.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.134.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.134.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.135.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.135.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.135.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.136.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.136.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.136.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.137.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.137.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.137.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.138.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.138.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.138.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.139.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.139.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.139.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.140.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.140.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.140.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.141.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.141.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.141.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.142.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.142.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.142.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.143.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.143.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.143.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.144.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.144.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.144.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.145.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.145.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.145.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.146.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.146.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.146.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.147.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.147.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.147.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.148.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.148.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.148.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.149.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.149.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.149.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.150.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.150.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.150.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.151.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.151.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.151.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.152.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.152.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.152.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.153.gate_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.153.up_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22.mlp.experts.153.down_proj.weight": "model-00021-of-000055.safetensors", + "model.layers.22