--- license: apache-2.0 tags: - merge - mergekit - lazymergekit - mistralai/Mixtral-8x7B-Instruct-v0.1 - openai/whisper-large-v3 --- # rawan rawan is a merge of the following models using [mergekit](https://github.com/cg123/mergekit): * [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) * [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) ## 🧩 Configuration ```yaml slices: - sources: - model: mistralai/Mixtral-8x7B-Instruct-v0.1 layer_range: [0, 32] - model: openai/whisper-large-v3 layer_range: [0, 32] merge_method: slerp base_model: openai/whisper-large-v3 parameters: t: - filter: self_attn value: [0, 0.5, 0.3, 0.7, 1] - filter: mlp value: [1, 0.5, 0.7, 0.3, 0] - value: 0.5 dtype: bfloat16 ```