---
license: apache-2.0
tags:
- moe
- merge
- mergekit
- lazymergekit
- cognitivecomputations/dolphin-2_6-phi-2
- lxuechen/phi-2-dpo
---

![](https://i.imgur.com/UOb2fvh.jpg)

# phixtral-2x2.8

phixtral-2x2.8 is a Mixture of Experts (MoE) model made from the following models, using a custom version of mergekit:
* [cognitivecomputations/dolphin-2_6-phi-2](https://huggingface.co/cognitivecomputations/dolphin-2_6-phi-2)
* [lxuechen/phi-2-dpo](https://huggingface.co/lxuechen/phi-2-dpo)

## 🧩 Configuration

```yaml
base_model: cognitivecomputations/dolphin-2_6-phi-2
gate_mode: cheap_embed
experts:
  - source_model: cognitivecomputations/dolphin-2_6-phi-2
    positive_prompts: [""]
  - source_model: lxuechen/phi-2-dpo
    positive_prompts: [""]
```
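
For reference, a config like this is normally passed to mergekit's MoE entry point. Below is a minimal sketch that shells out to the public `mergekit-moe` CLI; the merge above used a custom version of mergekit, so the exact command and flags may differ.

```python
# Sketch only: run the MoE merge with the public mergekit-moe CLI
# (pip install mergekit). The actual merge used a custom fork, so the
# invocation may differ.
import subprocess

subprocess.run(
    ["mergekit-moe", "config.yaml", "phixtral-2x2.8"],  # <config> <output dir>
    check=True,
)
```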

## 💻 Usage

This architecture is not yet supported by the transformers library, so the model can't be loaded or run with it as-is. I'm working on custom code to run it; contact me if you're interested!
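
In the meantime, here is a sketch of what loading should look like once custom modeling code is added to the repo, using the standard `trust_remote_code` pattern from transformers. This is an assumption about the eventual interface, not working code today, and the repo id is a placeholder.

```python
# Hypothetical loading sketch: assumes future custom modeling code in this
# repo exposes the MoE architecture via trust_remote_code. Not functional yet.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/phixtral-2x2.8"  # placeholder: replace with the real repo id

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    trust_remote_code=True,  # required to load the custom MoE classes
)

# The best prompt format depends on the expert models' own templates.
inputs = tokenizer("Write a haiku about merging models.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```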