LeroyDyer/Mixtral_AI_Cyber_Dolphin_2.0_7b
This Expert is a companon to the MEGA_MIND 24b CyberSeries represents a groundbreaking leap in the realm of language models, integrating a diverse array of expert models into a unified framework. At its core lies the Mistral-7B-Instruct-v0.2, a refined instructional model designed for versatility and efficiency.
Enhanced with an expanded context window and advanced routing mechanisms, the Mistral-7B-Instruct-v0.2 exemplifies the power of Mixture of Experts, allowing seamless integration of specialized sub-models. This architecture facilitates unparalleled performance and scalability, enabling the CyberSeries to tackle a myriad of tasks with unparalleled speed and accuracy.
Among its illustrious sub-models, the OpenOrca - Mistral-7B-8k shines as a testament to fine-tuning excellence, boasting top-ranking performance in its class. Meanwhile, the Hermes 2 Pro introduces cutting-edge capabilities such as Function Calling and JSON Mode, catering to diverse application needs.
Driven by Reinforcement Learning from AI Feedback, the Starling-LM-7B-beta demonstrates remarkable adaptability and optimization, while the Phi-1.5 Transformer model stands as a beacon of excellence across various domains, from common sense reasoning to medical inference.
With models like BioMistral tailored specifically for medical applications and Nous-Yarn-Mistral-7b-128k excelling in handling long-context data, the MEGA_MIND 24b CyberSeries emerges as a transformative force in the landscape of language understanding and artificial intelligence.
Experience the future of language models with the MEGA_MIND 24b CyberSeries, where innovation meets performance, and possibilities are limitless.
Models Merged
The following models were included in the merge:
- liminerity/Mistral-quiet-star-ascii-demo
- invalid-coder/dolphin-2.1-mistral-7b-snr-math-laser
- liminerity/Mistral-quiet-star-demo
- Crystalcareai/Mistral-Evol-Coder
- cognitivecomputations/dolphin-2.8-mistral-7b-v02
Configuration
The following YAML configuration was used to produce this model:
models:
- model: liminerity/Mistral-quiet-star-ascii-demo
parameters:
density: [0.256, 0.512, 0.128] # density gradient
weight: 0.382
- model: cognitivecomputations/dolphin-2.8-mistral-7b-v02
parameters:
density: 0.382
weight: [0.256, 0.128, 0.256, 0.128] # weight gradient
- model: invalid-coder/dolphin-2.1-mistral-7b-snr-math-laser
parameters:
density: 0.382
weight: [0.128, 0.512, 0.128, 0.128] # weight gradient
- model: Crystalcareai/Mistral-Evol-Coder
parameters:
density: 0.382
weight: [0.256, 0.256, 0.512, 0.128] # weight gradient
- model: liminerity/Mistral-quiet-star-demo
parameters:
density: 0.382
weight:
- filter: mlp
value: 0.5
- value: 0
merge_method: ties
base_model: LeroyDyer/Mixtral_AI_Cyber_Dolphin
parameters:
normalize: true
int8_mask: true
dtype: float16
- Downloads last month
- 5