Edit model card

repo_clone_080724

name: aya-23-8b
license: cc-by-nc-4.0
tags:
- Cohere
- CohereForAI
- Aya
- Command-R
- text-generation
- text2text-generation
- natural-language
- multilingual
- bartowski
type:
- 6GB
- 8GB
- llm
- chat
- multilingual
- aya
- command-r
config:
- ctx=8192
- 4bit
- 5bit
- temp=0.3
system_prefix: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>
system_suffix:
tools: 
token_prefix: <|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>
token_suffix: <|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
antiprompt: <|START_OF_TURN_TOKEN|><|END_OF_TURN_TOKEN|>
resolutions: 
datasets:
- CohereForAI/aya_dataset
- CohereForAI/aya_collection
- CohereForAI/aya_evaluation_suite
- CohereForAI/aya_collection_language_split
- CohereForAI/aya_redteaming
language:
- en
- fr
- de
- es
- it
- pt
- ja
- ko
- zh
- ar
- el
- fa
- pl
- id
- cs
- he
- hi
- nl
- ro
- ru
- tr
- uk
- vi
size:
- 5056982144
- 5803568256
use: 
shortcomings: 
sources:
- https://arxiv.org/abs/2402.06619
- https://arxiv.org/abs/2405.15032
funded_by: 
train_hardware: 
pipeline_tag: text-generation
examples:
- "<BOS_TOKEN><|START_OF_TURN_TOKEN|><|USER_TOKEN|>Anneme onu ne kadar sevdiğimi anlatan bir mektup yaz<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>"
Downloads last month
125
GGUF
Model size
8.03B params
Architecture
command-r

4-bit

5-bit

Inference API
Unable to determine this model's library. Check the docs .

Datasets used to train darkshapes/aya-23-8b-gguf