File size: 1,687 Bytes
0aebe1c
f95b2ba
 
 
 
28d5482
 
f95b2ba
 
 
 
 
 
 
1a1c97b
 
28d5482
 
 
0aebe1c
 
10064ab
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
0aebe1c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5e0f4c0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
---
language:
- fr
- en
- es
license: llama2
library_name: peft
tags:
- llama7b
- LLAMA2
- peft
- opensource
- culture
- code
datasets:
- Snit/french-conversation
inference: true
pipeline_tag: text-generation
base_model: meta-llama/Llama-2-7b-chat-hf
---

ARIA 7B V2 is a model created by Faraday 🇫🇷 🇧🇪
The growing need of artificial intelligence tools around the world has created a run for GPU power. We decided to create an affordable model with better skills in French which can run on single GPU and reduce data bias observed in models trained mostly on english only datasets..

ARIA 7B has been trained on over 20.000 tokens of a high quality french dataset. ARIA 7B is one of the best open source models in the world avaible for this size of parameters.

## Training procedure : NVIDIA A100. Thanks to NVIDIA GPU and Inception program,we have been able to train our model within less than 24 hours.

## Base model : LLAMA_2-7B-CHAT-HF

We strongly believe that training models in more languages datasets can not only increase their knowledge base but also give more open analysis perspectives ,less focused visions and opinions from only one part of the world.
## Contact
contact@faradaylab.fr

## Number of Epoch : 2

## Timing : Less than 24 hours

The following `bitsandbytes` quantization config was used during training:
- quant_method: bitsandbytes
- load_in_8bit: True
- load_in_4bit: False
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: fp4
- bnb_4bit_use_double_quant: False
- bnb_4bit_compute_dtype: float32
### Framework versions


- PEFT 0.6.0.dev0