INo0121 committed on
Commit
4426a4c
•
1 Parent(s): f0a60d4

Update README.md

  1. README.md +12 -2
README.md CHANGED
@@ -25,11 +25,21 @@ It achieves the following results on the evaluation set:
25
 
26
  ## Model description
27
 
28
- More information needed
29
 
30
  ## Intended uses & limitations
31
 
32
- More information needed
 
33
 
34
  ## Training and evaluation data
35
 
 
25
 
26
  ## Model description
27
 
28
+ ํ”„๋กœ์ ํŠธ ์šฉ๋„๋กœ ํŒŒ์ธํŠœ๋‹๋œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
29
+ OpenAI์˜ Whisper-Base ๋ชจ๋ธ์„ ๋ฐ”ํƒ•์œผ๋กœ 'ํ•œ๊ตญ์–ด ์ €์Œ์งˆ ์Œ์„ฑ ํ†ตํ™” ๋ฐ์ดํ„ฐ'์— ๋Œ€ํ•œ ์ •ํ™•๋„๋ฅผ ์ฆ๊ฐ€์‹œํ‚ค๊ณ ์ž ํŒŒ์ธํŠœ๋‹์„ ์ง„ํ–‰ํ•œ ๋ชจ๋ธ์ด๋ฉฐ,
30
+ ์‚ฌ์šฉํ•œ ๋ฐ์ดํ„ฐ๋Š” AI-HUB์˜ โ€˜์ €์Œ์งˆ ์ „ํ™”๋ง ์Œ์„ฑ์ธ์‹ ๋ฐ์ดํ„ฐโ€™ ์ค‘ ์ผ๋ถ€๋กœ์„œ ์˜ค๋””์˜ค ํŒŒ์ผ ๊ธฐ์ค€ 240,771.06์ดˆ(ํŒŒ์ผ 1๊ฐœ๋‹น ํ‰๊ท  ๊ธธ์ด๋Š” ์•ฝ 5.296์ดˆ)
31
+ ํ…์ŠคํŠธ ๋ฐ์ดํ„ฐ ๊ธฐ์ค€ ์ด 1,696,414๊ธ€์ž์˜ ํฌ๊ธฐ์ž…๋‹ˆ๋‹ค.
32
+
33
+ This is a fine-tuned model for project use.
34
+ It was fine-tuned from OpenAI's Whisper-Base model to improve accuracy on 'Korean low-quality voice call data'.
35
+ The data used is a subset of AI-HUB's 'low-quality telephone network voice recognition data',
36
+ which amounts to 240,771.06 seconds of audio (average length per file is about 5.296 seconds).
37
+ The corresponding text data totals 1,696,414 characters.
38
 
39
  ## Intended uses & limitations
40
 
41
+ ํŒŒ์ธํŠœ๋‹์— ์‚ฌ์šฉ๋œ Base model๊ณผ dataset ๋ชจ๋‘ ํ•™์Šต ๋ชฉ์ ์œผ๋กœ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฉฐ,
42
+ ๋”ฐ๋ผ์„œ ๋ณธ ๋ชจ๋ธ ์—ญ์‹œ ํ•™์Šต ๋ชฉ์ ์œผ๋กœ๋งŒ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
43
 
44
  ## Training and evaluation data
45
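
The dataset figures quoted in the added Model description (total audio seconds, average seconds per file, total characters) imply a few other quantities that the README does not state directly, such as the total duration in hours and the approximate number of audio files. The sketch below derives them with plain arithmetic. The function name `summarize` and the dictionary keys are hypothetical, chosen only for illustration, and because the average file length is given as approximate, the derived file count is an estimate rather than a reported value.

```python
# Figures quoted in the README's Model description.
TOTAL_AUDIO_SECONDS = 240_771.06
MEAN_SECONDS_PER_FILE = 5.296  # stated as "about" in the README
TOTAL_CHARACTERS = 1_696_414


def summarize(total_seconds: float, mean_seconds: float, total_chars: int) -> dict:
    """Derive rough dataset statistics from the three quoted figures.

    `summarize` is a hypothetical helper, not part of the model repository.
    """
    return {
        # Total audio duration expressed in hours.
        "hours": total_seconds / 3600,
        # Estimated file count: total duration divided by mean file length.
        "approx_files": round(total_seconds / mean_seconds),
        # Average transcript density over the whole subset.
        "chars_per_second": total_chars / total_seconds,
        # Average transcript length for a file of mean duration.
        "chars_per_file": total_chars * mean_seconds / total_seconds,
    }


stats = summarize(TOTAL_AUDIO_SECONDS, MEAN_SECONDS_PER_FILE, TOTAL_CHARACTERS)
for name, value in stats.items():
    print(f"{name}: {value:,.2f}")
```

Under these figures the subset comes to roughly 67 hours of audio spread over about 45,000 short clips, at around 7 characters of transcript per second of speech.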