# This file defines the templates used to extract CRF feature strings
# from the internal-state feature strings.
# Each line corresponds to one template.
# Lines beginning with UNIGRAM are unigram templates, and
# lines beginning with BIGRAM are connection (bigram) templates.
#
# %F[0..N] Unigram context
# %F[n] : expands to the n-th feature of the UNIGRAM.
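#
# The templates in this file also use three forms that the notes above do not
# list. The descriptions here are a hedged reading of MeCab's template syntax,
# not part of the original file:
# %F?[n] : like %F[n], but the whole template is skipped when the n-th
#          feature is undefined ("*").
# %L[n]  : the n-th feature of the left-hand entry in a BIGRAM template.
# %R[n]  : the n-th feature of the right-hand entry in a BIGRAM template.
#
# Worked example, assuming a hypothetical entry whose features begin "NNG,*,T":
#   U00:%F[0]         expands to  U00:NNG
#   U01:%F[0],%F?[1]  is not generated, because feature 1 is "*"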

# POS Unigram
UNIGRAM U00:%F[0]
UNIGRAM U01:%F[0],%F?[1]

# Read-SemanticClass-POS
UNIGRAM R00:%F[3]
UNIGRAM R01:%F[0],%F[3]
UNIGRAM R02:%F[0],%F?[1],%F[3]

# POS
## POS,semantic class -> POS,semantic class connection
BIGRAM B00:%L[0]/%R[0]
BIGRAM B01:%L[0],%L?[1]/%R[0]
BIGRAM B02:%L[0]/%R[0],%R?[1]
BIGRAM B03:%L[0],%L?[1]/%R[0],%R?[1]
## Final-consonant presence -> reading connection
BIGRAM B10:%L[0],%L?[2]/%R[0],%R?[3]
BIGRAM B11:%L[0],%L?[2]/%R[0],%R?[1],%R?[3]
BIGRAM B12:%L[0],%L?[1],%L?[2]/%R[0],%R?[3]
BIGRAM B13:%L[0],%L?[1],%L?[2]/%R[0],%R?[1],%R?[3]
## Reading connection
BIGRAM B20:%L[0],%L?[3]/%R[0]
BIGRAM B21:%L[0],%L?[3]/%R[0],%R?[3]
BIGRAM B22:%L[0]/%R[0],%R?[3]