lily_fast_api / ENVIRONMENT_VARIABLES.md
gbrabbit's picture
Fresh start for HF Spaces deployment
526927a
|
raw
history blame
3.36 kB

πŸ”§ ν™˜κ²½ λ³€μˆ˜ μ„€μ • κ°€μ΄λ“œ

🏠 둜컬 개발 ν™˜κ²½

.env 파일 μ„€μ •

ν”„λ‘œμ νŠΈ λ£¨νŠΈμ— .env νŒŒμΌμ„ μƒμ„±ν•˜κ³  λ‹€μŒ λ³€μˆ˜λ“€μ„ μ„€μ •ν•˜μ„Έμš”:

# κΈ°λ³Έ μ„œλ²„ μ„€μ •
HOST=0.0.0.0
PORT=8001
PYTHONPATH=/app
PYTHONUNBUFFERED=1

# ν™˜κ²½ 감지
IS_LOCAL=true
ENVIRONMENT=local
DOCKER_ENV=local

# λͺ¨λΈ μ„€μ •
DEFAULT_MODEL=kanana-1.5-v-3b-instruct
MAX_NEW_TOKENS=256
TEMPERATURE=0.7

# 둜컬 λͺ¨λΈ 경둜 (선택사항)
LOCAL_MODEL_PATH=./lily_llm_core/models/kanana_1_5_v_3b_instruct

둜컬 Docker μ‹€ν–‰

# 둜컬 개발용 Docker λΉŒλ“œ
docker build -f Dockerfile.local -t lily-llm-local .

# 둜컬 μ‹€ν–‰ (포트 8001)
docker run -p 8001:8001 --env-file .env lily-llm-local

☁️ Hugging Face Spaces ν™˜κ²½

ν•„μˆ˜ ν™˜κ²½ λ³€μˆ˜

Hugging Face Spaces Settings > Variablesμ—μ„œ λ‹€μŒ λ³€μˆ˜λ“€μ„ μ„€μ •ν•˜μ„Έμš”:

κΈ°λ³Έ μ„œλ²„ μ„€μ •

HOST=0.0.0.0
PORT=7860
PYTHONPATH=/app
PYTHONUNBUFFERED=1

Hugging Face μ„€μ •

# μΊμ‹œ 디렉토리
TRANSFORMERS_CACHE=/app/cache/transformers
HF_HOME=/app/cache/huggingface
HF_HUB_CACHE=/app/cache/huggingface

# λͺ¨λΈ μ„€μ •
HF_MODEL_NAME=gbrabbit/lily-math-model
DEFAULT_MODEL=kanana-1.5-v-3b-instruct

# 토큰화 병렬 처리 λΉ„ν™œμ„±ν™” (λ©”λͺ¨λ¦¬ μ ˆμ•½)
TOKENIZERS_PARALLELISM=false

μ„±λŠ₯ μ΅œμ ν™”

# CPU μŠ€λ ˆλ“œ μ œν•œ (λ©”λͺ¨λ¦¬ μ ˆμ•½)
OMP_NUM_THREADS=1
MKL_NUM_THREADS=1

# PyTorch μ„€μ •
TORCH_HOME=/app/cache/torch
PYTORCH_TRANSFORMERS_CACHE=/app/cache/transformers

AI λͺ¨λΈ μ„€μ •

# 생성 νŒŒλΌλ―Έν„°
MAX_NEW_TOKENS=256
TEMPERATURE=0.7
TOP_P=0.9
TOP_K=40

선택적 ν™˜κ²½ λ³€μˆ˜

디버깅

# 둜그 레벨
LOG_LEVEL=INFO
DEBUG=false

# 상세 λ‘œκΉ…
TRANSFORMERS_VERBOSITY=warning
HF_HUB_VERBOSITY=warning

λ³΄μ•ˆ (ν•„μš”μ‹œ)

# API ν‚€ (ν•„μš”ν•œ 경우)
HF_TOKEN=your_huggingface_token
API_SECRET_KEY=your_secret_key

πŸš€ μžλ™ λͺ¨λΈ λ‹€μš΄λ‘œλ“œ λ™μž‘ 방식

1단계: 둜컬 λͺ¨λΈ 확인

  • /app/lily_llm_core/models/kanana_1_5_v_3b_instruct/ 경둜 확인
  • 파일이 있으면 둜컬 λͺ¨λΈ μ‚¬μš©

2단계: Hugging Face Hub λ‹€μš΄λ‘œλ“œ

  • 둜컬 λͺ¨λΈμ΄ μ—†μœΌλ©΄ gbrabbit/lily-math-modelμ—μ„œ μžλ™ λ‹€μš΄λ‘œλ“œ
  • /app/cache/transformers/ κ²½λ‘œμ— μΊμ‹œ μ €μž₯

3단계: λͺ¨λΈ λ‘œλ”©

  • μΊμ‹œλœ λͺ¨λΈμ„ λ©”λͺ¨λ¦¬μ— λ‘œλ“œ
  • μ„œλ²„ μ‹œμž‘ μ™„λ£Œ

πŸ“Š μ˜ˆμƒ λ™μž‘

첫 번째 배포

🌐 Hugging Face Hubμ—μ„œ λ‹€μš΄λ‘œλ“œ: gbrabbit/lily-math-model
πŸ“₯ λͺ¨λΈ λ‹€μš΄λ‘œλ“œ 쀑... (μ•½ 2-5λΆ„)
βœ… λͺ¨λΈ λ‘œλ“œ μ™„λ£Œ
πŸš€ μ„œλ²„ μ‹œμž‘: 0.0.0.0:7860

이후 μž¬μ‹œμž‘

πŸ—‚οΈ μΊμ‹œλœ λͺ¨λΈ μ‚¬μš©: /app/cache/transformers/
βœ… λͺ¨λΈ λ‘œλ“œ μ™„λ£Œ (μ•½ 30초)
πŸš€ μ„œλ²„ μ‹œμž‘: 0.0.0.0:7860

πŸ” 문제 ν•΄κ²°

λͺ¨λΈ λ‹€μš΄λ‘œλ“œ μ‹€νŒ¨

# λ„€νŠΈμ›Œν¬ μ—°κ²° 확인
curl -I https://huggingface.co/gbrabbit/lily-math-model

# Hugging Face Hub μƒνƒœ 확인
curl -I https://huggingface.co/api/models/gbrabbit/lily-math-model

λ©”λͺ¨λ¦¬ λΆ€μ‘±

# 더 μž‘μ€ λͺ¨λΈ μ‚¬μš© λ˜λŠ” μ–‘μžν™” 적용
# Hardware μ—…κ·Έλ ˆμ΄λ“œ κ³ λ € (CPU upgrade λ˜λŠ” GPU)

μΊμ‹œ 문제

# μΊμ‹œ 디렉토리 κΆŒν•œ 확인
ls -la /app/cache/

# μΊμ‹œ μ‚­μ œ ν›„ μž¬μ‹œμž‘
rm -rf /app/cache/transformers/*