HW-202337458-tokenizer-practice

Assignment

Tokenizer μ‹€μŠ΅ κ³Όμ œμž…λ‹ˆλ‹€.

Student

  • Student ID: 202337458
  • Hugging Face ID: hf-june

Practice A. Text

  • AutoTokenizer μ‚¬μš©
  • bert-base-uncased λ‘œλ“œ
  • padding 적용
  • truncation 적용
  • decode μˆ˜ν–‰
  • μ €μž₯/λ‘œλ“œ 확인

Practice B. Image

  • AutoImageProcessor μ‚¬μš©
  • google/vit-base-patch16-224 λ‘œλ“œ
  • 이미지 batch 처리
  • μ €μž₯/λ‘œλ“œ 확인

Practice C. Multimodal

  • AutoProcessor μ‚¬μš©
  • openai/clip-vit-base-patch32 λ‘œλ“œ
  • text/image λ™μ‹œ 처리
  • μ €μž₯/λ‘œλ“œ 확인

Files

  • tokenizer_processor_practice.ipynb
  • assignment_summary.json
  • README.md

How to Use

  1. tokenizer_processor_practice.ipynb νŒŒμΌμ„ Colabμ—μ„œ μ—½λ‹ˆλ‹€.
  2. μœ„μ—μ„œλΆ€ν„° μˆœμ„œλŒ€λ‘œ μ‹€ν–‰ν•©λ‹ˆλ‹€.
  3. 각 μ‹€μŠ΅μ˜ μ €μž₯ μ „/ν›„ 검증 κ²°κ³Όκ°€ True둜 좜λ ₯λ˜λŠ”μ§€ ν™•μΈν•©λ‹ˆλ‹€.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support