Jaekoo Kang
jkang
AI & ML interests
Anything fun and interesting
Organizations
Collections
5
-
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Paper β’ 2309.07915 β’ Published β’ 4 -
Skywork: A More Open Bilingual Foundation Model
Paper β’ 2310.19341 β’ Published β’ 6 -
Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
Paper β’ 2310.19061 β’ Published β’ 8 -
Lumiere: A Space-Time Diffusion Model for Video Generation
Paper β’ 2401.12945 β’ Published β’ 85
spaces
7
models
8
jkang/espnet2_an4_transformer
Automatic Speech Recognition
β’
Updated
β’
9
jkang/espnet2_librispeech_100_conformer_char
Automatic Speech Recognition
β’
Updated
β’
3
jkang/espnet2_librispeech_100_conformer_word
Automatic Speech Recognition
β’
Updated
β’
5
β’
1
jkang/espnet2_librispeech_100_conformer
Automatic Speech Recognition
β’
Updated
β’
8
jkang/espnet2_mini_librispeech_diar
Updated
β’
3
jkang/espnet2_an4_asr
Automatic Speech Recognition
β’
Updated
β’
2
jkang/drawing-artistic-trend-classifier
Updated
β’
9
jkang/drawing-artist-classifier
Updated
β’
12
β’
1
datasets
None public yet