Dongwon Jo's picture

2 2

Dongwon Jo

dongwonjo

·

AI & ML interests

Efficient AI, Model Compression, Quantization, Pruning, Generative Model, Large Language Model, Diffusion

Recent Activity

upvoted a paper about 1 month ago

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models

authored a paper about 1 month ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

upvoted a paper about 1 month ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

View all activity

Organizations

None yet

dongwonjo's activity

upvoted a paper about 1 month ago

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models

Paper • 2406.12311 • Published Jun 18, 2024 • 7

authored a paper about 1 month ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published Feb 3 • 16

upvoted a paper about 1 month ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published Feb 3 • 16

commented a paper about 1 month ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published Feb 3 • 16 •

updated 5 models 6 months ago

dongwonjo/Llama-1-7B-BinaryMoS-E4

Updated Sep 9, 2024 • 7

dongwonjo/Llama-1-13B-BinaryMoS-E4

Updated Sep 9, 2024 • 6

dongwonjo/Llama-2-13B-BinaryMoS-E4

Updated Sep 9, 2024 • 9

dongwonjo/Llama-1-30B-BinaryMoS-E4

Updated Sep 9, 2024 • 15

dongwonjo/Llama-2-7B-BinaryMoS-E4

Updated Sep 9, 2024 • 10

commented a paper 9 months ago

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models

Paper • 2406.12311 • Published Jun 18, 2024 • 7 •

authored a paper 9 months ago

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models

Paper • 2406.12311 • Published Jun 18, 2024 • 7