Haoze Wu

WaitHZ

AI & ML interests

Modular DL, Complex Reasoning

Recent Activity

Organizations

None yet

WaitHZ's activity

upvoted 2 articles about 2 months ago
Article

How to generate text: using different decoding methods for language generation with Transformers

181
Article

You could have designed state of the art positional encoding

204
New activity in deepseek-ai/deepseek-moe-16b-base about 1 year ago

A little question about aux_loss

2
#4 opened about 1 year ago by WaitHZ
