Increase dropout for GPT-Neo

#11 opened 6 months ago by

Flash attention

#8 opened 8 months ago by

Demo code for GPT-NEO 2.7B

#6 opened 8 months ago by

utilizing tensor cores

#3 opened 11 months ago by

stop sequence

#2 opened 12 months ago by