Sequence length?

#5 opened 9 months ago by deleted

Handle model parallelism

#4 opened 11 months ago by sgugger

adds _no_split_block

1
#3 opened about 1 year ago by staturecrane

Results are extremely poor

2
#2 opened about 1 year ago by Steve72

Quantization support.

3
#1 opened about 1 year ago by AV99