fix(modeling_phi): Fixes cached generation when above maximum context length. ecfe56e gugarosa committed on Dec 5, 2023
Fixes exceeding maximum sequence length when using generate(). 759d148 gugarosa committed on Nov 20, 2023
Fixes any potential overflow when calculating attention weights. b5c5161 gugarosa committed on Nov 16, 2023