Update RotaryEmbedding caching

#33
by beibin79 - opened
  1. Save computation for multiple alternating calls with the same sequence length.
  2. Debug: the cos_sin(...) function could return (None, None) during multi-threading on some hardware. The latest dict design can resolve this issue.
Cannot merge
This branch has merge conflicts in the following files:
  • modeling_falcon.py

Sign up or log in to comment