Change use_cache to True which significantly speeds up inference (#2) ca45eff ehartford TheBloke commited on May 5, 2023