Submitted by Zixuan Jiang
Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation
SJTU Cross Media Language Intelligence Lab