view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon May 9 • 11
view article Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding Jan 30 • 4
Efficient Post-training Quantization with FP8 Formats Paper • 2309.14592 • Published Sep 26, 2023 • 10 • 2
Intel Neural Chat Collection Fine-tuned 7B parameter LLM models, one of which made it to the top of the 7B HF LLM Leaderboard • 15 items • Updated Aug 23 • 2