DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation Paper • 2404.07917 • Published Apr 11, 2024
MSEval: A Dataset for Material Selection in Conceptual Design to Evaluate Algorithmic Models Paper • 2407.09719 • Published Jul 12, 2024
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Paper • 2406.11811 • Published Jun 17, 2024
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference Paper • 2404.15420 • Published Apr 23, 2024
WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI Paper • 2308.13355 • Published Aug 25, 2023
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Paper • 2307.03869 • Published Jul 8, 2023