Large Language Models for Computer-Aided Design Fine Tuned: Dataset and Experiments
Source: Journal of Mechanical Design, 2025, volume 147, issue 004, page 41710-1
DOI: 10.1115/1.4067713
Publisher: The American Society of Mechanical Engineers (ASME)
Abstract: Despite the power of large language models (LLMs) in various cross-modal generation tasks, their ability to generate 3D computer-aided design (CAD) models from text remains underexplored due to the scarcity of suitable datasets. Additionally, there is a lack of multimodal CAD datasets that include both reconstruction parameters and text descriptions, which are essential for the quantitative evaluation of the CAD generation capabilities of multimodal LLMs. To address these challenges, we developed a dataset of CAD models, sketches, and image data for representative mechanical components such as gears, shafts, and springs, along with natural language descriptions collected via Amazon Mechanical Turk. By using CAD programs as a bridge, we facilitate the conversion of textual output from LLMs into precise 3D CAD designs. To enhance the text-to-CAD generation capabilities of GPT models and demonstrate the utility of our dataset, we developed a pipeline to generate fine-tuning training data for GPT-3.5. We fine-tuned four GPT-3.5 models with various data sampling strategies based on the length of a CAD program. We evaluated these models using parsing rate and intersection over union (IoU) metrics, comparing their performance to that of GPT-4 without fine-tuning. The new knowledge gained from the comparative study on the four different fine-tuned models provided us with guidance on the selection of sampling strategies to build training datasets in fine-tuning practices of LLMs for text-to-CAD generation, considering the trade-off between part complexity, model performance, and cost.
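The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give their exact definitions, so the code below assumes that parsing rate is the fraction of generated CAD programs that execute into a valid shape, and that IoU is computed on voxelized solids. The names `voxel_iou`, `parsing_rate`, and `box` are hypothetical helpers, not part of the authors' pipeline.

```python
from typing import Iterable, Optional, Set, Tuple

Voxel = Tuple[int, int, int]


def voxel_iou(a: Set[Voxel], b: Set[Voxel]) -> float:
    """Intersection over union of two sets of occupied voxel coordinates."""
    union = a | b
    if not union:
        # Assumption: two empty shapes are treated as identical.
        return 1.0
    return len(a & b) / len(union)


def parsing_rate(results: Iterable[Optional[Set[Voxel]]]) -> float:
    """Fraction of generated programs that produced a shape.

    Assumption: a program that failed to parse or execute is recorded as None.
    """
    results = list(results)
    if not results:
        return 0.0
    return sum(r is not None for r in results) / len(results)


def box(nx: int, ny: int, nz: int) -> Set[Voxel]:
    """Hypothetical stand-in for a voxelized CAD solid: an axis-aligned box."""
    return {(x, y, z) for x in range(nx) for y in range(ny) for z in range(nz)}


target = box(4, 4, 4)      # ground-truth part, 64 voxels
generated = box(4, 4, 2)   # generated part, 32 voxels, fully inside the target

iou = voxel_iou(target, generated)
rate = parsing_rate([generated, None, target, generated])
print(f"IoU = {iou:.2f}, parsing rate = {rate:.2f}")
```

In this toy case the generated box fills half of the target volume, and one of the four generated programs is marked as failed.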
| contributor author | Sun, Yuewan | |
| contributor author | Li, Xingang | |
| contributor author | Sha, Zhenghui | |
| date accessioned | 2025-08-20T09:28:26Z | |
| date available | 2025-08-20T09:28:26Z | |
| date copyright | 2025-02-27 | |
| date issued | 2025 | |
| identifier issn | 1050-0472 | |
| identifier other | md-24-1560.pdf | |
| identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4308337 | |
| publisher | The American Society of Mechanical Engineers (ASME) | |
| title | Large Language Models for Computer-Aided Design Fine Tuned: Dataset and Experiments | |
| type | Journal Paper | |
| journal volume | 147 | |
| journal issue | 4 | |
| journal title | Journal of Mechanical Design | |
| identifier doi | 10.1115/1.4067713 | |
| journal firstpage | 41710-1 | |
| journal lastpage | 41710-15 | |
| page | 15 | |
| tree | Journal of Mechanical Design; 2025; volume 147; issue 004 | |
| contenttype | Fulltext |