Large Language Models for Computer-Aided Design Fine Tuned: Dataset and Experiments

Sun, Yuewan; Li, Xingang; Sha, Zhenghui

Source: Journal of Mechanical Design:;2025:;volume( 147 ):;issue: 004::page 41710-1

Author:

Sun, Yuewan

Li, Xingang

Sha, Zhenghui

DOI: 10.1115/1.4067713

Publisher: The American Society of Mechanical Engineers (ASME)

Abstract: Despite the power of large language models (LLMs) in various cross-modal generation tasks, their ability to generate 3D computer-aided design (CAD) models from text remains underexplored due to the scarcity of suitable datasets. Additionally, there is a lack of multimodal CAD datasets that include both reconstruction parameters and text descriptions, which are essential for the quantitative evaluation of the CAD generation capabilities of multimodal LLMs. To address these challenges, we developed a dataset of CAD models, sketches, and image data for representative mechanical components such as gears, shafts, and springs, along with natural language descriptions collected via Amazon Mechanical Turk. By using CAD programs as a bridge, we facilitate the conversion of textual output from LLMs into precise 3D CAD designs. To enhance the text-to-CAD generation capabilities of GPT models and demonstrate the utility of our dataset, we developed a pipeline to generate fine-tuning training data for GPT-3.5. We fine-tuned four GPT-3.5 models with various data sampling strategies based on the length of a CAD program. We evaluated these models using parsing rate and intersection over union (IoU) metrics, comparing their performance to that of GPT-4 without fine-tuning. The new knowledge gained from the comparative study on the four different fine-tuned models provided us with guidance on the selection of sampling strategies to build training datasets in fine-tuning practices of LLMs for text-to-CAD generation, considering the trade-off between part complexity, model performance, and cost.

Download: (1.624Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Large Language Models for Computer-Aided Design Fine Tuned: Dataset and Experiments

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4308337

Collections

Journal of Mechanical Design

Show full item record

contributor author	Sun, Yuewan
contributor author	Li, Xingang
contributor author	Sha, Zhenghui
date accessioned	2025-08-20T09:28:26Z
date available	2025-08-20T09:28:26Z
date copyright	2/27/2025 12:00:00 AM
date issued	2025
identifier issn	1050-0472
identifier other	md-24-1560.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4308337
description abstract	Despite the power of large language models (LLMs) in various cross-modal generation tasks, their ability to generate 3D computer-aided design (CAD) models from text remains underexplored due to the scarcity of suitable datasets. Additionally, there is a lack of multimodal CAD datasets that include both reconstruction parameters and text descriptions, which are essential for the quantitative evaluation of the CAD generation capabilities of multimodal LLMs. To address these challenges, we developed a dataset of CAD models, sketches, and image data for representative mechanical components such as gears, shafts, and springs, along with natural language descriptions collected via Amazon Mechanical Turk. By using CAD programs as a bridge, we facilitate the conversion of textual output from LLMs into precise 3D CAD designs. To enhance the text-to-CAD generation capabilities of GPT models and demonstrate the utility of our dataset, we developed a pipeline to generate fine-tuning training data for GPT-3.5. We fine-tuned four GPT-3.5 models with various data sampling strategies based on the length of a CAD program. We evaluated these models using parsing rate and intersection over union (IoU) metrics, comparing their performance to that of GPT-4 without fine-tuning. The new knowledge gained from the comparative study on the four different fine-tuned models provided us with guidance on the selection of sampling strategies to build training datasets in fine-tuning practices of LLMs for text-to-CAD generation, considering the trade-off between part complexity, model performance, and cost.
publisher	The American Society of Mechanical Engineers (ASME)
title	Large Language Models for Computer-Aided Design Fine Tuned: Dataset and Experiments
type	Journal Paper
journal volume	147
journal issue	4
journal title	Journal of Mechanical Design
identifier doi	10.1115/1.4067713
journal fristpage	41710-1
journal lastpage	41710-15
page	15
tree	Journal of Mechanical Design:;2025:;volume( 147 ):;issue: 004
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive