Untrained and Unmatched: Fast and Accurate Zero-Training Classification for Tabular Engineering Data

Picard, Cyril; Ahmed, Faez

Source: Journal of Mechanical Design:;2024:;volume( 146 ):;issue: 009::page 91705-1

Author:

Picard, Cyril

Ahmed, Faez

DOI: 10.1115/1.4064811

Publisher: The American Society of Mechanical Engineers (ASME)

Abstract: In engineering design, navigating complex decision-making landscapes demands a thorough exploration of the design, performance, and constraint spaces, often impeded by resource-intensive simulations. Data-driven methods can mitigate this challenge by harnessing historical data to delineate feasible domains, accelerate optimization, or evaluate designs. However, the implementation of these methods usually demands machine learning expertise and multiple trials to choose the right method and hyperparameters. This makes them less accessible for numerous engineering situations. Additionally, there is an inherent trade-off between training speed and accuracy, with faster methods sometimes compromising precision. In our paper, we demonstrate that a recently released general-purpose transformer-based classification model, TabPFN, is both fast and accurate. Notably, it requires no dataset-specific training to assess new tabular data. TabPFN is a prior-data fitted network, which undergoes a one-time offline training across a broad spectrum of synthetic datasets and performs in-context learning. We evaluated TabPFN’s efficacy across eight engineering design classification problems, contrasting it with seven other algorithms, including a state-of-the-art automated machine learning (AutoML) method. For these classification challenges, TabPFN consistently outperforms in speed and accuracy. It is also the most data-efficient and provides the added advantage of being differentiable and giving uncertainty estimates. Our findings advocate for the potential of pre-trained models that learn from synthetic data and require no domain-specific tuning to make data-driven engineering design accessible to a broader community and open ways to efficient general-purpose models valid across applications. Furthermore, we share a benchmark problem set for evaluating new classification algorithms in engineering design.

Download: (1.155Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Untrained and Unmatched: Fast and Accurate Zero-Training Classification for Tabular Engineering Data

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4295713

Collections

Journal of Mechanical Design

Show full item record

contributor author	Picard, Cyril
contributor author	Ahmed, Faez
date accessioned	2024-04-24T22:42:11Z
date available	2024-04-24T22:42:11Z
date copyright	3/5/2024 12:00:00 AM
date issued	2024
identifier issn	1050-0472
identifier other	md_146_9_091705.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4295713
description abstract	In engineering design, navigating complex decision-making landscapes demands a thorough exploration of the design, performance, and constraint spaces, often impeded by resource-intensive simulations. Data-driven methods can mitigate this challenge by harnessing historical data to delineate feasible domains, accelerate optimization, or evaluate designs. However, the implementation of these methods usually demands machine learning expertise and multiple trials to choose the right method and hyperparameters. This makes them less accessible for numerous engineering situations. Additionally, there is an inherent trade-off between training speed and accuracy, with faster methods sometimes compromising precision. In our paper, we demonstrate that a recently released general-purpose transformer-based classification model, TabPFN, is both fast and accurate. Notably, it requires no dataset-specific training to assess new tabular data. TabPFN is a prior-data fitted network, which undergoes a one-time offline training across a broad spectrum of synthetic datasets and performs in-context learning. We evaluated TabPFN’s efficacy across eight engineering design classification problems, contrasting it with seven other algorithms, including a state-of-the-art automated machine learning (AutoML) method. For these classification challenges, TabPFN consistently outperforms in speed and accuracy. It is also the most data-efficient and provides the added advantage of being differentiable and giving uncertainty estimates. Our findings advocate for the potential of pre-trained models that learn from synthetic data and require no domain-specific tuning to make data-driven engineering design accessible to a broader community and open ways to efficient general-purpose models valid across applications. Furthermore, we share a benchmark problem set for evaluating new classification algorithms in engineering design.
publisher	The American Society of Mechanical Engineers (ASME)
title	Untrained and Unmatched: Fast and Accurate Zero-Training Classification for Tabular Engineering Data
type	Journal Paper
journal volume	146
journal issue	9
journal title	Journal of Mechanical Design
identifier doi	10.1115/1.4064811
journal fristpage	91705-1
journal lastpage	91705-13
page	13
tree	Journal of Mechanical Design:;2024:;volume( 146 ):;issue: 009
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive