Int. J. Precis. Eng. Manuf.-Smart Tech. > Volume 3(1); 2025 > Article
Islam, Ji, Kim, and Kim: Advanced Machine Learning Techniques for Predicting Z-Axis Belt Wear in Wafer Transfer Robots

Abstract

Wafer-transfer robots (WTRs) are critical to semiconductor manufacturing, where precision and efficiency are essential to ensure high production yields and minimize defects. A key challenge is the wear and degradation of the Z-axis belt, which can cause misalignments, operational disruptions, and costly downtimes. This study presents a data-driven approach leveraging machine learning to predict Z-axis belt wear, enhancing WTR reliability and performance. High-frequency acceleration sensor data from the Upper Blade axis of WTRs were utilized to develop and test various machine learning classification models, including K-Nearest Neighbors (KNN), Logistic Regression, Naive Bayes, Decision Tree, and Random Forest. The Random Forest model achieved the highest predictive accuracy at 99.8%, significantly surpassing traditional maintenance methods. These findings highlight the potential of machine learning to transform predictive maintenance in semiconductor manufacturing. By anticipating and preventing faults with exceptional precision, machine learning-based predictive maintenance enables more reliable and efficient operations, reducing costs and enhancing system longevity. This study not only demonstrates the effectiveness of machine learning in predictive maintenance but also sets the foundation for future applications aimed at optimizing the performance and lifespan of critical manufacturing equipment.

NOMENCLATURE

x

Input Feature or Variable in the Dataset

xi

Individual Data Points in the Training Dataset

yi

Coordinate of a Data Point in the i–th Dimension

dl

Euclidean Distance between Two Data Points (xi and yi) in the Feature Space

f (x)

Logistic Function (Sigmoid Function) that Outputs the Predicted Probability of an Event

P(A|B)

Conditional Probability of Event A given Event B

G(t)

Gini Impurity, a Measure of Impurity or Randomness at a Node t

c

Number of Classes

pi

The Proportion of Samples OF Class i in Node t

|T|

Size of the Training Dataset T

f(Ci,T)

Frequency of Class Ci in the Training Set T

TN

True Negative

TP

True Positive

FN

False Negative

FP

False Positive

1 Introduction

Wafer-transfer robots are critical components in the semiconductor manufacturing process and are responsible for the precise and automated handling of semiconductor wafers at various stages of fabrication. The accuracy and reliability of these robots are paramount because any mechanical deviation or failure can lead to significant defects in the wafers, resulting in costly production delays and reduced yields [1]. One of the key mechanical elements in these robots is the Z-axis belt, which controls the vertical movement essential for the correct positioning of the wafers. This belt experiences wear and tension degradation over time due to continuous operation, which can lead to misalignment, vibration, and, ultimately, equipment failure if improperly maintained. Current maintenance strategies for the Z-axis belts in WTRs typically rely on traditional methods, including periodic inspections and preventive measures. These approaches effectively reduce the risk of unexpected failures by ensuring regular system tests. Although traditional maintenance supports reliability, it may occasionally lead to planned downtimes and associated costs that can be further optimized. In addition, following fixed schedules can occasionally allow belt wear to progress between inspections, potentially leading to system interruptions. Continuous monitoring or predictive maintenance solutions can enhance precision and efficiency [2]. Therefore, there is a critical need for an intelligent maintenance approach that can predict the Z-axis belt wear and tension degradation in real time, enabling targeted interventions before issues arise. Artificial intelligence (AI) has recently gained significant traction in numerous domains, including computer vision, biomedical engineering, autonomous systems, and industrial digitalization [3,4]. Machine learning (ML) and deep learning (DL) approaches have demonstrated immense potential in prognostic and health management (PHM) systems, which are employed to monitor and predict the condition of various rotating machinery components such as those found in power generation systems, gearboxes [58], bearings, and motor [9]. Notable data-driven methods [10,11] include backpropagation neural networks [12], support vector machines [13], and artificial neural networks [14], which are effective in enhancing fault diagnosis processes [15]. Among the various algorithms used for data-driven fault detection, convolutional neural networks (CNNs) [1618] are currently the most popular because they automatically extract and learn features from complex data. However, this study overlooks CNNs because of the limited availability of data, which is crucial for effectively training deep-learning models. In cases where data scarcity is an issue, traditional ML algorithms are often more suitable because they require less data to achieve accurate predictions. Moreover, ML techniques such as artificial neural networks, decision tree methods, hybrid models, and latent variable models have been employed in studies focusing on mechanical fault diagnosis and prognosis in real industrial manufacturing use cases [19,20]. While many studies focus on fault diagnosis for individual mechanical components like motors and gears [2125], limited research addresses diagnostics at the system level in robotic applications, particularly semiconductor wafer transfer robots (WTRs). This study addresses that gap by focusing on fault prediction for Z-axis belt wear within the WTR system, marking a novel approach in predictive maintenance for robotic systems in high-precision manufacturing. By examining the Z-axis belt as part of a larger robotic subsystem, this research offers a new perspective on the semiconductor industry’s fault detection and maintenance strategies. Despite the limited focus on Z-axis belt wear in prior research, industry experts identify belt tension degradation as a primary cause of malfunctions in WTRs, often leading to issues with the Z-axis motor. Such wear can cause misalignment and other operational disruptions, emphasizing the need for real-time monitoring and maintenance of the Z-axis belt. This study fills this gap by providing critical insights into belt wear and degradation to enhance WTR reliability and longevity in high-precision environments.
This work proposes a novel fault prediction approach for Z-axis belt tension wear using data from a wireless acceleration sensor. For the first time, mechanical faults related to Z-axis belt wear were predicted through sensor-driven data collection, utilizing an industrial robotic system operating under realistic conditions to ensure applicability to practical industrial scenarios. Acceleration data were wirelessly collected from sensors during various robotic movements, with faults classified using ML algorithms. The model was validated using unseen data under multiple operating conditions (low, high, and variable speeds), demonstrating high predictive accuracy across scenarios. This study develops a predictive ML model based on real-time sensor data to prevent unexpected mechanical failures and optimize maintenance schedules. This research advances semiconductor manufacturing by introducing an ML model to monitor and predict wear and tension loss in WTR Z-axis belts. Early detection of tension loss enables manufacturers to reduce unplanned downtime, lower repair costs, and improve WTR performance and efficiency overall.
The remainder of this paper is structured as follows: Section 2 provides a comprehensive review of the existing literature on wafer transfer systems, particularly emphasizing the use of wafer WTRs in the semiconductor industry. Section 3 describes the dataset used in this study and outlines the methodology employed to develop the predictive model. Section 4 focuses on signal analysis and the application of ML to predict the wear in the Z-axis belt. Section 5 presents the results of the model evaluation, including its accuracy and potential implications for optimizing maintenance strategies. Finally, Section 6 concludes the study with a discussion of the findings and offers recommendations for future research.
Fig. 1 illustrates the comprehensive data acquisition and analysis process for monitoring the wafer WTRs. The experimental setup involved monitoring the z-axis belt tension wear, simulating various operational conditions, and artificially adjusting the belt tension. During the data acquisition phase, data was collected using an acceleration sensor and communicated wirelessly to the DAQ PC for further analysis. The signal processing stage included data preprocessing, visualization, and dataset creation. Finally, the performance evaluation used ML models to analyze the results through accuracy scores and receiver operating characteristic (ROC) curves to assess the efficiency and predictive capabilities of the system.

2 Wafer Robot Transfer System

WTRs play a critical role in semiconductor manufacturing by ensuring precise handling and transporting of wafers through crucial processes such as etching, deposition, and inspection. These robots can function in environments where precision and cleanliness are paramount because even the slightest handling errors can result in significant defects in semiconductor devices. The demand for increased throughput in semiconductor production has surged, requiring faster and more efficient wafer transfer. WTRs have evolved to meet these standards as wafers become increasingly precise and fragile. Precision is crucial because even minor impacts or misalignments can lead to wafer damage, resulting in costly delays and lower yields. To address the need for both speed and accuracy, modern WTRs are designed to handle wafers at high speeds with rapid acceleration and deceleration while maintaining precise positioning, ensuring increased productivity without compromising the delicate nature of the wafers. Figure 2 shows the NT series NT420 Kawasaki wafer-transfer robot used in this study.
A wafer transfer robot consists of a stable base, an arm structure for horizontal and vertical movement, and an end effector, typically equipped with a vacuum-based gripper that securely holds and releases wafers. The robot’s movements were driven by a combination of motors and belts, with the Z-axis belt playing a critical role in controlling the vertical position of the end effector. This Z-axis belt’s accurate functioning(operation and positioning) is crucial because it ensures the precise lifting and lowering of wafers. However, over time, the Z-axis belt is subjected to mechanical stress, which can lead to wear and loss of tension, potentially compromising the robot’s accuracy and reliability. Maintaining the Z-axis belt tension is challenging because the wear and degradation of the belt are influenced by various factors, including the operational environment, frequency of movements, and loads applied during wafer handling. Traditional maintenance practices typically involve scheduled inspections and preemptive belt replacement based on estimated wear patterns. However, these methods are not consistently effective because unexpected failures can occur between maintenance intervals, leading to unplanned downtimes and increased operational costs. Moreover, unnecessary belt replacement can further contribute to inefficiency. Given these challenges, there is growing interest in predictive maintenance strategies that aim to monitor the Z-axis belt condition in real time and predict when maintenance is required. By accurately forecasting belt wear and tension degradation, predictive maintenance can help prevent unexpected failures and optimize maintenance schedules, ultimately improving operational efficiency and reducing costs. Integrating ML into wafer transfer robotic systems offers a potential solution for achieving effective predictive maintenance. By continuously analyzing the acceleration sensor data from the robot’s operations, machine-learning models can detect subtle changes in the robot’s operation, which may indicate the onset of belt wear or tension loss. This data-driven approach enables precise and timely maintenance actions, ensuring the robot’s reliability and extending its components’ lives.
Table 1 lists the specifications of the Kawasaki NT420 model used in this study and details its vital performance characteristics. It outlines the basic structure and motion range, including the rotational movements of the θ1 and θ2 axes, which correspond to the first and second arms, respectively, and the H1 and H2 axes, representing the upper and lower blades. The Z-axis indicates the up-and-down motion. Additionally, the table specifies the robot’s position repeatability, ensuring a high precision level within ±0.1 mm and its maximum reach of 1,280 mm, making the NT420 ideal for precise and efficient wafer handling in semiconductor manufacturing environments [26].

3 Configuration of an Experimental Environment for Data Acquisition

3.1 Experimental Methods and Conditions

This study assesses the wear and tension degradation of a Z-axis belt in WTRs under real-world operational stress conditions. All experiments were conducted on an operational robot in the field, ensuring that the test conditions reflected the challenges faced in real-world scenarios. The experiments were performed on a Kawasaki NT420 WTR using combinations of key parameters, such as belt tension, velocity, and applied load, to evaluate their effects on the performance and durability of the Z-axis belt. Experimental conditions were based on commands from field engineers and real-world fault data, ensuring that the simulations accurately reflected the actual operational issues. The abnormal conditions were specifically selected based on a data-driven fault analysis derived from breakdowns and failures observed in real-life settings.
Table 2 provides an overview of the experimental conditions used in this study. Test 1 establishes the baseline, representing the standard operating conditions of the wafer transfer robot (WTR) with a belt tension of 350 Hz and velocities set to 50, 70%, and 100%. This “normal condition” was determined based on WTR specifications to represent typical functional parameters. In contrast, Tests 2, 3, and 4 were designed to simulate abnormal operating conditions by progressively decreasing the belt tension to 300, 280, and 230 Hz, respectively. These specific tension values were chosen based on the insights of field engineers, who identified them as representative of gradual tension loss, potentially approaching a threshold below which performance degradation and mechanical instability might occur. The selected tensions are close to this threshold, providing a practical basis to observe the robot’s responses under near-critical conditions. To enhance model robustness, silicon (Si) and aluminum (Al) wafers were used as loads, each with a diameter of 300 mm and weights of 125 and 135 g, respectively. These wafer types add diversity in load characteristics, helping to generalize the model’s performance across different operating scenarios. The Z-axis belt tension was measured using a Unitta U-508 sonic belt tension meter, which calculates tension by detecting sound waves (natural frequency) emitted when the belt is plucked. This noncontact method ensures precise tension measurements, aligning with the high-precision requirements of industrial equipment. The data for each experimental condition in Table 2 represents the average of five measurements to ensure accuracy and repeatability. The choice of 300, 280, and 230 Hz for the abnormal conditions simulates a gradual reduction in belt tension, which allows for a comparative analysis of how different levels of tension loss impact the WTR’s performance, particularly in signs of wear or degradation. This approach provides insights into the onset of abnormal conditions and helps clarify how approaching threshold tension levels might affect operational stability.
Fig. 3 illustrates the method used to measure the Z-axis belt tension. The average belt tension for both axes was calculated from these readings to comprehensively analyze the robot’s operational performance under varying conditions

3.2 Data Acquisition System

A sophisticated data acquisition system was deployed to monitor and analyze the robot’s performance accurately during the experiments. The system was designed to capture high-frequency, real-time data by focusing on the robot’s 3-axis acceleration. Table 3 presents a description of the sensor and the acquired data. The acceleration sensor used in this research was EBIMU24GV5, a 2.4 GHz wireless attitude and heading reference system (AHRS). The sensor provides the XXY-axis of the local data and the global data of the Z-axis. The distinction between the local and global axes refers to how the system measures and reports velocity, position, and other movement data relative to different frames of reference. The local axes correspond to the internal coordinate system of the device, which varies according to the orientation and movement of the sensor. Here, data such as the velocity and position are measured relative to the sensor itself, meaning that the x-, y-, and z-axes follow the device’s orientation as it moves or rotates. This is useful for tracking motion concerning the sensor’s current alignment but does not provide information on absolute positioning in a fixed space. By contrast, global axes refer to measurements taken relative to an external fixed reference frame, such as the Earth’s coordinate system. This system typically aligns with geographical directions (e.g., North, East, and Down downward). To convert local measurements into global coordinates, the system must integrate sensor data with heading, orientation, or position information derived from tools such as magnetometers, GPS, or other navigational aids. Global axes provide a standardized method for tracking movement and orientation in absolute terms, allowing consistent positioning regardless of the local orientation of the device. In summary, local axes track movement relative to the device itself, whereas global axes reference movement relative to the Earth’s fixed coordinates and require integration with additional data to provide an absolute frame of reference.
Moreover, the sensors were configured to record data at 1,000 Hz, ensuring that even subtle variations in the robot’s movements across the local and global axes were captured precisely. The data collected by the sensors was transmitted wirelessly to a centralized logging system. Communication between the sensors and logging system was facilitated by a high-speed protocol with a transmission rate of 921,600 bits per second (bps), ensuring that the data were not only transmitted rapidly but also retained their integrity during transfer. Fig. 4 shows the data acquisition system used in the proposed study.
Fig. 4 shows the data acquisition system used in this study. This data acquisition setup monitors the performance of the Kawasaki wafer transfer robot, which utilizes an acceleration sensor to collect motion data from the robot’s Z-axis. The sensor communicated wirelessly with a PC, and the data was analyzed. The robot interacted with a load port to transfer the wafers, whereas the robot and load port controllers managed their movements and load operations. Acceleration data helps track the robot’s performance in real time, ensuring precision and detecting potential issues during operation. Overall, this setup allowed for real-time analysis of the robot’s performance, enabling immediate detection of any operational anomalies. Once transmitted, the data were securely stored in a dedicated database where they could be accessed for postprocessing. This dataset, rich in high-resolution acceleration measurements, is used to develop advanced predictive models. These models were designed to forecast the Z-axis belt tension wear, leveraging detailed sensor data to identify patterns and trends indicative of impending belt degradation. Implementing this robust data acquisition system was critical for the study’s success, as it ensured the accuracy and reliability of the data used in developing the predictive maintenance models, ultimately leading to actionable insights that can enhance the longevity and performance of WTRs.

4 Machine Learning-based Fault Prediction

ML-based fault prediction in industrial systems applies sophisticated algorithms to analyze real-time sensor data and forecast failures in advance. For example, ML models monitor operational parameters, such as belt tension, acceleration, and vibration, to assess the health of WTRs. These models compare live sensor readings with learned patterns from normal operating conditions to detect anomalies such as wear or mechanical issues, which might indicate potential faults. For instance, if the belt tension in a WTR drops from the standard 350 Hz to lower levels like 300 or 230 Hz, the system flags this deviation as an early sign of degradation. ML models are trained on extensive datasets encompassing both normal and faulty scenarios, which enables them to recognize subtle changes in performance that may precede mechanical failures. By providing real-time predictions, these models allow preventive maintenance, reduce unexpected downtime, lower maintenance costs, and ensure that robots operate efficiently in high-precision environments such as semiconductor manufacturing.

4.1 Signal Processing

Signal processing is crucial in ML-based fault prediction, as it converts raw sensor data into a structured form suitable for analysis. This study captured acceleration data from the local XYZ axes and the global Z-axis using sensors installed on a WTR at a high sampling rate of 1,000 Hz. These high-resolution data were subjected to several preprocessing techniques to improve the signal quality and accurately predict the Z-axis belt tension wear and degradation. First, raw acceleration signals were processed using low-pass filters to eliminate high-frequency noise and irrelevant vibrations that could skew the results. By focusing only on relevant frequency bands, the filtered data more accurately represented the robot’s operational state, reducing interference from external noise and ensuring cleaner input for analysis. The baseline for the study was established under normal operating conditions: a belt tension of 350 Hz with velocities of 50, 70, and 100%. Both silicon (Si) and aluminum (Al) wafers were used to enhance the robustness of the model. This baseline facilitated comparisons with subsequent tests, in which the belt tension was incrementally reduced to 300, 280, and 230 Hz to simulate abnormal conditions and examine their impact on the mechanical stability of the system. Fig. 5 shows the global Z-axis amplitude readings at three different operational velocities of 50, 70, and 100% for four distinct belt tension classes: 350, 300, 280, and 230 Hz. Each plot reveals how the z-axis amplitude varies over time, with different colors representing the belt tension classes: red (350 Hz), green (300 Hz), blue (280 Hz), and orange (230 Hz).
At 50% velocity, the Z-axis amplitude exhibited a relatively stable and periodic pattern across all tension classes. The 350 Hz tension class consistently demonstrated the highest amplitude, while the 230 Hz tension class displayed the lowest. This suggests lower belt tensions reduce vertical forces on the Z-axis, indicating a smoother and more controlled operation at lower speeds. The minimal variation in the amplitude implies that the system experiences less dynamic vertical movement and maintains stability across all tension levels. At a velocity of 70%, the Z-axis amplitude began to fluctuate more significantly. In particular, the 350 and 300 Hz tension classes exhibited more pronounced vertical forces, as indicated by more significant amplitude fluctuations. The difference between the tension classes becomes more apparent at this speed, suggesting that the belt tension and increased operational velocity contribute to the amplified vertical forces and vibrations acting on the system. At 100% velocity, the Z-axis amplitude became more complex, with noticeable fluctuations across all tension classes. The 350 Hz and 280 Hz tension classes show the most prominent peaks and troughs, reflecting heightened vertical forces along the Z-axis. The increased complexity of the amplitude pattern at the maximum speed suggests that the system experiences more vital dynamic forces, with both the speed and belt tension playing significant roles in modulating the vertical vibrations. In conclusion, the analysis shows that the Z-axis amplitude increases with operational speed and belt tension. Higher belt tensions and faster velocities lead to greater vertical forces and vibrations, affecting the system’s overall stability. These insights into Z-axis dynamics are crucial for understanding mechanical behavior, monitoring performance, detecting wear, and identifying potential faults during operation.
Fig. 6 compares the global Z-axis amplitude readings for Silicon (Si) and Aluminum (Al) wafers under different belt tension conditions (350, 300, 280, and 230 Hz). In the Si wafer plots, the Global Z-Axis amplitude showed a periodic and stable pattern across all belt tension classes, with 350 Hz tension exhibiting the highest amplitude and 230 Hz the lowest. This suggests higher belt tensions correspond to greater vertical forces along the Z-axis for Si wafers. Still, the overall mechanical response remains consistent and controlled across different tension levels. In contrast, the Al wafer plots exhibited more irregular and pronounced variations in the Global Z-Axis amplitude, particularly at higher belt tensions. Although the 350 Hz tension class again shows the highest amplitude and the 230 Hz class the lowest, the fluctuations are less consistent than those in the Si wafer. This suggests that the Al wafers experience more variable vertical forces, possibly owing to the differences in material properties. The response of the Al wafer to vertical forces and vibrations appears to be more sensitive to belt tension, particularly at higher levels. Overall, the Si wafer demonstrated more stable and predictable behavior under different belt tensions, whereas the Al wafer showed more significant variability, particularly at higher tensions. This indicates that the material properties significantly influence the response of the wafers to vertical forces, providing essential insights into how different materials behave under similar operational conditions.

4.2 Machine Learning Classifier

Five machine learning classifiers—K-Nearest Neighbors (KNN), logistic regression, naive Bayes, decision tree, and random forest—were selected and applied to predict the Z-axis belt tension wear in the WTR system. Each algorithm was explicitly specified for its unique suitability to the characteristics of the robot’s operational data and the research objective of early fault detection.

4.2.1 K-Nearest Neighbors (KNN)

A KNN is a simple instance-based learning algorithm that classifies a data point based on the majority class of its K-nearest neighbors in the feature space. It is particularly effective in scenarios where the decision boundary is not linear and can adapt to complex data patterns. K-nearest neighbors (KNN) were selected because our system data exhibited nonlinear decision boundaries owing to the complex dynamics of the robot’s motion and the interaction between different operational parameters such as belt tension and speed. KNN, an instance-based learning algorithm, is highly adaptable to such nonlinearities. Its ability to classify based on its proximity to similar operational states in the dataset makes it particularly useful for detecting subtle shifts in belt tension patterns. Using the Euclidean distance, KNN can accurately capture these shifts, making it a practical choice for detecting belt degradation. The Euclidian distance was calculated as shown in Eq. (1) [27]:
(1)
dl=i=1n(xi-yi)2
The Euclidean distance dl is the distance between two measured points, where x is the training dataset, and y is the value for each data point. Where xi and yi are the coordinates of the two points in the i-th dimension, and n is the number of dimensions or features in the dataset.

4.2.2 Logistic Regression

Logistic regression is a model that estimates the probability of binary outcomes based on one or more predictor variables. It is well suited for problems in which the relationship between the input features and the output class is approximately linear. The logistic regression model uses a logistic function to transform a linear combination of input features into values between 0 and 1. Logistic regression was selected because of its ability to model the probability of binary outcomes, which aligns well with our goal of predicting whether the belt tension is within normal operating conditions or degraded. The linear decision boundary that logistic regression constructs is well suited for separating the data when the relationship between the input features (such as acceleration, tension, and speed) and the belt’s wear status is approximately linear. Additionally, the robustness of logistic regression in handling large datasets makes it a good fit for extensive sensor data, ensuring reliable predictions across different operational scenarios. The logistic function equation was as follows [28]:
(2)
f(x)=1(1+e-x)
The logistic function, also known as the sigmoid function f(x), is the output (the predicted probability of an event occurring). Where e is the Euler number (approximately 2.718), which is the base of the natural logarithm, and x is the input (the linear combination of the model’s features).

4.2.3 Naive Bayes

Naive Bayes is a probabilistic classifier based on Bayes’ theorem, which assumes independence among features. Despite its simplicity, Naive Bayes can be considerably effective for high-dimensional datasets and is particularly robust in handling noisy data. The goal of the naïve Bayes classifier is to calculate conditional probability. The Bayes equation is shown as [29]:
(3)
P(AB)=P(B|A).P(A)P(B)
The naïve Bayes algorithm is a probability function that determines the probability of A given that B occurs. A is the hypothesis, and B is the evidence, where P(B|A) is the probability of B, given that A is True. However, P(A) and P(B) are independent probabilities of A and B. The Naive Bayes (NB) classifier was selected for this study owing to its learning speed and storage space utilization efficiency. It is highly effective in training environments with maximum likelihood estimation (MLE) and Bayesian probability without extensive training. In addition, the NB classifier performs well with minimal datasets and is adept at handling complex real-world problems.

4.2.4 Decision Tree

A decision tree is a supervised learning algorithm for classification and regression tasks. The model is structured as a tree, where internal nodes represent tests on features, branches represent the outcomes of these tests, and leaf nodes represent class labels (in classification) or continuous values (in regression). The path from the root node to the leaf represents the classification rule. The algorithm recursively splits the dataset into subsets based on feature values that result in the most homogeneous sets of the target variable. A decision tree was selected for its ability to model non-linear relationships and interpretability. The tree structure allowed us to clearly understand the impact of each feature (e.g., belt tension and velocity) on the robot’s operational state and how these factors contribute to potential faults. In our system, the decision tree effectively captured the thresholds in belt tension that signified early wear, and its hierarchical structure made it easy to trace how decisions were made. By recursively splitting the data at critical thresholds (such as tension levels), a decision tree provides clear rule-based insights into when a fault may occur. The Gini impurity is the most used splitting criterion for classification [30].
(4)
G(t)=1-i=1cpi2
Where c is the number of classes, and Pi is the proportion of samples of class i in node t. A feature was selected to split the data at each node based on a criterion that minimizes the weighted Gini impurities of the child nodes.

4.2.5 Random Forest

Random forest is an ensemble learning method that constructs multiple decision trees during training and outputs each test sample’s class mode (classification). It is robust against overfitting, particularly in datasets with many features, and it provides a measure of feature importance. Random forest was included because of its ensemble learning approach, which enhances the robustness and accuracy of predictions by combining multiple decision trees. This is particularly useful in our system, where the sensor data may be noisy or incomplete. By averaging the projections of several trees, Random Forest reduces the risk of overfitting and improves generalization, ensuring that predictions are reliable across different belt tension levels and speeds. The random forest’s ability to handle many features effectively, such as different sensor readings, belt tensions, and velocities, makes it an ideal candidate for this study. The random forest classifier employs the Gini index (G(t)) as its attribute-selection measure, which quantifies the impurity of an attribute related to the predicted classes. Specifically, for a given training set T, the Gini index evaluates the randomness of assigning a selected case to a class Ci [30].
(5)
G(t)=i=1(f(Ci,T)|T|)j=1(f(Cj,T)|T|)
where f(Ci,T)|T| denotes the probability that the selected case belongs to class Ci each time a tree was grown to the maximum depth of the new training data using a combination of features. The random forest (RF) algorithm was employed in this study because of its intrinsic advantages. It provides robust performance through its ensemble learning approach, which combines multiple decision trees to improve accuracy and reduce overfitting. Additionally, RF offers easy implementation and the ability to effectively handle numerous features. These characteristics make RF a powerful and versatile choice for classification tasks.
Each classifier was trained on the processed dataset using cross-validation to optimize the hyperparameters and prevent overfitting. The models were evaluated using key metrics, including accuracy, precision, recall, F1-score, and area under the ROC curve (AUC-ROC) to ensure a comprehensive performance assessment. Once trained, the classifiers were tested on an unseen validation dataset to assess their generalization capabilities. This step was critical to determine how well the models could predict belt tension faults under new unencountered conditions. The results from each model were compared to identify the best-performing classifier for this application.

5 Results and Discussions

Machine learning classifiers were used to predict the Z-axis belt tension wear in WTR to identify normal and abnormal operating conditions based on sensor data collected under varying belt tension and operating speeds. The WTR was operated under silicon (Si) and aluminum (Al) wafers to ensure robustness in the model. Standard conditions were defined as a belt tension of 350 Hz, whereas abnormal conditions were simulated by progressively reducing the belt tension to 300, 280, or 230 Hz. These changes in the belt tension were designed to mimic wear and degradation, which could affect the mechanical stability and precision of the robot.
Accuracy is a widely used metric for assessing the performance of ML algorithms. It quantifies the proportion of correct predictions made by a model on the test data and is expressed as a percentage. This metric was calculated by dividing the number of correct predictions (True Positives and True Negatives) by the total number of predictions. The accuracy [31] of a classifier is typically evaluated using the following formula:
(6)
Accuracy=TP+TNTN+TP+FN+FP
True positives (TP) and True Negatives (TN) represent instances in which the model correctly predicts the positive and negative classes. False Positives (FP) and False Negatives (FN) indicate cases where the model incorrectly predicts the positive and negative classes. Accuracy provides insight into the ability of the model to correctly classify instances across both positive and negative classes, reflecting a balance between true and false predictions. Sensitivity is also called recall and measures the ratio of accurate optimistic predictions to the total number of positive instances in a dataset. This is a crucial metric, mainly when the cost of missing positive instances (false negatives) is significant. Sensitivity is mathematically defined as follows:
(7)
Sensitivity=TPTP+FN
Specificity evaluates the accuracy of optimistic predictions generated by a classification model. This quantified the ratio of true-positive predictions to the total number of instances predicted as positive by the model. Mathematically, the specificity is expressed as
(8)
Specificity=TNTN+FP
The F1 score is a metric used in ML that combines precision and recall into a single value. It is beneficial in situations with uneven class distribution or when false positives and negatives have different implications. The F1 scores were calculated using the following formula:
(9)
F1-Score=2TP2TP+FN+FP
The evaluated classifiers included KNN, Logistic Regression, Naive Bayes, random forest, and decision Tree, each selected for its potential to classify normal and abnormal conditions based on sensor data effectively. The performance of these classifiers was evaluated using key metrics, such as accuracy, sensitivity (recall), specificity, and F1-score, and the results are summarized in Table 4.
The results indicated a significant variation in performance across the classifiers. The KNN is highly effective in capturing the nonlinear relationships between the belt tension, wafer material, and speed. It achieved impressive sensitivity and specificity values (99.80 and 99.93%, respectively) with a high accuracy of 99.78 and an F1-score of 99.80%, indicating its strength in accurately identifying normal and abnormal belt tension states. While performing well in binary classification tasks, logistic regression was less successful in detecting abnormal wear patterns than KNN and Random Forest, achieving an accuracy of 84.19 and an F1-score of 84.23%. Its sensitivity was also low (85.76%), suggesting it is better suited for identifying normal conditions than subtle degradation patterns in belt tension. Naive Bayes struggled with this dataset because it assumed feature independence, which may not hold in real-world interactions between belt tension, wafer type, and speed. It underperformed with an accuracy of 67.78, a sensitivity of 70.94, and an F1-score of 66.86%, making it less effective at distinguishing between normal and abnormal wear conditions. However, the decision Tree performed exceptionally well, with an accuracy of 99.79, sensitivity of 99.8%, and specificity of 99.93%. Its ability to model nonlinear relationships allows it to detect wear thresholds accurately, providing valuable insights into belt tension wear. Random forest emerged as the top-performing model, slightly outperforming KNN and decision tree, with an accuracy of 99.80, sensitivity of 99.82, and specificity of 99.93%. Its ensemble nature helped mitigate overfitting and effectively handled the noisy sensor data. Its high F1-score of 99.80% highlights its balanced performance across all metrics, making random forest the most reliable choice for detecting Z-axis belt wear.
Following the classifier evaluation, Fig. 7 compares the training and test accuracies of the various ML classifiers used in this study for fault prediction in the Z-axis belt tension wear. The Logistic Regression model demonstrated equal training and testing accuracies of 84%, suggesting that the model generalized well to the unseen data. However, it performed poorly than the other models because of its linear nature, which may have limited its ability to capture the nonlinear relationships between the belt tension, speed, and wafer type.
In contrast, the decision tree, random forest, and KNN methods exhibited perfect training and testing accuracy (100%), indicating exceptional performance. This result highlights the ability of these models to effectively capture both normal and abnormal wear conditions in the belt, mainly when nonlinear interactions exist between variables such as belt tension and operational speed. The random forest ensemble method likely contributed to its ability to handle noisy data and avoid overfitting, whereas the KNN proximity-based approach helped detect nuanced patterns in the dataset. In contrast, naïve Bayes, which assumes feature independence, achieved an accuracy of 68% on both the training and testing sets. This relatively lower accuracy compared to other models indicates that the feature dependencies in the dataset (such as the relationship between speed, belt tension, and wafer material) reduced the Naive Bayes’ predictive capabilities, as the assumption of feature independence did not hold in this context.
The figure demonstrates that the decision tree, random forest, and KNN are better suited for this fault prediction task. In contrast, logistic regression and naïve Bayes show limitations in handling the complex, nonlinear relationships present in the dataset. The consistent performance between the training and test sets suggests that the top-performing models generalized well to the new data, making them highly suitable for real-world applications in predicting belt tension wear in WTRs.
Fig. 8 illustrates the ROC (Receiver Operating Characteristic) curves for the Random Forest classifier applied to Z-axis belt wear prediction. The plot includes a central black diagonal line, known as the “line of no discrimination”, which represents the performance of a random classifier, where the true positive rate (TPR) equals the false positive rate (FPR) at all points, resulting in an area under the curve (AUC) of 0.5. However, the Random Forest model’s ROC curves for each class closely follow the top-left corner of the plot, achieving a perfect AUC of 1.0 across all classes (class 0, class 1, class 2, and class 3). This ideal AUC score of 1.0 for each class indicates flawless classification performance, meaning the model effectively distinguishes between normal and abnormal belt wear conditions without error. The ROC curves, which plot the TPR against the FPR, confirm the model’s high sensitivity and specificity, demonstrating the Random Forest classifier’s exceptional capability for accurately monitoring belt wear conditions.
Random forest consistently outperformed the other models, including KNN, logistic regression, and Naive Bayes, despite needing further validation across all key metrics. Its ensemble approach and ability to model nonlinear relationships between belt tension, speed, and wafer material make it highly effective for fault detection. The balanced performance of the RF model in terms of accuracy, precision, recall, and F1 scores (as seen in earlier evaluations) further justifies its selection as an optimized model for this system. This robust performance ensures the model can accurately predict the Z-axis belt wear under various operational conditions, which is crucial for predictive maintenance in WTRs. By choosing a random forest as the final model, the system benefits from reliable early detection of belt wear, which helps prevent costly downtime and ensures efficient operation. Nevertheless, given the risk of overfitting, additional steps such as cross-validation with different datasets and real-world testing are recommended to confirm the generalizability and robustness of the model in diverse operating environments. This validation is essential to ensuring the long-term reliability and capacity of the system to handle new, unseen data in real-world scenarios effectively.

6 Conclusions

In conclusion, this study explored the application of ML to predictive maintenance in WTR, focusing on detecting Z-axis belt-tension wear. The robots’ high-frequency acceleration sensor data (local X, Y, Z, and global Z axes) were utilized to create a data-driven approach that improved traditional fault-detection methods by offering early and accurate predictions of belt degradation. Various machine-learning classifiers, including logistic regression, naive Bayes, random forest, and KNN, have been implemented and evaluated. Among these, RF, decision tree, and KNN exhibited superior performances, with RF achieving an impressive accuracy of 99.8%, demonstrating its capability to outperform conventional techniques. This high accuracy establishes Random Forest as this study’s most optimized model for predicting belt wear. The ROC (Receiver Operating Characteristic) curve for the Random Forest model displays an impressive 100%, indicating the model’s exceptional ability to distinguish between different levels of degradation in the Z-axis belt wear. This perfect score reflects Random Forest’s effectiveness in accurately classifying belt conditions, making it a powerful tool for early detection and maintenance scheduling in real-world industrial applications. This study confirmed the value of ML in proactive maintenance, helping to identify early signs of wear before they result in costly downtime. By applying ML to WTR systems, this study emphasizes the importance of early detection in optimizing system uptime and reducing maintenance costs. Moreover, integrating wireless data acquisition enhances the system’s flexibility, reducing manual intervention and operational overhead. This study provides a strong foundation for future advancements, suggesting that ML models can be extended to monitor other critical components within WTR systems. Future research should focus on real-world deployment and cross-validation techniques to ensure these models generalize well across different environments. This study is foundational for using predictive maintenance to extend the lifespan of industrial robots, particularly in semiconductor manufacturing, where minimizing downtime is crucial. These findings underscore the broad potential of data-driven fault detection for improving reliability engineering across industries.

Acknowledgments

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT, RS-2023-00240530, A study on robot machining quality improvement using auxetic structure spindle holder and active chatter vibration control) and This work was also partly supported by the GRRC program of Gyeonggi province [(GRRC TUKorea2023-B02), Development of docking system and process technology for 3D printing post-processing automation].

Fig. 1
Overview of the proposed research
ijpem-st-2024-00157f1.jpg
Fig. 2
NT series Kawasaki wafer transfer robot (NT420)
ijpem-st-2024-00157f2.jpg
Fig. 3
Z-axis belt tension measurement method
ijpem-st-2024-00157f3.jpg
Fig. 4
Data acquisition system of the proposed research
ijpem-st-2024-00157f4.jpg
Fig. 5
Global Z-axis amplitude variation across belt tension classes at 50, 70, and 100% speeds
ijpem-st-2024-00157f5.jpg
Fig. 6
Global Z-Axis amplitude comparison between silicon (Si) and aluminum (Al) wafers under various belt tensions
ijpem-st-2024-00157f6.jpg
Fig. 7
Train test accuracy of various ML classifiers
ijpem-st-2024-00157f7.jpg
Fig. 8
ROC curves for Random Forest classifier
ijpem-st-2024-00157f8.jpg
Table 1
Specifications of the robot used in this study
Model Kawasaki: NT420
Basic structure Horizontal articulated type
Motion range θ1 axis (rotation) (°) 340
Z axis (up-down) (mm) 400
θ2 axis (rotation) (°) 340
H1 axis (rotation) (°) 340
H2 axis (rotation) (°) 340
Position repeatability (mm) ±0.1
Maximum reach (mm) 1,280
Table 2
Description of the experimental conditions
Test No. Belt tensions Fault type Velocity (%) Load: Wafer (300 mm)
Test 1 350 Hz Normal condition 50, 70, 100 Si (125 g), Al (135 g)
Test 2 300 Hz Abnormal condition 1
Test 3 280 Hz Abnormal condition 2
Test 4 230 Hz Abnormal condition 3
Table 3
Specification of the acceleration sensor
Parameter name Descriptions/Values
Type of data XYZ-axis of local and Z-axis global acceleration data
Real-time data processing 1,000 Hz
Communication speed 921,600 bps (bits per second)
Communication type Wireless
Table 4
Evaluation of different ML classifiers
ML classifier Accuracy (%) Sensitivity (%) Specificity (%) F1-score (%)
Logistic regression 84.19 85.76 94.53 84.23
Decision tree 99.79 99.81 99.93 99.79
Random forest 99.80 99.82 99.93 99.80
KNN 99.78 99.80 99.93 99.80
Naive bayes 67.78 70.94 88.84 66.86

References

1. Lee, I., Park, H. J., Jang, J.-W., Kim, C.-W. & Choi, J.-H. (2023). System-level fault diagnosis for an industrial wafer transfer robot with multi-component failure modes. Applied Sciences, 13(18), 10243.
crossref
2. Yoo, J.-H., Park, Y.-K. & Han, S.-S. (2022). Predictive maintenance system for wafer transport robot using k-means algorithm and neural network model. Electronics, 11(9), 1324.
crossref
3. Park, K.-B., & Lee, J. Y. (2022). SwinE-Net: hybrid deep learning approach to novel polyp segmentation using convolutional neural network and Swin transformer. Journal of Computational Design and Engineering, 9(2), 616–632.
crossref pdf
4. Son, Y. H., Kim, G.-Y., Kim, H. C., Jun, C. & Noh, S. D. (2022). Past, present, and future research of digital twin for smart manufacturing. Journal of Computational Design and Engineering, 9(1), 1–23.
crossref pdf
5. Shin, J.-H., & Jun, H.-B. (2015). On condition based maintenance policy. Journal of Computational Design and Engineering, 2(2), 119–127.
crossref pdf
6. Tyagi, S., & Panigrahi, S. K. (2017). An improved envelope detection method using particle swarm optimization for rolling element bearing fault diagnosis. Journal of Computational Design and Engineering, 4(4), 305–317.

7. Oh, H., Lee, Y., Lee, J., Joo, C. & Lee, C. (2022). Feature selection algorithm based on density and distance for fault diagnosis applied to a roll-to-roll manufacturing system. Journal of Computational Design and Engineering, 9(2), 805–825.
crossref pdf
8. Xiong, G., Yuan, X., Mohamed, A. W., Chen, J. & Zhang, J. (2022). Improved binary gaining–sharing knowledge-based algorithm with mutation for fault section location in distribution networks. Journal of Computational Design and Engineering, 9(2), 393–405.
crossref pdf
9. Lee, H., Son, S., Jeong, D., Sun, K. H., Jeon, B. C. & Oh, K. Y. (2024). A finite element model of an electric motor with an unbalanced rotor for vibration data generation. International Journal of Precision Engineering and Manufacturing, 2(1), 47–56.
crossref pdf
10. Cho, S., Seo, H.-R., Lee, G., Choi, S.-K. & Choi, H.-J. (2023). A rapid learning model based on selected frequency range spectral subtraction for the data-driven fault diagnosis of manufacturing systems. International Journal of Precision Engineering and Manufacturing, 1(1), 49–62.
crossref pdf
11. Kim, B.-S., & Ro, S.-K. (2024). Development of a real-time anomaly detection system for dry vacuum pumps using low-cost IoT devices and machine learning. International Journal of Precision Engineering and Manufacturing, 25, 2133–2143.
crossref pdf
12. Liu, Y., Wang, J. & Shen, Y. (2020). Research on verification of sensor fault diagnosis based on BP neural network. Proceedings of the 2020 11th International Conference on Prognostics and System Health Management (PHM-2020 Jinan) (pp. 456–460.
crossref
13. Zhan, H., (2010). Application of rough set and support vector machine in fault diagnosis of power electronic circuit. Proceedings of the 2010 2nd IEEE International Conference on Information Management and Engineering (pp. 289–292.
crossref
14. Yang, S., Li, W. & Wang, C. (2008). The intelligent fault diagnosis of wind turbine gearbox based on artificial neural network. 2008 International Conference on Condition Monitoring and Diagnosis (pp. 1327–1330.

15. Lu, K., Chen, C., Wang, T., Cheng, L. & Qin, J. (2022). Fault diagnosis of industrial robot based on dual-module attention convolutional neural network. Autonomous Intelligent Systems, 2(12).
crossref pdf
16. Kim, B., Park, W., Kim, K. & Kim, H.-Y. (2023). CNN-based classification of the laser assembly process for ultra-small batteries. Journal of Mechanical Science and Technology, 37, 6181–6192.
crossref pdf
17. Kim, E., Yun, H., Araujo, O. C. & Jun, M. B. G. (2023). Sound recognition based on convolutional neural network for real-time cutting state monitoring of tube cutting machine. International Journal of Precision Engineering and Manufacturing, 1(1), 1–18.
crossref pdf
18. Cheng, W., Zhang, Y., Gao, X., Pocorni, J. K. & Jiang, X. (2024). A low-cost laser welding monitoring framework based on depth-wise separable convolution with photoelectric signals. International Journal of Precision Engineering and Manufacturing (pp. 1–15.
crossref pdf
19. Xu, Y., Li, Z., Wang, S., Li, W., Sarkodie-Gyan, T. & Feng, S. (2020). A hybrid deep-learning model for fault diagnosis of rolling bearings. Measurement, 164, 108502.
crossref
20. Zhang, S., He, W. & Huang, D. (2016). Active vibration control for a flexible string system with input backlash. IET Control Theory & Applications, 10(7), 800–805.
crossref pdf
21. Kim, Y., & Kim, Y.-K. (2023). Time-frequency multi-domain 1D convolutional neural network with channel-spatial attention for noise-robust bearing fault diagnosis. Sensors, 23(23), 9311..
crossref pmid pmc
22. Jaber, A. A., & Bicker, R. (2016). Industrial robot backlash fault diagnosis based on discrete wavelet transform and artificial neural network. American Journal of Mechanical Engineering, 4(1), 21–31.

23. Vernekar, K., Kumar, H. & Gangadharan, K. (2014). Gear fault detection using vibration analysis and continuous wavelet transform. Procedia Materials Science, 5, 1846–1852.
crossref
24. Qu, Y., Van Hecke, B., He, D., Yoon, J., Bechhoefer, E. & Zhu, J. (2013). Gearbox fault diagnostics using AE sensors with low sampling rate. Journal of Acoustic Emission, 31(1).

25. Raouf, I., Lee, H. & Kim, H. S. (2022). Mechanical fault detection based on machine learning for robotic RV reducer using electrical current signature analysis: A data-driven approach. Journal of Computational Design and Engineering, 9(2), 417–433.
crossref pdf
26. Goto, H., Yoshida, M., Inoue, M., Arita, T. & Shibata, T. (2012). NT & NV series advanced semiconductor-transferring robot achieving both high speed and ease of use. Kawasaki Technical Review, 172, 25–32.

27. Khalid, S., Lim, W., Kim, H. S., Oh, Y. T., Youn, B. D., Kim, H.-S. & Bae, Y.-C. (2020). Intelligent steam power plant boiler waterwall tube leakage detection via machine learning-based optimal sensor selection. Sensors, 20(21), 6356.
crossref pmid pmc
28. Bishop, C. M. (2009). Pattern Recognition and Machine Learning, Springer.

29. Pal, M., (2005). Random forest classifier for remote sensing classification. International Journal of Remote Sensing, 26(1), 217–222.
crossref
30. Breiman, L.Friedman, J.Olshen, R. A. & Stone, C. J. (1984). Classification and regression trees, Wadsworth International Group.

31. Uddin, M. G., Nash, S., Rahman, A. & Olbert, A. I. (2023). Performance analysis of the water quality index model for predicting water state using machine learning techniques. Process Safety and Environmental Protection, 169, 808–828.
crossref

Biography

ijpem-st-2024-00157i1.jpg
MD Saiful Islam is a Ph.D. student in IT Semiconductor Convergence Engineering at the Tech University of Korea. He earned his B.S. degree in Mechanical Engineering from Hanyang University, Korea, in 2021 and his M.S. degree in Mechatronics Engineering from the Tech University of Korea in 2023. His research interests encompass additive manufacturing, fault diagnosis, prognostics and health management (PHM), smart factories, data-driven methodologies, artificial intelligence (AI), and sustainable manufacturing.

Biography

ijpem-st-2024-00157i2.jpg
Young Kyoung Ji is an IT Semiconductor Convergence Engineering graduate student at the Tech University of Korea. He received his B.S. degree in Electrical Engineering from the Tech University of Korea in 2023. His research interests include semiconductors, prognostics and health management (PHM), fault diagnosis, smart factories, and artificial intelligence (AI). Kihyun Kim received B.S., M.S., and Ph.D. degrees in mechanical engineering from the Korea Advanced Institute of Science and Technology (KAIST), South Korea, in 1999, 2001, and 2006, respectively. He is a professor in the department of mechatronics engineering, Tech University of Korea. His current research interests include the design, control, and fault-diagnosis of a high-performance mechatronics system, e.g., Semiconductor-Display manufacturing equipment.

Biography

ijpem-st-2024-00157i3.jpg
Dr. Hyo-Young Kim is Professor of Mechatronics Engineering at Tech University of Korea. He received his B.S. in Mechanical Engineering in 2007 from the University of Hanyang University, Korea, and M.S. and Ph.D. degrees in Mechanical Engineering from the KAIST, in 2009 and 2013. He worked as a principal researcher at KITECH(Korea Institute of Industrial Technology) until 2021. Since 2022, he has been a professor of mechatronics engineering at Tech University of Korea. His research focuses on the fields of precision mechatronics systems and AI robot manufacturing systems. He is currently work on Precision stage and active vibration control system at the semicondouctor manufacturing system

Biography

ijpem-st-2024-00157i4.jpg
Dr. Hyo-Young Kim is Professor of Mechatronics Engineering at Tech University of Korea. He received his B.S. in Mechanical Engineering in 2007 from the University of Hanyang University, Korea, and M.S. and Ph.D. degrees in Mechanical Engineering from the KAIST, in 2009 and 2013. He worked as a principal researcher at KITECH(Korea Institute of Industrial Technology) until 2021. Since 2022, he has been a professor of mechatronics engineering at Tech University of Korea. His research focuses on the fields of precision mechatronics systems and AI robot manufacturing systems. He is currently work on Precision stage and active vibration control system at the semicondouctor manufacturing system
Editorial Office
12F, SKY1004 bldg., 50-1, Jungnim-ro, Jung-gu, Seoul 04508, Republic of Korea
TEL : +82-2-518-2928   E-mail : ijpem.st@kspe.or.kr
Developed in M2PI
Copyright © Korean Society for Precision Engineering.
Close layer
prev next