Int. J. Precis. Eng. Manuf.-Smart Tech. > Volume 3(1); 2025 > Article
Kim, Lee, Yuk, and Park: Real-time Defect Detection in Wire-laser Directed Energy Deposition Process Using U-Net-based Semantic Segmentation

Abstract

This study introduces a real-time monitoring system for wire-laser directed energy deposition (W-LDED) process, utilizing a U-Net-based semantic segmentation model. The system accurately identifies critical defective features in the process, such as the residual heat affected zone (HAZ) and dripping defects, at the pixel level. A dataset was collected using a high dynamic range camera, and the U-Net was trained for pixel-wise classification of process images. By fine-tuning hyperparameters of model and applying data augmentation, the segmentation performance was enhanced, enabling the precise extraction of positions and boundaries of distinct regions in real-time process images. Additionally, a pixel-based morphology measurement algorithm was developed to quantify the length of the residual HAZ and the area of the dripping defect. This system provides in-depth insights into the process and contributes to improved monitoring and control of key parameters, which ultimately enhances process stability and bead quality. The results suggest that this monitoring system could be integrated into future automated control frameworks for the W-LDED process, thereby enhancing productivity in the metal additive manufacturing industry.

1. Introduction

Wire-Laser Directed Energy Deposition (W-LDED) is a metal additive manufacturing (AM) process that utilizes a laser heat source to deposit wire-shaped feedstock material [1]. This process has gained substantial attention in the manufacturing sector due to its high production efficiency and ability to produce parts with various functional metallic materials. The laser, with its high energy density, enables precise and rapid operations. In comparison to powder-based feedstock, wire feedstock presents several advantages, such as ease of storage and handling, lower cost, and a higher deposition rate. These characteristics make W-LDED particularly attractive for a wide range of industrial applications where high deposition rates and material efficiency are critical [2,3].
The temperature of the melt pool directly influences the surface tension of the laser-molten material, which in turn determines the bead shape. Additionally, the cooling rate affects the grain size within the bead, making the melt pool temperature and cooling rate critical factors that govern both bead geometry and material properties. Key process parameters, such as laser power, wire feed rate, and print speed, play significant roles in shaping the bead and preventing defects like dripping and stubbing [4]. Variations in process parameters, along with heat dissipation conditions during deposition, can significantly affect bead quality and lead to process instability.
While numerical analysis has significantly contributed to understanding the physical phenomena underlying these processes [5,6], it remains insufficient for actively adapting to dynamic conditions during operation. This limitation highlights the necessity of vision-based process monitoring systems capable of managing these variables in real-time to mitigate potential instabilities. To overcome these challenges, research has focused on real-time process monitoring and optimization. Recent studies have explored innovative approaches to enhance bead quality and mitigate process instabilities, with deep learning techniques emerging as a powerful tool for analyzing image data obtained from process monitoring systems [79].
Vision-based monitoring techniques have been applied to various industrial sectors owing to the notable advantages, such as low cost, high resolution, and the capability to capture a high frames-per-second (FPS) rate. When integrated with deep learning models, they offer powerful tools for data processing, classification, and prediction of process-induced defects. Previous studies have employed convolutional neural networks (CNN) to classify defective bead formations in the wire deposition process [10]. More advanced CNN-based models have been applied to a variety of data types. For instance, convolutional long short-term memory, which combines CNN and LSTM (long short-term memory), has been used to predict future frames based on the current state, enabling real-time classification of abnormal melt pool images and subsequent defect detection [11]. Multi-modal deep learning models have also been explored, combining melt pool images with temperature data and process parameters to predict bead geometry or to classify the intersection point between the laser beam and the wire to the top layer distance, thus monitoring the state of the process [7].
Semantic segmentation, a method for pixel-wise object segmentation in images, can be applied to detect abnormalities in feature images. Unlike simple anomaly detection or defect classification, semantic segmentation can extract detailed information regarding the type, location, and shape of objects in real-time. Although CNN-based methods have been extensively applied to welding processes to identify melt pool states [1214], the application of semantic segmentation to AM processes is still in its early stages. Real-time monitoring of AM processes using semantic segmentation is particularly valuable when it effectively captures defect occurrences and the geometrical features of melt pools, such as size, type, and shape, which are crucial for correlating with process parameters and defects. However, comprehensive studies addressing these aspects are still relatively scarce.
In this study, we developed a real-time monitoring system capable of detecting defects and continuously analyzing bead formation during the process. Dripping defect occurs when molten metal accumulates at the tip of the wire, resulting in irregular bead deposition, which negatively affects the quality of final product. Detecting such defects in real-time is crucial for maintaining process consistency. Additionally, variations in laser power and layer height influence the cooling behavior of the bead immediately after deposition, which can lead to uneven mechanical properties in the part. To address this, we defined the light emission area of deposited bead with thermal radiation as a residual heat-affected zone (HAZ) and analyzed its geometric characteristics. We applied a U-Net-based semantic segmentation model to identify the feature regions within the process images during bead deposition. The model performance was optimized through hyperparameter tuning and data augmentation, thereby effectively achieving the real-time classification of both the residual HAZ and the dripping defect. Moreover, by applying pixel-based measurement techniques, we quantified the length of residual HAZ and the area of dripping defect from the segmented images, enabling a more detailed analysis for optimal bead deposition. The developed real-time process monitoring technique is expected to serve as an effective tool for quality control and productivity enhancement, especially when integrated with in-situ process control of the W-LDED process.

2 Method and Materials

2.1 W-LDED System and Experimental Setup

A custom W-LDED system (Klabs, South Korea) was used in this research. The system employed a continuous wave fiber laser (RFL-C1500, Raycus, China) with a wavelength of 1,080 nm and a maximum power of 1,500 W. The laser module was equipped with an irradiation head, allowing the laser beam to be projected in a straight line 3 mm in length, perpendicular to the printing direction. The operation and movement of the laser module, along with wire feeding, were controlled via Gcode. MGC-308 wire (Chosun Welding, diameter 1 mm, South Korea), commonly used for welding 304 stainless steel, The operation and movement of the laser module, along with wire feeding, were controlled via G-code. MGC-308 wire (Chosun Welding, diameter 1 mm, South Korea), commonly used for welding 304 stainless steel, was fed at a 45° angle from the front relative to the printing direction onto a stainless-steel substrate. The process parameters used in this experiment were a laser power of 1,140 W, a print speed of 20 mm/s, and a wire feed rate of 12 mm/s. The deposition width for bead formation was approximately 3 mm, corresponding to the laser beam diameter, while the height of each deposited layer was maintained at 0.25 mm. After each layer was deposited, the system incrementally moved 0.25 mm along the z-axis direction to prepare for the deposition of subsequent layer. The toolpath for each layer followed a straight trajectory with a length of 50 mm, producing consistent bead geometry. The dimensions and uniformity of the deposited beads were monitored to ensure repeatability under the given conditions.

2.2 Vision System Setup for Real-time Monitoring of W-LDED Process

To capture images of the W-LDED process, an off-axis high dynamic range (HDR) welding camera (XVC-1100e, Xiris, Canada) was installed to move in sync with the laser module. The camera was mounted to the laser module and positioned perpendicular to the printing direction, with a distance of 300 mm between the camera and the deposition area, to monitor the shape of melt pool and bead from a side view, recording at a resolution of 1,280 × 724 and 35 fps. To intuitively observe the visible light emitted from the heated region and distinguish the boundaries of the residual HAZ, the built-in lamp of the camera illuminated the deposited bead to ensure high-quality images even when the laser was not operating. The recorded images were captured frame by frame using Python code and used to create training data.

2.3 Dataset Preparation for U-Net Model Training

For the training of the U-Net model, a dataset was assembled, consisting of original images paired with corresponding annotated ground truth images. In this study, regions of interest (ROIs) that encompassed the bead, residual HAZ, and wire were extracted from the original 1,280 × 724 pixel images, and these ROIs were resized to 256 × 256 pixels. Ground truth images were prepared using the LabelMe annotation tool, where pixel-level manual classification was applied to distinguish between the two categories. The resulting annotations were saved in JSON format and later converted into single-channel label images, which served as the ground truth for training. The dataset comprised a total of 335 original and annotated image pairs, including both normal and defective states. From this dataset, 235 images were used for training, 50 for validation, and the remaining 50 for model evaluation. The training process was conducted in a Python 3.10 environment using PyTorch version 2.5.1, with computations performed on an Nvidia GTX 3070 Ti GPU equipped with 8 GB of GDDR6X memory, running on CUDA 11.8.

2.4 Architecture of the U-Net Model

The U-Net model in this study operates on input images with a resolution of 256 × 256 pixels. The encoder (contracting path) consists of four downsampling layers, each applying a 3 × 3 convolution followed by an exponential linear unit (ELU) activation, and then a 2 × 2 max-pooling operation, halving the spatial dimensions at each step. As the spatial resolution decreases from 256 × 256 to 16 × 16 pixels, the number of feature channels doubles at each level, reaching 256 at the bottleneck. The decoder (expansive path) utilizes 2 × 2 transposed convolutions to upsample the feature maps, progressively restoring the resolution back to 256 × 256 pixels. Skip connections between corresponding layers in the encoder and decoder paths help recover fine-grained spatial information. The final layer employs a 1 × 1 convolution to produce the output segmentation map of size 256 × 256 × 3.

2.5 Performance Evaluation

To assess the segmentation performance, several evaluation metrics were employed: IoU, mIoU, Recall, Precision, and F1 Score. These metrics were calculated for the two target classes, residual HAZ and dripping defects. The IoU for each class quantifies the overlap between the predicted and ground truth regions, calculated as the ratio of true positives (TP) to the sum of true positives, false positives (FP), and false negatives (FN):
IoU=TPTP+FP+FN
During training, mIoU on the validation set was used to evaluate the model’s performance, ensuring checkpoints were saved at the highest-performing state for accurate segmentation of residual HAZ and dripping defects.
Recall, measuring the model’s ability to identify true positives, was calculated as:
Recall=TPTP+FN
Precision, evaluating the proportion of correctly predicted positives, was calculated as:
Precision=TPTP+FP
The F1 Score, which balances Recall and Precision, was defined as:
F1   score=2·Precsion·RecallPrecision+Recall
These four performance metrics were used to evaluate the segmentation accuracy on the test data.

3 Results and Discussion

3.1 Framework of in-situ Real-Time Monitoring System for W-LDED

Fig. 1(a) shows a conceptual schematic illustration of the proposed in-situ monitoring system, designed to maintain quality during the W-LDED process. A side-view camera captures the melt pool of the current deposition layer. As the laser focuses on and melts the wire feedstock, visible light is emitted from the heated area due to thermal radiation [15]. The visible light emitted during the process, as described by Wien’s displacement law, reflects the relationship between temperature and the peak wavelength of thermal radiation. Steel initially emits red light at relatively low temperatures, transitioning to orange and eventually to white as the temperature increases [16]. Typically, steel emits red light at temperatures around 500°C, while bright regions are associated with temperatures exceeding 1,100°C [17]. The color of thermal radiation may vary depending on the metal’s emissivity and surface condition. The residual HAZ refers to regions with temperatures above 500°C, and its size reflects the cooling rate during the process, with larger zones indicating slower cooling. Since the cooling rate is a critical factor in microstructure formation, achieving uniform cooling is important for ensuring consistent mechanical properties in the final product [18]. Therefore, the size and shape of the residual HAZ serve as indicators of process stability. By comparing these features across images, the system can reliably evaluate process performance.
Additionally, defects like dripping, primarily caused by improper process parameters such as an excessively high laser power, an inadequate wire feed rate, or an overly large distance between the wire tip and the substrate, occur when excessive melting of the wire leads to molten droplets detaching from the tip [19]. These detached droplets, which are typically spherical due to surface tension minimizing their energy state, accumulate on the substrate or surrounding areas, disrupting bead formation and compromising the uniformity of the deposition. Such defects can create irregularities that propagate through subsequent layers and degrade process stability, making it crucial to monitor them closely throughout the operation (Fig. 1(c)) [19]. To achieve this, the system employed a U-Net semantic segmentation model for pixel-wise classification of the residual HAZ and defects in the images, enabling the extraction of specific features, including their geometry, size, and area (Fig. 1(d)).

3.2 Characterization of Defects in W-LDED

To develop an effective real-time monitoring system, it is essential to thoroughly characterize the defects occurring in the W-LDED process. The shape and size of these defects are related to the specific requirements for the monitoring system. Fig. 2 shows the printing process and defects caused by heat accumulation during the formation of the first layer and the deposition of subsequent layers. During the printing of the first layer, effective heat conduction to the substrate is achieved, resulting in a shorter length of the residual HAZ (Fig. 2(a)). However, as additional layers are deposited, heat conduction becomes less effective because the underlying material traps heat and limits its dissipation to the substrate. This leads to localized heat accumulation near the deposition area, slowing the cooling rate and enlarging the residual HAZ (Fig. 2(b)). Excessive heat accumulation reduces the cooling rate, prolonging the molten state of the material and leading to instability in the melt pool [20]. This instability amplifies minor irregularities in the previously deposited layers, causing geometric distortions that propagate through subsequent layers. Consequently, this cumulative effect results in uneven surface formation and compromises the overall stability and quality of the deposition process (Fig. 2(c)).
Fig. 3 shows the process of stable bead formation and the occurrence of dripping defects. The liquid bridge forms as the molten wire and melt pool merge under surface tension, facilitating continuous material transfer during the deposition process [21]. The stability of this bridge is critical for producing uniform and well-formed beads, as shown in Figs. 3(a) and 3(b). When the bridge becomes unstable, primarily caused by an excessively large distance between the deposition tool and substrate or an insufficient wire feed rate, it weakens and eventually breaks, causing the molten wire to form defective droplets (Fig 3(c)). Once these droplets exceed a certain size or touch the substrate, they result in dripping defects, potentially causing serious process failures (Fig. 3(d)).

3.3 Development of U-Net-Based Monitoring System for W-LDED

Defects and uneven residual HAZs occurring during the W-LDED process hinder the production of uniform structures. Therefore, it is necessary to develop a system capable of addressing these issues, and a monitoring system utilizing semantic segmentation has been developed. Semantic segmentation is an image-based deep learning technique that segments various objects in an image at the pixel level, providing information about the size and location of each object. To date, various models such as FCN, U-Net, and DeepLab have been developed. Among them, U-Net is widely used across different fields due to its simple architecture, which allows for fast computation in image processing [22].
As the name suggests, U-Net has a U-shaped structure, with the encoder and decoder symmetrically connected (Fig. 4(a)). To train this model, both the original image and the ground truth image are required. For this, a labeling process was conducted as shown in Fig. 4(b), where the boundary information of the residual HAZ and dripping defect was saved and assigned the colors white and red, respectively. Once the training data were prepared, the original image is input into the encoder, where features are extracted by reducing the resolution and increasing the dimensionality through convolution and pooling processes. Then, in the decoder, these extracted features are used to restore the resolution using up-sampling and transposed convolution, generating the final segmentation map. However, low-resolution images may lack sufficient information for precise restoration. To address this, the model applies a skip connection technique, which transfers the information extracted at each stage of the encoder to the corresponding decoder stage at the same resolution, enabling more precise segmentation [23].
Through this process, the output image displays each pixel in the color corresponding to the class with the highest probability. The model weights are then adjusted to maximize the alignment between the output image and the ground truth image, and this process is repeated to create a model with optimized weights. Once the model is well-trained, new process images can be input, and the pixels corresponding to the residual HAZ and dripping defect will be accurately colored. Additionally, data augmentation and hyperparameter tuning were performed to achieve the best-performing model.
It is common to utilize data augmentation to improve the learning performance of U-Net [24]. As shown in Fig. 5(b), data augmentation primarily consists of physical transformations such as brightness adjustment, blurring, rotation, and flipping. These techniques increase the size of the training dataset, allowing for more efficient learning even with limited data. In this study, 285 images, excluding the test set, were augmented fivefold, resulting in 1,425 images for training and validation. Each augmentation technique was applied randomly to enhance dataset diversity. The augmented dataset was divided into training and validation sets at a 5:1 ratio to ensure a sufficient number of samples for evaluation. However, since the training data and real-time process images always have the same orientation, there were concerns that transformations like rotation and flipping might degrade the model’s performance. To test this hypothesis, an experiment compared augmented data with and without rotation and flipping.
The results of the model trained with rotation and flipping are shown in Fig. 5(a). As seen in the first and second images, the model struggled to distinguish between the high-energy flame and the residual HAZ. Similarly, in the case of the dripping defect, there were instances where part of the residual HAZ was misclassified, and only portions of the defect were detected. In contrast, when rotation and flipping were excluded from the data augmentation (Fig. 5(c)), the model performance to differentiate between the flame and residual HAZ improved, as did its accuracy in detecting the dripping defect.
Table 1 highlights the impact of data augmentation on model performance. Without data augmentation, all performance metrics declined, with precision particularly low, indicating an excessive number of false positive predictions. Excluding rotation and flipping from the augmentation process significantly improved most metrics, with precision showing the most substantial increase. This suggests that rotation and flipping increased prediction errors, reducing the model’s accuracy and generalization capability. While U-Net, originally developed for cell segmentation, typically benefits from augmentation techniques like rotation and flipping to recognize objects at various angles, these techniques appear to have hindered performance in this study. As mentioned above, this is likely because the orientation of the training data and process images remained consistent throughout, making such augmentations unnecessary and even detrimental to the accuracy of the model.
To find the optimal performance based on the prepared dataset, hyperparameter tuning was conducted. By adjusting the learning rate and batch size, trained models were obtained, and their mIoU, Recall, Precision, and F1 score values were calculated using test data to compare the results. First, experiments were carried out to determine the appropriate learning rate using a fixed batch size of 4 and the stochastic gradient descent (SGD) optimizer. It is generally accepted that a low learning rate slows down convergence to optimal weights, while a high learning rate increases the risk of overfitting [25]. Experiments were conducted using three learning rates: 0.1, 0.01, and 0.001, yielding results consistent with theoretical expectations. Figs. 6(a) and 6(b) show graphs of the mIoU for the validation data, organized by epoch. IoU values were calculated for each object, and the averages of these values were used to derive the mIoU.
With a learning rate of 0.001, the training process was excessively slow, leading to suboptimal results with an mIoU of 65.4% and a significantly low Precision of 44.5%. Conversely, a learning rate of 0.1 resulted in higher mIoU (79.2%) and F1 Score (87.9%), but its Recall (93.4%) was relatively lower, indicating a tendency to miss some true positives. In contrast, when the learning rate was set to 0.01, the model achieved a well-balanced performance across all metrics, with an mIoU of 77.8, Recall of 98.6, Precision of 74.0, and F1 Score of 87.2%. These results suggest that a learning rate of 0.01 provides the optimal balance between convergence speed and segmentation accuracy, making it the most suitable choice for the given dataset and task.
To evaluate the impact of optimization functions on model performance, batch size tuning experiments were conducted for both SGD and adaptive moment estimation (Adam) optimizers to determine their optimal configurations. SGD, a traditional optimizer, updates model parameters by computing the gradient of the loss function with respect to the parameters and applying the update rule [26]:
θt+1=θt-ηL(θt)
where θt represents the model parameters at iteration t, η is the learning rate, and ∇L(θt) is the gradient of the loss function. While SGD is known for its stability and consistency in simpler datasets, it may require careful learning rate tuning to avoid convergence issues.
Adam, on the other hand, combines the benefits of SGD with momentum and adaptive learning rates. It uses moving averages of the gradient (mt) and squared gradient (vt) to update the parameters:
θt+1=θt-ηm^tv^t+ɛ
where mt and vt are estimates of the first and second moments of the gradient, respectively, and ɛ is a small constant to avoid division by zero. Adam’s dynamic adjustment of learning rates allows for faster convergence, particularly in complex datasets, but may introduce variability in well-defined tasks [27].
Training was conducted under various batch size conditions, and the model’s performance was assessed using four key metrics: mIoU, Recall, Precision, and F1 Score (Table 3). SGD achieved its highest performance with a batch size of 4, yielding an mIoU of 77.8%, Recall of 98.6%, Precision of 74.0%, and an F1 Score of 87.2%. Adam performed best with a batch size of 16, achieving an mIoU of 72.8%, Recall of 91.5%, Precision of 70.5%, and an F1 Score of 84.4%.
The results highlight that SGD consistently outperformed Adam, particularly in mIoU and Recall, which are crucial for accurate segmentation. This superior performance can be attributed to the simplicity of the dataset, where each image contains only one of two distinct classes with significant shape differences. This structure allows SGD to achieve stable and efficient convergence, effectively capturing the underlying data distribution. While Adam’s dynamic learning rate adjustment is beneficial in more complex or noisy datasets, it may introduce unnecessary variability when applied to datasets with well-defined class distinctions and minimal noise. These findings emphasize the effectiveness of SGD for this specific segmentation task and dataset configuration.
As a final step to validate the suitability of U-Net for the proposed monitoring system, it was compared with Unet 3+, a segmentation model that builds upon the original U-Net architecture with advanced feature extraction and multi-scale feature fusion. Unet 3+ employs densely connected convolutional layers and adaptive decoders, which enhance its ability to capture both global context and fine-grained details [28]. These structural improvements enable Unet 3+ to achieve higher segmentation accuracy, particularly in complex regions like the residual HAZ and dripping defects, making it an appealing choice for precision-demanding applications.
Table 4 presents a detailed comparison of the two models in terms of segmentation performance and latency. Unet 3+ demonstrated a higher mIoU of 81.1, surpassing U-Net’s 77.8%. While this improved segmentation accuracy reflects Unet 3+’s advanced feature representation capabilities, it came at the cost of increased computational latency, with Unet 3+ requiring 7.68 ms per image compared to U-Net’s 2.69 ms. Considering the 35 fps used in this study, equivalent to approximately 28.6 milliseconds per frame, the 5 ms latency difference between the two models becomes significant, particularly when accounting for the additional computational load from the shape measurement algorithm and potential future modules for real-time process parameter adjustments. This latency gap could widen further depending on the computing hardware available, emphasizing the importance of model efficiency in real-time systems.
By refining the data augmentation strategy and adjusting the hyperparameters, optimal model performance was achieved, resulting in predicted images such as those shown in Figs. 7 and 8. Fig. 7 presents prediction results for test images containing the residual HAZ, with the predicted images ranked in descending order based on IoU scores. The model achieved an average IoU of 74.6% across 25 test images containing residual HAZ. In semantic segmentation, an IoU exceeding 50% is generally considered indicative of a well-trained model, suggesting that the model developed in this study achieves competitive performance.
As shown in Fig. 7, the model was able to accurately segment even complex areas such as the residual HAZ, which can be attributed to the optimal performance achieved through the selectively refined data augmentation strategy and hyperparameter tuning. Additionally, Fig. 8 shows prediction results for the dripping region, with an IoU of 81.0%. This indicates that the model can consistently detect the dripping area with high accuracy, even in varied shapes of defects. These results suggest that the model is not simply overfitting to specific process image data but maintains strong generalization performance with newly acquired data as well.
Edge detection is one of the most basic image segmentation techniques, identifying boundaries in areas where pixel values change abruptly [29]. While it might seem that this method could be used to detect the boundaries of the residual HAZ or dripping defect, it quickly becomes evident that edge detection alone is insufficient. The flame and residual HAZ that occur during the process share similar color regions, making it difficult to achieve accurate separation by relying solely on pixel value differences. Furthermore, shape-based distinctions, such as those in dripping, cannot be adequately captured with edge detection techniques. In contrast, U-Net, with its encoder-decoder architecture and skip connections, analyzes both global and local features, enabling accurate differentiation between visually similar regions, such as the flame and reflections on the substrate [30]. By integrating pixel-level and contextual analysis, U-Net goes beyond pixel-level analysis to deliver a detailed understanding of the residual HAZ and process conditions. Therefore, to achieve the precise results seen in Figs. 7 and 8, more advanced models like U-Net are required. U-Net, by learning the deeper features of an image, goes beyond pixel-level analysis to provide a more detailed understanding of the residual HAZ and process conditions. Additionally, U-Net is capable of quickly detecting unpredictable dripping defects in a real-time manner. These factors make U-Net-based models far more effective and reliable for process monitoring than the conventional edge detection methods.

3.4 Development of Pixel-Based Morphology Measurement Algorithm

By utilizing U-Net, the effective detection of residual HAZ and dripping defects in W-LDED process images has demonstrated the potential of process monitoring technology based on this approach. The ability to accurately capture the boundaries of each object in the process images has enabled more precise analysis. In this study, a pixel-based measurement algorithm was applied to extract detailed information about the shape of each feature from these predicted images.
The length of the residual HAZ varies depending on the laser power and the height of the deposition, making real-time length measurement crucial during the process, as it is a key factor affecting the quality and performance of the process. To address this, a length measurement algorithm based on pixel information was developed in this study. The process image shows that a 1 mm wire diameter consists of 8 pixels, meaning each pixel edge corresponds to 0.125 mm. As shown in Fig. 9(a), the number of pixels between the leftmost and rightmost pixels of the residual HAZ was measured, and this value was multiplied by the pixel edge length of 0.125 mm to calculate the length of the zone.
To eliminate noise caused by elements such as flames, the contour detection function provided by OpenCV was utilized. This function identifies the boundaries of connected regions in an image, enabling precise segmentation of target areas based on pixel characteristics. The algorithm was configured to only consider contours with more than 150 pixels, determined by counting the number of pixels within each contour. This threshold was determined by analyzing the actual residual HAZ sizes in the dataset. Contour detection was applied to all ground truth data, revealing that the smallest residual HAZ contained 178 pixels. To allow for a margin of error and ensure robustness, a threshold of 150 pixels was selected, classifying contours with fewer pixels as noise. This method effectively minimized errors caused by flame-related noise and ensured accurate length measurements.
A similar method was used to measure the area of the dripping defect (Fig. 9(b)). The total number of red pixels in the predicted image is counted, and the area of each pixel is multiplied to calculate the total area of the defect. As with the length measurement method, contour detection was also used to ensure robustness against noise by identifying and only considering regions above a certain size. In the case of the dripping defect, since it is larger than any residual HAZ, this algorithm was directly reflected in the measurement. The prediction of the image and the execution of the morphology measurement algorithm have a combined latency of approximately 5.0±0.8ms per image, ensuring timely feedback. This measurement approach can play an important role in the development of a feedback system. For instance, when a dripping defect occurs, the system can immediately measure its area and provide feedback to prevent the defect from reaching the bed.
An experiment was also conducted to verify whether the developed shape analysis method could accurately measure the length of the residual HAZ. The experiment involved depositing a total of 31 layers of beads under two conditions: one with constant laser power and another with a reduction in laser power every 10 layers. This setup was used to evaluate the ability of the system to detect changes in the length of the residual HAZ during the process. Under the constant laser power condition, the residual HAZ showed a length of approximately 11.9 mm (Fig. 10(a)). As deposition continued to 21 layers, the length of the residual HAZ increased to 14.5 mm, and by 31 layers, it reached 18.4 mm, indicating a gradual increase in length. In contrast, when the laser power was reduced by 100 W every 10 layers, the length of the residual HAZ remained consistent at around 13 mm throughout the process.
In addition, a comparison of the output quality revealed that the uneven beads were formed when the laser power was kept constant (Figs. 11(a)–11(c)). In contrast, when the laser power was gradually reduced, a uniform residual HAZ was formed, and the height of the deposited beads remained consistent. These results demonstrate that adjusting laser power is a key factor in improving the precision and consistency of the process. Furthermore, the cooling time of molten metal would affect the formation of the microstructure, which could influence the mechanical properties. In the field of DED processes, previous studies have reported microstructural changes caused by differences in cooling rates across layer heights [31,32]. It has been demonstrated that rapid cooling in the lower layers results in fine and uniform microstructures, while slower cooling rates lead to coarser microstructures [33]. These microstructural variations are known to directly influence material properties, including mechanical characteristics such as hardness, strength, ductility, and residual stress [34]. Therefore, forming a uniform residual HAZ is crucial for ensuring consistent mechanical performance throughout the product. The monitoring system developed in this study enables real-time detection of variations in the residual HAZ, facilitating the observation of changes under different process parameters. Although this experiment applied a simple method of reducing laser power by 100 W at each stage, future research could leverage this monitoring system to develop an automated control system. A future system will optimize process parameters based on the measured residual HAZ length and dripping defect size. Additionally, to address the limitations of continuously reducing laser power for taller structures, this system could incorporate additional parameters, such as laser scanning speed, into a comprehensive control framework. By dynamically balancing heat input, this approach would ensure consistent bead quality, stable process conditions, and enhanced overall product performance.

4 Conclusion

In this study, we developed a real-time monitoring system for the W-LDED process, utilizing a U-Net semantic segmentation model to detect and analyze defects and the residual HAZ. The system effectively segmented both defective regions and the residual HAZ, providing detailed, pixel-level insights into process stability. Additionally, pixel-based morphology measurements were employed to accurately quantify the length of the residual HAZ and the area of dripping defects, enabling precise analysis of process conditions. Through hyperparameter tuning, we identified optimal conditions that achieved high predictive performance and low latency, ensuring the system’s suitability for real-time applications. Our experiments highlighted the importance of precise process control and monitoring in achieving uniform bead formation and minimizing defects, particularly in relation to thermal management. The proposed model demonstrated consistent segmentation of critical regions throughout the process, indicating its potential for integration into automated feedback systems capable of real-time process adjustments to mitigate defects such as dripping. Future work could explore extending this real-time monitoring system by incorporating active control mechanisms that dynamically adjust process parameters based on real-time segmentation feedback. Further refinement of the model could also improve its performance under varying operational conditions, enhancing the robustness and applicability across a wide range of manufacturing systems.

Acknowledgements

This work was supported by the Korea Basic Science Institute (National Research Facilities Equipment Center) grant funded by the Ministry of Education (No. 2021R1A6C101A449) and National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00213269).

Fig. 1
(a) Schematic illustration of real-time monitoring system for W-LDED. Schematic illustration of (b) residual HAZ and (c) dripping defect. (d) Semantic segmentation of residual HAZ and dripping defect
ijpem-st-2024-00178f1.jpg
Fig. 2
Side-view images of the melt pool during W-LDED. (a) Printing on the substrate with a high cooling rate. (b) Printing on a previously deposited layer with a lower cooling rate under constant laser power. (c) Defective surface caused by heat accumulation. Defects are indicated by arrows below the normal bead line (yellow dashed line)
ijpem-st-2024-00178f2.jpg
Fig. 3
Comparison of stable and defective in W-LDED. (a) Side view of a stable melt pool with proper alignment indicated by arrows. (b) Resulting bead under stable conditions with uniform geometry. (c) Dripping defect in process due to poor wire feeding. (d) Resulting bead showing defects caused by dripping, with noticeable material buildup and irregularities
ijpem-st-2024-00178f3.jpg
Fig. 4
(a) Architecture of U-Net deep learning model used for semantic segmentation, illustrating the convolutional layers, max-pooling, up-convolutions, and final segmentation output and (b) Labeling process for semantic segmentation: Original images depicting the residual HAZ and dripping defect are shown on the left, while the corresponding ground truth images used for training the segmentation model are shown on the right
ijpem-st-2024-00178f4.jpg
Fig. 5
Data augmentation and its effect on semantic segmentation performance. (a) Training results with data augmentation including rotation and flipping, (b) Illustration of different data augmentation techniques, including brightness adjustment, blurring, rotation and flipping, and (c) Training results showing improved segmentation performance after training with an augmented dataset excluding rotation and flipping
ijpem-st-2024-00178f5.jpg
Fig. 6
Training performance of the U-Net model under different hyperparameter configurations, visualized through loss and mIoU curves over 100 epochs. (a) and (b) represent the effect of learning rates (0.1, 0.01, 0.001) on training dynamics using the SGD optimizer with a batch size of 4. (c) and (d) show the impact of varying batch sizes (4, 8, 16) under the SGD optimizer with a fixed learning rate of 0.01. (e) and (f) illustrate the performance for batch sizes (8, 16, 32) using the Adam optimizer with a learning rate of 0.01
ijpem-st-2024-00178f6.jpg
Fig. 7
Comparison of input images, ground truth, and predicted segmentations with their corresponding IoU values for detecting the residual HAZ
ijpem-st-2024-00178f7.jpg
Fig. 8
Comparison of input images, ground truth, and predicted segmentations along with their corresponding IoU values for detecting the dripping defect.
ijpem-st-2024-00178f8.jpg
Fig. 9
Quantitative analysis of residual HAZ length and dripping area during the W-LDED process. (a) Measurement of the length of the residual HAZ: The white pixels corresponding to the residual HAZ are identified, and the total length (L) is calculated by multiplying the number of pixel rows (n) by the pixel width (l). (b) Measurement of the area of the dripping defect: The red pixels representing the dripping defect are segmented, and the total area (A) is calculated as the product of the pixel area (l2) and the number of red pixels (m)
ijpem-st-2024-00178f9.jpg
Fig. 10
Processing images of bead formations with their corresponding U-Net-segmented images under the conditions of (a) constant and (b) controlled laser power over multiple layers
ijpem-st-2024-00178f10.jpg
Fig. 11
Processing images of the layered beads under (a) constant and (b) controlled laser power conditions across multiple layers. The white arrows and the yellow dashed lines indicate the defect formation and the expected reference lines for normal bead layering, respectively
ijpem-st-2024-00178f11.jpg
Table 1
Comparison of performance metrics across different data augmentation strategies
Model Data augmentation mIoU (%) Recall (%) Precision (%) F1 Score (%)
U-Net Without data augmentation 66.2 86.4 57.0 78.4
Including rotation and flipping 76.0 89.8 75.1 85.5
Excluding rotation and flipping 77.8 98.6 87.2 81.9
Table 2
Performance comparison of U-Net under different learning rate configurations
Learning rate mIoU (%) Recall (%) Precision (%) F1 Score (%)
0.1 79.2 93.4 72.5 87.9
0.01 77.8 98.6 74.0 87.2
0.001 65.4 97.8 44.5 77.9
Table 3
Performance comparison of U-Net with SGD and Adam optimizers across different batch sizes
Optimizer Batch size mIoU (%) Recall (%) Precision (%) F1 Score (%)
SGD 4 77.8 98.6 74.0 87.2
8 70.0 98.7 61.3 81.7
16 71.1 98.5 60.2 82.5
Adam 8 71.2 83.1 72.6 83.3
16 72.8 91.5 70.5 84.4
32 64.3 78.1 65.8 80.1
Table 4
Comparison of performance metrics and latency for U-Net and Unet 3+ models
Model Batch size mIoU (%) Recall (%) Precision (%) F1 Score (%) Latency (ms)
U-Net 4 77.8 98.6 74.0 87.2 2.69
Unet 3+ 4 81.1 95.5 80.8 89.0 7.68
8 76.3 97.3 70.8 86.2 7.82

References

1. Svetlizky, D., Das, M., Zheng, B., Vyatskikh, A. L., Bose, S., Bandyopadhyay, A., Schoenung, J. M., Lavernia, E. J. & Eliaz, N. (2021). Directed energy deposition (DED) additive manufacturing: Physical characteristics, defects, challenges and applications. Materials Today, 49, 271–295.
crossref
2. Ahn, D.-G., (2021). Directed energy deposition (DED) process: state of the art. International journal of precision engineering and manufacturing-green technology, 8(2), 703–742.
crossref pdf
3. Thompson, S. M., Bian, L., Shamsaei, N. & Yadollahi, A. (2015). An overview of Direct Laser Deposition for additive manufacturing; Part I: Transport phenomena, modeling and diagnostics. Additive Manufacturing, 8, 36–62.
crossref
4. DebRoy, T., Wei, H. L., Zuback, J. S., Mukherjee, T., Elmer, J. W., Milewski, J. O., Beese, A. M., Wilson-Heid, A. d., De, A. & Zhang, W. (2018). Additive manufacturing of metallic components–process, structure and properties. Progress in materials science, 92, 112–224.
crossref
5. Sun, Z., Guo, W. & Li, L. (2020). Numerical modelling of heat transfer, mass transport and microstructure formation in a high deposition rate laser directed energy deposition process. Additive Manufacturing, 33, 101175.
crossref
6. Zhang, Z., Ge, P., Yao, X., Li, T. & Liu, W. (2020). Numerical studies of residual states and scaling effects in laser-directed energy deposition additive manufacturing. The International Journal of Advanced Manufacturing Technology, 108, 1233–1247.
crossref pdf
7. Cai, Y., Zhang, S., Wang, Y., Chen, H. & Xiong, J. (2024). Monitoring process stability in robotic wire-laser directed energy deposition based on multi-modal deep learning. Journal of Manufacturing Processes, 128, 111–124.
crossref
8. Tang, Z.-j., Liu, W.-w., Wang, Y.-w., Saleheen, K. M., Liu, Z.-c., Peng, S.-t. & Zhang, Z. (2020). A review on in situ monitoring technology for directed energy deposition of metals. The International Journal of Advanced Manufacturing Technology, 108, 3437–3463.
crossref pdf
9. Kong, J. H., & Lee, S. W. (2023). Development of melt-pool monitoring system based on degree of irregularity for defect diagnosis of directed energy deposition process. International Journal of Precision Engineering and Manufacturing-Smart Technology, 1(2), 137–143.
crossref pdf
10. Assad, A., Bevans, B. D., Potter, W., Rao, P., Cormier, D., Deschamps, F., Hamilton, J. D. & Rivero, I. V. (2024). Process mapping and anomaly detection in laser wire directed energy deposition additive manufacturing using in-situ imaging and process-aware machine learning. Materials & Design, 245, 113281.
crossref
11. Abranovic, B., Sarkar, S., Chang-Davidson, E. & Beuth, J. (2024). Melt pool level flaw detection in laser hot wire directed energy deposition using a convolutional long short-term memory autoencoder. Addit Manufacturing, 79, 103843..
crossref
12. Wang, L., & Shen, Q. (2021). Visual inspection of welding zone by boundary-aware semantic segmentation algorithm. IEEE Transactions on Instrumentation and Measurement, 70, 1–9..
crossref pmid
13. Baek, D., Moon, H. S. & Park, S.-H. (2024). In-process prediction of weld penetration depth using machine learning-based molten pool extraction technique in tungsten arc welding. Journal of Intelligent Manufacturing, 35(1), 129–145..
crossref pdf
14. Knaak, C., Kolter, G., Schulze, F., Kröger, M. & Abels, P. (2019). Deep learning-based semantic segmentation for in-process monitoring in laser welding applications. Proceedings of the Applications of machine learning (pp. 10–22.
crossref
15. Gobbi, C., (2016). Low cost thermal imaging system for welding applications. M.Sc. Thesis. University of Waterloo.

16. Fisenko, A. I., & Ivashov, S. N. (1999). Determination of the true temperature of emitted radiation bodies from generalized Wien’s displacement law. J. ournal of Physics D: Applied Physics, 32(22), 2882–2885.

17. Lei, J. B., Chen, J., Wang, Z. & Wang, Y. S. (2010). Measurement on Thermal radiation of standard high-temperature blackbody furnace in visible spectrum range. Materials Science Forum, 663–665, 223–226..
crossref
18. Costa, J. N., de Assis Faria, G., Porcaro, R. R. & Pereira, I. C. (2024). Evaluation of near immersion active cooling on the microstructure and mechanical properties of AISI 316L stainless steel obtained with additive manufacturing by DED-Arc. The International Journal of Advanced Manufacturing Technology, 134(3–4), 1419–1432..
crossref pdf
19. Motta, M., Demir, A. G. & Previtali, B. (2018). High-speed imaging and process characterization of coaxial laser metal wire deposition. Additive Manufacturing, 22, 497–507.
crossref
20. Balla, S. K., Mallaiah, M., Nagamuthu, S., Gurugubelli, R. C., Aranas, C. & Bontha, S. (2024). The influence of laser direct energy deposition processing parameters on Al7075 alloy and Zr-modified Al7075 alloy. The International Journal of Advanced Manufacturing Technology, 135(1), 181–201.
crossref pdf
21. Chen, J., Ouyang, Z., Du, X. & Wei, Y. (2022). Weld pool dynamics and joining mechanism in pulse wave laser beam welding of Ti-6Al-4V titanium alloy sheets assembled in butt joint with an air gap. Optics & Laser Technology, 146, 107558..
crossref
22. Mo, Y., Wu, Y., Yang, X., Liu, F. & Liao, Y. (2022). Review the state-of-the-art technologies of semantic segmentation based on deep learning. Neurocomputing, 493, 626–646.
crossref
23. Ronneberger, O., Fischer, P. & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. Medical image computing and computer-assisted intervention–MICCAI 2015, 9351.
crossref pmid
24. Nemoto, T., Futakami, N., Kunieda, E., Yagi, M., Takeda, A., Akiba, T., Mutu, E. & Shigematsu, N. (2021). Effects of sample size and data augmentation on U-Net-based automatic segmentation of various organs. Radiological Physics and Technology, 14, 318–327.
crossref pmid pdf
25. Igiri, C. P., Anyama, O. U. & Silas, A. I. (2015). Effect of learning rate on artificial neural network in machine learning. International Journal of Engineering Research. & Technology, 4(2), 359–363.

26. Bottou, L., (2010). Large-scale machine learning with stochastic gradient descent. P. roceedings of COMPSTAT’2010 (pp. 177–186.

27. Zhang, Z., (2018). Improved adam optimizer for deep neural networks. 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS) (pp. 1–2.
crossref
28. Huang, H., Lin, L., Tong, R., Hu, H., Zhang, Q., Iwamoto, Y., Han, X., Chen, Y.-W. & Wu, J. (2020). Unet 3+: A full-scale connected unet for medical image segmentation. ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1055–1059.
crossref
29. Nadernejad, E., Sharifzadeh, S. & Hassanpour, H. (2008). Edge detection techniques: evaluations and comparisons. Applied Mathematical Sciences, 2(31), 1507–1520.

30. Du, G., Cao, X., Liang, J., Chen, X. & Zhan, Y. (2020). Medical image segmentation based on U-Net: A Review. J. ournal of Imaging Science & Technology, 64(2).

31. Fu, J., Yang, Y., Guo, J. & Tong, W. (2008). Effect of cooling rate on solidification microstructures in AISI 304 stainless steel. Materials Science and Technology, 24(8), 941–944.
crossref pdf
32. Zheng, C., & Yu, W. (2018). Effect of low-temperature on mechanical behavior for an AISI 304 austenitic stainless steel. Materials Science and Engineering: A, 710, 359–365.
crossref
33. Saboori, A., Piscopo, G., Lai, M., Salmi, A. & Biamino, S. (2020). An investigation on the effect of deposition pattern on the microstructure, mechanical properties and residual stress of 316L produced by Directed Energy Deposition. Materials Science and Engineering: A, 780, 139179.
crossref
34. Saboori, A., Gallo, D., Biamino, S., Fino, P. & Lombardi, M. (2017). An overview of additive manufacturing of titanium components by directed energy deposition: microstructure and mechanical properties. Applied Sciences, 7(9), 883.
crossref

Biography

ijpem-st-2024-00178i1.jpg
Yeong-Seo Kim is a Ph.D. candidate at the School of Mechanical Engineering, Pusan National University. His research interests include metal-based additive manufacturing, machine learning-based monitoring, and advanced process optimization. He focuses on developing vision-driven auto-calibration systems to improve precision and efficiency in both metal and polymer manufacturing processes.

Biography

ijpem-st-2024-00178i2.jpg
Seung-Mun Lee is a Ph.D. student at the School of Mechanical Engineering, Pusan National University. His research interests include metal and plastic-based additive manufacturing, machine-learning-driven monitoring systems, and advanced process optimization techniques for additive manufacturing. He focuses on developing real-time monitoring solutions by utilizing data acquired through vision systems and sensors.

Biography

ijpem-st-2024-00178i3.jpg
Ju-Chan Yuk is a Ph.D. candidate at the School of Mechanical Engineering, Pusan National University. His research interests include metal-based additive manufacturing, machine-learning-assisted design optimization, and advanced process optimization techniques. He focuses on developing innovative structure and process design methods using deep learning and reinforcement learning.

Biography

ijpem-st-2024-00178i4.jpg
Suk-Hee Park received his Ph.D. degree from the Department of Mechanical Engineering at the Korea Advanced Institute of Science and Technology (KAIST). He is currently an associate professor at Pusan National University (PNU). His recent research interests include AI-assisted intelligent manufacturing and AI-based design optimization.
Editorial Office
12F, SKY1004 bldg., 50-1, Jungnim-ro, Jung-gu, Seoul 04508, Republic of Korea
TEL : +82-2-518-2928   E-mail : ijpem.st@kspe.or.kr
Developed in M2PI
Copyright © Korean Society for Precision Engineering.
Close layer
prev next