A Digital Approach to via Edge Roughness Characterization and Quantification

Article information

Int. J. Precis. Eng. Manuf.-Smart Tech.. 2024;2(2):93-99
Publication date (electronic) : 2024 July 1
doi : https://doi.org/10.57062/ijpem-st.2024.00010
1J. Mike Walker ‘66 Department of Mechanical Engineering, Texas A&M University 3123 TAMU, College Station, TX 77843-3123, USA
ChaBum Lee, cblee@tamu.edu
Received 2024 April 2; Revised 2024 June 18; Accepted 2024 June 24.

Abstract

This paper introduces a new digital integration that combines edge diffractometry with convolutional neural networks (CNN) for via metrology and inspection. The beam propagation method (BMP) was used to simulate the interferogram generated by edge diffractometry to characterize via edge roughness (VER). A comprehensive database was established to link different fringe patterns to VER for CNN training. The well-trained CNN-based methodology provided a fast and accurate assessment of VER, with a root mean squared error (RMSE) of 0.073 and an average mean absolute deviation ratio (MADR) of 2.274%. In addition, the proposed digital approach was compared to the multilayer perceptron machine (MLP) in terms of computational efficiency and predictive accuracy. The proposed digital integration greatly improved the accuracy and speed of VER measurement, characterization, and quantification, potentially enhancing device yield and reliability. The successful application of this digital approach could open up possibilities for various types of via or pattern metrology.

1 Introduction

Via metrology and inspection technology are vital to verify that the vias produced through machining, laser drilling, or chemical etching meet their designated specifications such as diameter, circularity, roundness, and edge roughness conditions [1]. The via technology enabled the 3D stacked chips for miniaturization, low power consumption, and high performance [2], which is also essential for signal transmission and the overall performance of the devices [3]. The dimension and shape accuracy of the vias is important since performance degradation or complete failure of the whole device could result from small discrepancies [4]. Similar to the line-edge roughness (LER) of the photomask patterns, the VER is also vital to ensure the reliability of the system, since the rough edges could lead to increased electron scattering, affecting the conductivity and cause variations in the electrical characteristics [5]. Currently, various via-measuring methods are utilized to perform the measurements. Optical microscopy is a widely used via inspection method, providing rapid assessment of size and shape, but its ability is limited for deep or small vias [6]. Scanning electron microscopy (SEM) and atomic force microscopy (AFM) enable detailed analysis of via edge roughness and its microstructure, but both methods are expensive and time-consuming, making them unsuitable for in-process inspection [7,8].

With the advancement of artificial intelligence (AI), Convolutional Neural Networks (CNNs) can automatically conduct feature extraction from images and predict performance of semiconductor component [9]. With the aid of big data technology and Industry 4.0 [10], various manufactured data will be continuously collected and stored, making the training sets more comprehensive. Thus, the robustness and reliability of CNN will further be enhanced over time [11]. However, the majority of deep learning-based methods are applied at wafer level. There are only a few studies that have applied deep learning for TSV inspection and quality evaluation. Current research mainly focuses on the detection of TSV extrusion, overall performance evaluation, and the dimension prediction of TSVs, including array alignment and critical dimensions [9,1214]. To our best knowledge, the application of deep-learning for edge roughness characterization, which is a very challenging metrology task, has not been developed yet.

Inspired by previous studies [1517], numerical optical simulation and CNN were integrated to automate via edge roughness characterization. By analyzing various via diffractive patterns and their corresponding VER, the CNN can automatically extract subtle patterns from these fringe images, and further correlate those features with roughness metrics. Thus, the well-trained model can directly predict the roughness from the diffraction pattern, enabling an accurate and fast via metrology and inspection. Instead of utilizing a simplified aperture diffraction model in previous studies, the beam propagation method (BPM) was applied to simulate the fringe patterns generated from the vias. In this paper, details of dataset construction and digital approaches to via characterization are explained, and the potential to be integrated into existing via inspection frameworks is also discussed.

2 Framework of the Digital Approach

Fig. 1 compares the flow charts of the proposed digital via inspection approach to the AS-IS hole metrology method. Instead of conducting complicated fringe preprocessing [17], a well-trained CNN model is capable of predicting the edge roughness condition from the diffraction image directly.

Fig. 1

Comparison between AI-based digital approach and AI-IS hole inspection method

CNN is widely used in computer vision and image processing since it simulates the cognitive pattern of the human brain [18], being successfully applied in various industrial scenarios [19,20]. In this study, the cropped diffraction image served as input and predicted VER values were the output.

The proposed digital approach consists of two major steps: the first step involves the establishment of a database that links distinct fringe patterns with their corresponding edge roughness conditions. The second step focuses on the setup of neural networks, including the training of models and optimization of hyperparameters. Details of each step will be discussed in the following sections.

3 Establishment of the Training Database

3.1 Experiments Setup

A grayfield through silicon via (TSV) inspection system was constructed as shown in Fig. 2(a), which features enhanced sensitivity to scattered light and remarkable diffractive image contrast. A collimated laser (λ 450 nm) was utilized as the light source. The laser was further shaped into a donut shape by the two axicon lenses installed below it, as depicted in Fig. 2(b). To reduce the intensity saturation in the imaging device, a 12.5% neutral density (ND) filter was applied. The filtered donut-shaped beam was then formed into a spherical wavefront through the first objective lens. Once the light interacted with the edge of the TSV, the Charge-coupled Device (CCD) on the bottom could capture the diffraction images. In this study, all diffraction images were captured from φ100 μm TSVs fabricated on 210 μm thick undoped single-crystalline silicon wafers.

Fig. 2

Schematics of grayfield Si-via inspection system

3.2 Simulation Model

The BPM was used to simulate the laser propagation inside the via sample. This method is widely applied for simulating electromagnetic (EM) wave propagation through waveguides [21,22] and other devices and media, such as waveguide junctions [23], optical switches [24], optical fiber amplifiers [25] and atmosphere with turbulence [26]. The BPM assumes that the envelope of the optical wave would not change abruptly during the propagation [23]. Therefore, the transverse electrical field density of the optical wave traveling in z-direction is represented as:

(1) Et=Ψte-in0k0z

where Ψt is the Slowly Varying Envelope (SVE) function, n0 is the reference refractive index, and k0 is the wave number of the laser in the vacuum.

By substituting this expression into the Maxwell equation and applying the paraxial approximation, the governing equation for a scalar wave propagation can be written as [23]:

(2) 2in0k0Ψtz2Ψtx2+2Ψty2+k02(n2-n02)Ψt

where n is the refractive index of the media. After giving the initial condition Ψt(z = 0) = Ψ0(x,y), the Ψt cou.ld be solved for each z value by finite difference method in the alternative direction implicit (ADI) scheme. In this study, the initial condition was set as the Bessel-Gaussian function to represent the laser beam generated by passing a Gaussian beam through a pair of axicon lenses [27,28]:

(3) Ψt(ρ,z=0)=Amexp[-ρ2wo2]Jm(aρ)

where w0 is the size of the beam waist, ρ=x2+y2 is the polar coordinate, Am, m and a are constant values. By choosing different combinations of w0, Am, m and a, various types of laser beams, including Gaussian beam and Donut-Beam, could be represented.

The simulation of BPM started and ended at the top (z = 0) and bottom (z = h) of via sample, respectively. Once the field value U0(x,y) = Ψt(z = hein0k0h was obtained at the bottom, the scalar diffraction theory was used to compute the diffraction pattern received by the CCD. The expression for computing the diffraction filed after propagating a distance Z is written as [29]:

(4) U2(x,y)=-1{{U0(x,y)}ejkzexp[-jπλZ(fX2+fY2)]}

where ℑ and ℑ−1 are Fourier transform and inverse Fourier transform, fX and fY are frequency variables associated with x and y respectively. The diffraction pattern on the CCD could be computed as [29]:

(5) I(x,y)=|U2(x,y)|2

where I(x,y) represents the irradiance of the light field, which is the time-averaged radiometric energy flux per unit area.

3.3 Training Set Preparation

Fig. 3 depicts the workflow of training datasets processing. After the acquisition of diffraction patterns through interferogram, image segmentation was conducted in a 10-degree interval. Subsequently, the measured diffraction pattern of each segment was extracted and compared to the BPM simulation fringe in ideal condition to obtain the similarity value (SI). Diffraction fringes of multiple edge conditions with different VER values were further simulated using BPM methods, which was used to compared with the ideal fringe. Thus, the relationship between VER and SI was obtained, and this calibration result was used to estimate the VER of experimental fringe pattern. This method was described in detail in a previous work [17].

Fig. 3

Schematics of the establishment of training datasets

After performing the same operation on each fringe segment, a dataset containing diffractive images and its associated VER was established, consisting of 1,872 entries total. The dataset served as input for training the CNN model, aiming for accurate edge roughness prediction based on precise pattern recognitions. In this study, five experiments were conducted: in each experiment, 100 samples were randomly selected as the testing set while the remaining 1,772 samples served as training sets.

3.4 Model Optimization

In order to improve the prediction accuracy and obtain the best CNN model, the hyperparameters, including the number and properties of different types of layers, were optimized in this study. Both perdition accuracy (RMSE and MADR) and computational burden (training time) were taken into consideration for choosing the best model. The final architecture of the CNN is shown in Fig. 4, which consists of 2 convolutional layers, 2 max-pooling layers, and 1 fully connected dense layer. In corresponding sub boxes of Fig. 4, @ denotes the number of feature maps, and × represents the dimensions of the filter or pooling window. During the training process, the adaptive learning rate strategy was applied to reduce the training time needed to reach convergence.

Fig. 4

The final architecture of the CNN model

4 Results

4.1 CNN Prediction Results

Fig. 5 shows the CNN prediction results of five experiments, allowing an intuitive evaluation of its performance. In each subplot, the blue points represented the relationship between the actual value and CNN’s predicted value, while the red dashed line represented the ideal prediction scenario where the predicted value equaled the actual value (Y = X). These charts visualized the prediction accuracy: the closer the data points were to the red line, the higher the accuracy of the predictions. As shown in Fig. 5, in all five scenarios, blue points were scattered in a region close to the red reference line along the whole VER range, indicating the CNN model was capable of making relatively precise VER predictions.

Fig. 5

Predicted vs actual scatter plot results of five datasets

Two evaluation metrics were further used to numerically evaluate the results: the root mean square error (RMSE), and the mean absolute deviation ratio (MADR) which is computed as mean absolute deviation divided by average. The average RMSE for all testing sets was 0.073, with the first and second sets having values less than 0.069 while the highest, 0.082, was from the fourth set. Considering the average VER was more than 2.480 μm, all RMSEs were considered relatively small and reflected the precision of the predictions. Upon further calculation of the MADR, the average value obtained was only 2.274%, with the fourth set having the highest value at just 2.767%. This further demonstrated that CNN’s average predictive error is below 2.300%, being considered highly accurate.

4.2 Additional Multilayer Perceptron (MLP) Results

The MLP, also a neural network-based model, is widely used to solve classification and prediction problems [3032]. In the MLP model, all the layers, including the input, hidden, and output layers, are fully connected layers [33]. As shown in Fig. 6, the MLP consists of one input layer, one output layer and 4 hidden layers. The hyper parameters of the MLP were also tuned through a 10-fold cross-validation for achieving the best performance. This relatively simple model structure can make the model training and prediction process more efficient by reducing the number of parameters. However, this smaller number of parameters limits the model’s capability for complicated tasks such as complicated image and video processing [34].

Fig. 6

The final architecture of the MLP model

A comprehensive comparison of the prediction accuracy and computation time between CNN and MLP was conducted, as shown in Fig. 7. From the results, CNN showed a significant advantage in prediction accuracy: with a lower average RMSE and a better MADR. The average RMSE of MLP results was 0.0872, 19.78% higher than that of CNN. Similarly, the average MADR results for MLP was 2.507%, 10.249% higher than that of CNN. In semiconductor industry, the TSVs provide electrical connection among multiple layers, even a small increase in VER could result in attenuation and distortion of the signal being transferred, due to the skin effect. Therefore, a small difference in prediction performance can have huge impacts in engineering practice.

Fig. 7

Result comparisons: (a) average RMSE, (b) average MADR, and (c) average training time

However, due to the simpler network structure of MLP, 23.144% less training time was needed, making it a more efficient inspection method in industrial applications. It is noteworthy that by comparing the standard deviation, CNN showed a more robust predictive performance, while MLP’s performance was significantly influenced by the training sets itself.

The choice between utilizing CNN or MLP should be guided by the specific industrial requirements, depending on whether the priority lies with achieving the highest accuracy possible or with efficiency and speed of computation.

5 Conclusion

Through integration of experimental and theoretical edge diffractometry with CNN, a novel digital approach to automatic via metrology and inspection was proposed in this paper. The electromagnetic wave propagation-based computational model of the via created a database of simulated fringe profiles, the experiment by the grayfield edge diffractometry was conducted to obtain the fringe pattern dataset to further compute VER. The CNN model was then trained to analyze various via diffractive profiles and their corresponding VER, automatically extracting subtle patterns from these fringe images. A further correlation between the fringe image features and VER showed the linear characteristics.

The proposed CNN-based digital approach showed a 0.073 RMSE and a 2.274% MADR in VER prediction results, proving its capacity in complex fringes analysis, offering a robust solution for the semiconductor industry’s demand for rapid and precise via inspection. An additional study on an MLP-based model offered a trade-off between prediction efficiency and accuracy.

In summary, the implementation of the proposed digital approach in via metrology is promised to improve the yield and reliability of semiconductor devices, further supporting the technological advancements in the industry. Future research could focus on further optimizing the AI models and exploring potential applications in other areas of precision engineering.

Acknowledgment

This research has been supported by the National Science Foundation (CMMI #212499).

Notes

Author’s Contribution

All authors equally contributed to this study: design, experiment, and data analysis.

Data Availability

The data that supports the findings of this study are available from the corresponding author upon reasonable request.

Disclosures

The authors declare no conflicts of interest.

References

1. Alapati R., Travaly Y., Van Olmen J., Teixeira R. C., Vaes J., van Cauwenbergh M., Jourdain A., Verbinnen G., Marcucilli G., Florence G., Wolfling S., Pelissier C., Zhang H., Sinha J., Machura A., Malik I.. 2009. TSV metrology and inspection challenges. In : Proceedings of the IEEE International Conference on 3D System Integration. p. 1–4.
2. Lu M. C.. 2022;Advancement of chip stacking architectures and interconnect technologies for image sensors. Journal of Electronic Packaging 144(2):020801.
3. Dong G., Biao Y., Xidong D., Yuan L.. 2017. Research on the influence of vias on signal transmission in multi-layer PCB. In : Proceedings of the 13th IEEE International Conference on Electronic Measurement & Instruments (ICEMI). p. 406–409.
4. Persson E., Eriksson I., Zackrisson L.. 1997;Effects of hole machining defects on strength and fatigue life of composite laminates. Composites Part A: Applied Science and Manufacturing 28(2):141–151.
5. Duan S., Zhang Z., Wei K., Wang F., Han X.. 2020;Theoretical study and physical tests of circular hole-edge stress concentration in long glass fiber reinforced polypropylene composite. Composite Structures 236:111884.
6. Liao H.-C., Lim Z.-Y., Hu Y.-X., Tseng H.-W.. –2018. Guidelines of automated optical inspection (AOI) system development. In : Proceedings of the IEEE 3rd International Conference on Signal and Image Processing (ICSIP). p. 362–366.
7. Kaspar J., Luft A., Nolte S., Will M., Beyer E.. 2006;Laser helical drilling of silicon wafers with ns to fs pulses: Scanning electron microscopy and transmission electron microscopy characterization of drilled through-holes. Journal of Laser Applications 18(2):85–92.
8. Bayat P., Vogel D., Rodriguez R. D., Sheremet E., Zahn D. R. T., Rzepka S., Michel B.. 2015;Thermo-mechanical characterization of copper through-silicon vias (Cu-TSVs) using micro-Raman spectroscopy and atomic force microscopy. Microelectronic Engineering 137:101–104.
9. Kim J., Kim S., Kwon N., Kang H., Kim Y., Lee C.. 2018. Deep learning based automatic defect classification in through-silicon via process: Fa: Factory automation. In : Proceedings of the 29th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC). p. 35–39.
10. Popkova E. G., Ragulina Y. V., Bogoviz A. V.. 2019. Industry 4.0: Industrial revolution of the 21st century Springer.
11. Ding N., Möller K.. 2022;Robustness evaluation on different training state of a CNN model. Current Directions in Biomedical Engineering 8(2):497–500.
12. Jalilvand G., Lindsay J., Reidy B., Shukla V., Duggan D., Zand R., Jiang T.. 2021. Application of machine learning in recognition and analysis of TSV extrusion profiles with multiple morphology. In : Proceedings of the IEEE 71st Electronic Components and Technology Conference (ECTC). p. 1652–1659.
13. Kim K., Park H., Lho D., Kim M., Son K., Son K., Kim S., Shin T., Choi S., Kim J.. 2020. Deep reinforcement learning-based through silicon via (TSV) array design optimization method considering crosstalk. In : Proceedings of the IEEE Electrical Design of Advanced Packaging and Systems (EDAPS). p. 1–3.
14. Li J.-W., Su E., Ho C.-C.. 2024;Prediction of critical dimensions for 3D TSV structures using artificial neural network. Metrology, Inspection, and Process Control XXXVIII 12955:649–653.
15. Lu K., Wang Z., Chun H., Lee C.. 2023;Curved-edge diffractive fringe pattern analysis for wafer edge metrology and inspection. Proceedings of the Metrology, Inspection, and Process Control XXXVII 12496:105–106.
16. Wang Z., Lu K., Lee C.. 2023. A fringe pattern analysis technique for photomask line-edge-roughness characterization. In : Proceedings of the International Manufacturing Science and Engineering Conference. 87240p. V002T06A007.
17. Lu K., Lee C.. 2024;Hole edge metrology and inspection by edge diffractometry. Journal of Manufacturing Science and Engineering 146(7):070905.
18. Goodfellow I., Bengio Y., Courville A.. 2016. Deep learning The MIT press.
19. Kim I. S., Lee M. G., Jeon Y.. 2023;Review on machine learning based welding quality improvement. International Journal of Precision Engineering and Manufacturing-Smart Technology 1(2):219–226.
20. Shin H., Ahn J., Beak S. W., Lee S. W.. 2024;Development of 1D-convolutional neural network-based height profile prediction model in directed energy deposition process using melt-pool image data. International Journal of Precision Engineering and Manufacturing-Smart Technology 2(1):57–65.
21. Raghuwanshi S. K., Palodiya V.. 2016;Beam propagation and mode coupling study in a coupled waveguide structure by using scalar finite element method. Optik 127(3):1237–1244.
22. Masanori K.. 2014. Optical waveguide theory by the finite element method Springer Dordrecht.
23. Pedrola G. L.. 2015. Beam propagation method for design of optical waveguide devices John Wiley & Sons.
24. Wu Y.-D.. 2005;All-optical logic gates by using multibranch waveguide structure with localized optical nonlinearity. IEEE Journal of Selected Topics in Quantum Electronics 11(2):307–312.
25. Jauregui C., Eidam T., Limpert J., Tünnermann A.. 2011;Impact of modal interference on the beam quality of high-power fiber amplifiers. Optics Express 19(4):3258–3271.
26. Chahine Y. K., Tedder S. A., Vyhnalek B. E., Wroblewski A. C.. 2020;Beam propagation through atmospheric turbulence using an altitude-dependent structure profile with non-uniformly distributed phase screens. Proceedings of the Free-Space Laser Communications XXXII 11272:263–277.
27. Dallaire M., McCarthy N., Piché M.. 2009;Spatiotemporal bessel beams: theory and experiments. Optics Express 17(20):18148–18164.
28. Dickey F. M.. 2018. Laser beam shaping: theory and techniques CRC press.
29. Voelz D.. 2011. Computational fourier optics: A MATLAB Tutorial SPIE.
30. Amani S., Lee J. B., Park S. S.. 2023;Prediction of spread in steel wire rod rolling: transferable and explainable approach. International Journal of Precision Engineering and Manufacturing- Smart Technology 1(1):19–33.
31. Kim S., Kim Y., Sugita N., Mitsuishi M.. 2023;Surface assessment of transparent glass plate with wavelength-modulated interferometry and harmonic phase-iterative method. International Journal of Precision Engineering and Manufacturing-Smart Technology 1(1):71–81.
32. Chae S., Bae S.. 2024;Bayesian hyper-parameter optimization in one-dimensional convolutional autoencoder for monitoring bearing health status. International Journal of Precision Engineering and Manufacturing-Smart Technology 2(1):15–22.
33. Avuçlu E., Başçiftçi F.. 2018;New approaches to determine age and gender in image processing techniques using multilayer perceptron neural network. Applied Soft Computing 70:157–168.
34. Desai M., Shah M.. 2021;An anatomization on breast cancer detection and diagnosis employing multi-layer perceptron neural network (MLP) and Convolutional neural network (CNN). Clinical eHealth 4:1–11.

Biography

Mr. Kuan Lu is a PhD student in J. Mike Walker ‘66 Department of Mechanical Engineering at Texas A&M University. His research interest includes optical system design, signal/image processing and artificial intelligence-based inspection algorithms.

Mr. Pengfei Lin is a PhD student in J. Mike Walker ‘66 Department of Mechanical Engineering at Texas A&M University. His research interest includes optical system modeling, optical analysis tool development, and artificial intelligence-based inspection algorithms.

Dr. ChaBum Lee is currently an associate professor in J. Mike Walker ‘66 Department of Mechanical Engineering at Texas A&M University. Prior to joining Texas A&M University, he served an assistant professor at Tennessee Tech University from 2015 to 2018. He received his PhD from Gwangju Institute of Science and Technology in 2012 and worked at LG Display from 2012 to 2013. His research interest includes precision metrology, instrumentation and inspection for semiconductors and optical devices.

Article information Continued

Fig. 1

Comparison between AI-based digital approach and AI-IS hole inspection method

Fig. 2

Schematics of grayfield Si-via inspection system

Fig. 3

Schematics of the establishment of training datasets

Fig. 4

The final architecture of the CNN model

Fig. 5

Predicted vs actual scatter plot results of five datasets

Fig. 6

The final architecture of the MLP model

Fig. 7

Result comparisons: (a) average RMSE, (b) average MADR, and (c) average training time