A comparative analysis of machine learning approach for optimizing antenna design

Sarbagya Ratna Shakya; Matthew Kube; Zhaoxian Zhou

doi:10.1017/S1759078723001009

A comparative analysis of machine learning approach for optimizing antenna design

Published online by Cambridge University Press: 29 August 2023

Sarbagya Ratna Shakya ,

Matthew Kube and

Zhaoxian Zhou

Show author details

Sarbagya Ratna Shakya: Affiliation:
Department of Mathematical Sciences, Eastern New Mexico University, Portales, NM, USA
Matthew Kube: Affiliation:
Department of Mathematical Sciences, Eastern New Mexico University, Portales, NM, USA
Zhaoxian Zhou*: Affiliation:
School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS USA
*: Corresponding author: Zhaoxian Zhou; Email: zhaoxian.zhou@usm.edu

Article contents

Abstract
Introduction
Literature review
Methodology
Experiments/results
Conclusion
Funding statement
Competing interests
References

Rights & Permissions

Abstract

With the increasing demand for smarter antenna design in advanced technology applications, well-designed antennas have been an important factor in enhancing system performance. Most traditional antenna design requires multiple iterations and extensive testing to produce a final product. Machine learning (ML) algorithms have been used as an alternative to predict the optimal design parameters, but the outcome depends highly on the ML model efficiency. With recent development in machine learning algorithms and the availability of data for antenna design, we investigated different machine learning algorithms for optimizing the output strength of three basic antennae by analyzing the signal strength of the antenna for various antenna parameters. Different regression-based ML models were used to learn the behaviors and efficiency of three different antennas and to predict the output strength (S11) for different ranges of frequencies. The experiment compared and analyzed these ML regression algorithms for three different antennas: shot antenna, patch antenna, and bowtie antenna. In addition, the paper also provides comparison of ensemble ML models for performance analysis using the best three ML algorithms from the preliminary study. This study optimizes antenna parameters and quicker and smarter antenna design procedure using ML algorithms as compared to traditional design methods.

Keywords

antenna design bowtie antenna machine learning optimization patch antenna regression slot antenna

Type: Research Paper
Information: International Journal of Microwave and Wireless Technologies , First View , pp. 1 - 11

DOI: https://doi.org/10.1017/S1759078723001009 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2023. Published by Cambridge University Press in association with The European Microwave Association.

Introduction

In this coming era of the Internet of Things and advanced wireless communication, antenna design has become one of the most important components to achieve the standards defined in communication and electromagnetic wave propagation. To meet the requirements for modern applications for multiband operations like 5G/6G [Reference Alieldin, Huang, Boyes, Stanley, Joseph, Hua and Lei1, Reference Pandey, Singh and Kumar2], the receiving and transmitting efficiency of signals is highly dependent on the antenna configuration. This has increased the need to optimize the design of these antennae for an efficient communication system and performance improvement. In the traditional design process, antenna parameters such as size, shape, substrate, and materials used for the antenna are tuned, tested, and fabricated based on experience driven [Reference Mendes and Peixeiro3], consuming high computational resources and time [Reference Chen, Elsherbeni and Demir4]. Also, with increased challenges such as enhancing efficiency [Reference Liu, Aliakbarian, Ma, Vandenbosch, Gielen and Excell5] of radiated power, thermal considerations, increased bandwidth for higher data transfer speeds, and the demand for application-specific antennas, the complexity of designing antennas of variable functions and compact sizes has increased. Electromagnetic simulation has been performed for antenna synthesis and design, which also requires high-performance requirements [Reference John and Ammann6].

With the use of ML algorithms [Reference Wu, Cao, Wang and Hong7, Reference El Misilmani and Naous8], these computational challenges can be minimized and in return yield greater optimized results. These statistical learning techniques, along with numerical optimization methods, known as surrogate-based optimization [Reference Forrester and Keane9], have proved to improve antenna synthesis [Reference Koziel and Ogurtsov10]. Various ML algorithms have been used to develop complex antenna technology with automated and cost-effective design [Reference Zhang, Akinsolu, Liu and Vandenbosch11, Reference Shrestha, Fu and Hong12]. Application of these ML algorithms for antenna design has proved more efficient than the traditional process, which mainly depends on trial-and-error methods and mathematical equations and formulas.

In this paper, we have analyzed different ML algorithms to compute the output strength for different dimensions of the proposed multiple antennas. The paper also provides a dataset generated from the simulation for these multiple antennas. The details of the dataset are provided in the section Methodology. The study provides a comparative analysis of different ML regression methods to analyze the antenna strength for different data samples of the antenna. These methods have also been used for different types of antennas.

The main contribution of this paper is as follows:

a. We provide a simulated dataset for two different antenna designs using MATLAB antenna tool. This dataset provides signal strength for various antenna dimensions as the input parameters.
b. This paper shows the comparison of the performance of different ML algorithms for predicting antenna strength. This baseline performance shows that the model selection holds great importance in the prediction performance.
c. We analyze the ensemble methods to enhance the performance of the model. This shows that different models can be utilized to increase the model’s performance.

The paper is structured in the following way. The Introduction provides an introduction of ML application in antenna design. The Literature review section presents the literature review of machine learning in antenna design. The Methodology section provides the details of the dataset and methodology used in the research process, and the Experiments/Results section provides the experimental results. Finally, the Conclusion section concludes the presented research.

Literature review

In recent years, machine learning has shown great results in applications related to antenna design. From multiantenna wireless technology to a millimeter-wave antenna design, the machine learning approach has been used for different applications. Optimizing the results with better performance, minimizing the errors, better computational efficiency, and time saving with the reduced number of necessary simulations are some of the advantages of using ML in antenna design [Reference El Misilmani and Naous8].

Although simulating the models for antenna design reduces the cost of prototyping, it requires a large time-consuming process. Different data-driven surrogating models [Reference Sendrea, Zekios and Georgakopoulos13] and inverse surrogate modeling [Reference Koziel, Belen, Çaliskan and Mahouti14] has been proposed for multiple different types of antenna [Reference Zhang, Song and Rahmat-Sammi15, Reference Kalayci, Ayten and Mahouti16] design to reduce the computational cost and accelerate the simulation process for optimizing and regularizing the antenna design process. In [Reference Pietrenko-Dabrowska, Koziel and Ullah17], the author has proposed a novel modeling techniques using small training data sets using surrogate antenna modeling to reduce cost. In [Reference Piltan, Kizilay, Belen and Mahouti18], data-driven surrogate models like artificial intelligence algorithms, including deep learning algorithms, are used to design horn antenna to achieve a computationally efficient design and reduce the computational cost by more than 80%. The authors have used these surrogate models to predict the gain of the horn antenna using geometrical dimensions, frequency, and radiation directions as the design parameters and compared the computational cost of these models with the electromagnetic simulation models cost.

For parameter optimization, different ML algorithms have been used where simulated data are used to train these algorithms. These trained models were then used to predict the performance of the antenna for a different design. ML algorithms are also used to analyze the speed of the computation process with good accuracy, where different approaches such as support vector machine (SVM) and Method of Moments [Reference Florencio and Encinar19] are compared based on the speed and accuracy in antenna design [Reference Prado, López-Fernández, Arrebola and Goussetis20]. ML algorithms have shown better results in many antenna design than the theoretical results. In [Reference Güneş21], SVM algorithms are also used to design rectangular microstrip antennae. The research shows the results from this model and then compared with the result of the artificial neural network (ANN). The results show that SVM has a higher convergence rate and better computational efficiency than an ANN. Bayesian Regularization algorithms have been implemented to design a planar inverted F-antenna where the ML algorithm has been used to minimize the error and accelerate the cycle time for the new material synthesis with fewer simulations time [Reference Gianfagna, Swaminathan, Raj, Tummala and Antonini22]. In [Reference Tan23], authors have used linear regression (LR) methods to evaluate the feasibility of antenna design, a heuristic algorithm-enhanced ANN to model the embedded antenna, and then a multi-fidelity neural network to model and optimize the antenna design. These surrogate models can be used and replace EM simulations for optimization, decreasing the simulation time and with satisfactory accuracy. Authors have used Kriging algorithms to design high-performance reflect arrays, reducing computational time. In [Reference Tak, Kantemur, Sharma and Xin24], the authors show the comparison of the simulated S11 characteristics and gain of the antenna obtained using ANNs and antenna fabricated from the SLA 3D printing techniques, with optimized design parameters obtained from ML techniques. The results show a good value with slight errors for the W-Band slotted waveguide antenna array.

Also, in recent years, different optimization approaches have been analyzed with evolutionary algorithms such as genetic algorithms (GA) [Reference Korkmaz, Alibakhshikenari and Kouhalvandi25], swarm intelligence [Reference Wang, Wang and Yi26], and differential evolution (DE) algorithms, which are designed from the inspiration of nature and have been used to optimize the antenna design [Reference Gregory, Bayraktar and Werner27, Reference Hoorfar28]. In [Reference Silva and Martins29], GAs have been combined with interpolation to design an ultrawide band ring monopole antenna. Also, in [Reference Oliveri, Gottardi, Robol, Polo, Poli, Salucci, Chuan, Massagrande, Vinetti, Mattivi and Lombardi30], GAs have been used to design unconventional antenna systems for 5G-based stations. The proposed innovative methodological paradigm codesign the antenna elements and clustered array to optimize the multiobjective antenna shape. Swarm optimization has been applied in [Reference Dai and Luk31] to design a planar wideband antenna array for millimeter-wave operation. The methods used the optimized array configuration to suppress the sidelobe level of the antenna array. Particle swarm optimization algorithm has also been used to design a multiband patch antenna using ANN. The fabricated antenna design from the optimized parameter has shown good analogy between the measured and simulated results [Reference Jain32]. Particle swarm optimization methods along with ANN has been used in [Reference Anuradha and Sinha33] to design custom-made fractal antennas. DE algorithms have been used for antenna array design where large unequally spaced planar arrays have been designed using a new encoding mechanism that has achieved much lower sidelobe levels. These optimization techniques were used for determining the design parameters and optimizing the shape of the antenna for the required frequency. For the optimal structure of the E-shaped antenna, an efficient electromagnetic structure has been proposed combining Kriging algorithms and DE to optimize the resonant frequency [Reference Chen, Guo, Pei and Man34].

These algorithms have shown many advantages and are more efficient compared to traditional stochastic optimization algorithms [Reference Liu, Ban and Jay Guo35]. Similarly, ML algorithms has been combined with other evolutionary algorithms like self-adaptive DE [Reference Gregory, Bayraktar and Werner27] and wind driven optimization [Reference Bayraktar, Komurcu, Bossard and Werner36] to optimize the design of different antenna to achieve optimized antenna design with minimum computational cost and faster convergence rate.

Methodology

To compare the performance of different ML algorithms for antenna design, we used three different approaches. In approach 1, (Figure 1) we analyzed our dataset with different individual ML algorithms. In this experiment, all the datasets were divided into (80/20) training and testing datasets. First, we analyze the performance of the base model with the default hyperparameter values of the ML algorithms. As the selection of the hyperparameters of the ML models are important to increase the model performance, we used Bayesian Optimization algorithms to tune the model parameters. So, to determine the optimized hyperparameter of the ML models, we implement Bayesian Optimization techniques along with 10-fold cross validation (CV), integrated with each of these ML models to tune selected hyperparameters on the training dataset. We run the iterations 250 number of times. The models with these optimal tuned hyperparameters were used to test the performance on the testing dataset using different evaluation metrics. The k-fold CV was used to ensure that the model is not overfitted. Apart from the selected hyperparameters used for optimization, all other parameters’ default value were used in the experiments as described in the models’ algorithms. The details of the tuned hyperparameters obtained from Bayesian Optimization algorithms for each ML algorithm are provided in the section Models used. For evaluation, two evaluation metrics were used to compare and analyze the performance of the ML algorithms. Among 10 different ML regression algorithms, the top 3 performers were used for ensemble methods in approach 3.

Figure 1. ML regression method.

In approach 2, we apply the k-fold CV method in the entire dataset to train and test the model. The entire dataset was divided into k-folds, where k − 1 folds were used for training and the kth fold was used for testing. The process was repeated for all the folds. The results show the average scores and deviations after all the iterations. We have selected 10 folds in our study. The objective of this approach is to measure consistency in the model avoiding overfitting or underfitting while training the model and helps to show the generalization patterns, robustness, and uniformity of the model performance in the entire dataset. Figure 2 [Reference Ren, Li and Han37] shows the block diagram for the k-fold CV method.

Figure 2. K-fold cross validation method.

In approach 3, we used ensemble techniques of ML algorithms to compare the performance. In this method, we used the three best ML models obtained from approach 1 for each of our three antennas. For our experiment, we selected three antennas of different types: rectangular microstrip (Patch) antenna (microstrip antenna), slot antenna (aperture antenna), and bowtie antenna (log periodic antenna). Figure 3 shows the methodology for the ensemble methods using three high-performance ML models.

Figure 3. Ensemble method methodology with three high-performance ML models.

Dataset description

In our study, we have used three datasets, Dataset 1 for slot antenna, Dataset 2 for microstrip patch antenna, and Dataset 3 for bowtie antenna to compare the performance of different ML algorithms. Among these three datasets, dataset 1 of slot antenna [Reference Shreya38] is a publicly available dataset. The samples of dataset 2 of the microstrip antenna and dataset 3 of the bowtie antenna are simulated and collected using MATLAB for experimental purposes. In our simulated dataset, sample points were generated with all possible combinations of the input parameters like the dimension of the antenna, which increased at a fixed step size. The values of the antenna strength (in dB) is extracted for these sample points for a frequency range of step size 0.1 GHz. As most of the mobile services, like Long-Term Evolution (LTE) and mid-band 5G, operate in microwave frequency spectrum range from 1.5 to 3.5 GHz, we selected this frequency range as our operation frequency in the experiment.

Dataset 2 was simulated on a CPU using Ryzen 5700 U microprocessor with 16 GB RAM. The system took around 3 hours to generate the dataset. Dataset 3 was generated with the system having Ryzen 1600C CPU with Nvidia RTX2060 super graphics card with 16 GB RAM. The dataset was generated in about 30 minutes. Simulating these models is challenging and requires high computational cost and time, which can vary from about 15 seconds to more than few days, based on the model specification [Reference Koziel, Belen, Çaliskan and Mahouti14]. The proposed approach can be applied to efficiently design the antenna using the information learned from these generated data without the need to simulate more to generate new data.

The details about the three datasets, including their features and the collection process, are described below.

Dataset 1 – slot antenna

This dataset [Reference Shreya38] is a publicly available dataset of slot antenna, which consists of 1267 samples with five different features (slot length, slot width, patch length, patch width, and frequency) and strength. The whole frequency range for the sample is between 1.5 and 3.5 GHz. Each parameter of the sample is increased by 0.5 mm. The signal strength of the antenna is simulated for each combination of input parameters. The slot length is from 0 to 85 mm; slot width is from 0 to 115 mm; patch length is from 29.4 to 87 mm; and patch width is from 26 to 130 mm. This dataset was created by using HFSS software. Figure 4 [Reference Balanis39] shows the general structure of a slot antenna.

Figure 4. Slot antenna.

Dataset 2 – microstrip patch antenna

This dataset consists of 10,453 samples with three input design features of patch antenna (patch length, patch width, and frequency). The total frequency range that we used for simulating the antenna parameters in the design is also from 1.5 to 3.5 GHz. MATLAB Simulink Antenna Toolbox was used to simulate the samples to generate the output signal strength for all combinations of input design parameters. The patch length and patch width were increased by a step size of 0.01 mm, and frequency was increased by 0.1 GHz. All other values were kept to their default value during the simulation. Figure 5 [Reference Kashyap and Sandeep40] shows the general structure of the microstrip patch antenna.

Figure 5. Microstrip patch antenna.

Dataset 3 – bowtie antenna

The bowtie antenna is a simple antenna where the antenna feed is at the center of the antenna. This antenna is also called butterfly antenna or a biconical antenna. The angle between the two metal pieces (D) is dependent on the length and width of the antenna and is calculated by the equation.

(1)

\begin{equation}D = 2 \times a\,\tan \left( {L/W} \right)\end{equation}

The bowtie antenna dataset consists of a total of 14,280 samples. This dataset also consists of three features (length, width, and frequency) as a design parameter. This dataset was also generated using MATLAB Simulink Antenna Toolbox. The length and width of the bowtie antenna were varied by increasing by 0.01 mm from 0 to a maximum value of 4 mm. The frequency also varied from 0.1 GHz in a range from 1.5 to 3.5 GHz. For each combination of these inputs, the signal strength was simulated for the bowtie antenna. All other parameters were kept to their default value during simulation in MATLAB. Figure 6 [Reference Balanis39] shows the general structure of the bowtie antenna. Table 1 shows a detailed comparison between these three datasets.

Figure 6. Planar bowtie antenna.

Table 1. Comparison of different parameters of three antenna datasets

Models used

In our initial experiment, we used 10 commonly used ML regression algorithms, LR, stochastic gradient descent (SGD) regressor, random forest (RF) regression, decision tree (DT) regressor, gradient boosting (GB), bagging regressor (BR), XGB regressor (XGB), support vector regressor (SVR), MLP regressor (MLP), and K Nearest Neighbors regressor (KNN). For uniform comparison, all the ML models were trained with the same training data and tested with the same testing data. We use ML regression models from the scikit-learn python library with hyperparameter tuning using Bayesian Optimization Algorithm for initial evaluation. The performance of the model with tuned hyperparameter is compared with the base model with default parameter value. The parameter of the base ML model is replaced with only those optimized parameters which improved the performance on the testing data. The models and its tuned hyperparameters used in the experiments are listed below.

RF(n_estimators=8, max_depth=10, criterion= ‘squared_error’, min_samples_split=0.0021, min_samples_leaf=0.0005, ma_features=’log2’)
DT(criterion=’squared_error’, max_features=’sqrt’, min_samples_leaf=0.00106, min_samples_split=0.00088, splitter=’best’)
GB(criterion=’squared_error’, learning_rate=0.7, loss=’friedman_mse’, n_estimators=140, max_depth=50, max_features=None, min_samples_leaf=0.0144, min_samples_split=0.1808)
BR(n_estimators=70)
XGB(n_estimators= 1000, max_depth=5, eta=0.1)
KNN(n_neighbors=3, weights=distance, algorithm=’kd_tree’, leaf_size=40).

Apart from the aforementioned, all other hyperparameters were used with their default values. Also, for dataset 2 (patch antenna) and dataset 3 (bowtie antenna), DT and RF were used with their default hyperparameters as the optimization techniques did not improve the model performance.

We used four ensemble methods that have been popular for regression process. In each ensemble model, we used three high-performance models determined from our preliminary study to predict the output. The details of these four ensemble methods are described below.

• Averaging method: In this method, the model returns the average of the prediction of the three best performing for each antenna. This method reduces the variance, and the combined results is better than the individual output.
• Stacking method: In this method, the whole dataset is divided into training and testing dataset. The three high performing ML models are trained individually on the training dataset as base model. This trained model is used to predict on the testing dataset. Then the predictions on the training data set are used as a feature to build the new meta-regression model. This model uses the training stacking features to train, and then the final model is used to predict the output on the test dataset.
• Blending method: In blending method, we split the dataset into training, testing, and validation datasets. Then, we used the three top-performing ML regression algorithms to train on the testing dataset. The performance is tested on validation and test datasets. Based on the predictions of the individual models, the final model is built using the meta features. Then this model is used for predicting the output and for performance analysis.
• Bagging method: In this method, the base model, XGBoost regressor method, is trained on bags, which are the subsets of the dataset made to the size of the whole dataset with replacements. These multiple bags that are created from the train dataset are trained on the base model individually. The output coming from all these experiments are combined to get the final output.

Evaluation metrics

To evaluate the performance of our ML models, we used two popular evaluation metrics: root mean square error (RMSE) score and R-squared (R ² or coefficient of determination) score, to measure the model performance in our testing data. In both these evaluation metrics, the target value is compared with the actual value to analyze the performance of the machine learning algorithms. RMSE metrics tell us about the average distance between the predicted and actual values and is calculated as:

(2)

\begin{equation}{\textrm{RMSE}} = \sqrt {\mathop \sum {{\left( {{P_i}^2 - {Q_i}^2} \right)}^2}/n} \end{equation}

where ∑ is summation, P_i is the predicted value for ith observations, Q_i is the observed value for the ith observations, and N is the sample size.

R ² represents the proportion of the variance for a dependent variable that is explained by an independent variable or variable in a regression model. It measures how well the regression line approximates the actual data. The variance can be calculated as

(3)

\begin{equation}{{\textrm{R}}^2} = 1 - {{\rm sum\, squared\, regression\left( {SSR} \right)} \over {\rm total\, sum\, of\, square\left( {SST} \right)}}\end{equation}

(4)

\begin{equation}{R^2} = 1- \frac{ {\sum _{i = 1}^n{{\left( {{y_i} - \hat y} \right)}^2}}} {{\sum_{i = 1}^n{{\left( {{y_i} - \widehat {\bar y}} \right)}^2}}}\end{equation}

where y_i = actual y value, $\hat y$ = predicted y value, and $\widehat {\bar y}$ = mean of y value.

Experiments/results

Our preliminary results show that there is a great possibility to utilize machine learning algorithms to optimize the design of an antenna. Table 2 shows the results for the RMSE score and R ² value for different ML algorithms for all three antennae. For slot antenna, RF, GB, and KNN have the top three performances with 0.4087, 0.7484, and 0.3752 RMSE values, respectively. Similarly, for the microstrip patch antenna, RF, BR, and XGB are the top three performers. Each has an R ² value of 99.83%. For the bowtie antenna, RF with a 0.959, BR with 0.5889, and XGB with a 0.7712 RMSE scores are the top three performers. For all antenna datasets, RF is in the top three for all the antenna datasets. SGD regressor and SVR have the lowest performance among all regression methods. As we have used optimization techniques for selecting the optimal hyperparameters for each model, most of the models’ performance has improved, decreasing the RMSE scores.

Table 2. RMSE and R ² values for all the ML algorithms for all antennae

Table 3 shows the scatter plot of the actual strength and predicted strength for different dimensions of different antennae for different ML algorithms. From the figure, we can see that the points are gathered near the regression line in RF, DT, GB, BR, and XGB, which have higher performance and low RMSE score. This shows that these ML models can correctly predict the strength of an antenna based on the input dimension parameters. The point is scattered far away from the regression line in other ML algorithms. This shows that this model has not been able to learn much from the training data and is not suitable for the prediction for these kinds of dataset with these input features. These models may improve the performance of the dataset with different features, along with changing the model parameters for better prediction.

Table 3. Scatter plot for all the ML algorithms for three antennas

Table 4 provides the performance evaluation using 10-fold CV methods. This table shows the average (mean score) of the RMSE score and standard deviation of all 10 iterations. In the 10-fold CV also, RF has shown great performance for slot antenna and bowtie antenna, whereas XGBregressor has the highest performance for the patch antenna. Also, apart from some scores, the CV score is quite like the individual ML performance where we have divided the whole dataset into training and testing datasets. This also shows the uniformity of the model performance and removes the overfitting of data while training the model.

Table 4. Mean score and standard deviation using CV methods

Table 5 shows the RMSE score and R ² value for the four ensemble methods which use the top three performers from approach 1. For the analysis of slot antenna, we used RF, GB, and KNN regressor models, whereas for patch antenna and bowtie antenna, we used RF, BR, and XGB regressor as they are the top three highest performing models among all the ML models. Blending methods has shown a very good performance in slot antenna (having an R ² value more than 98%). All the ensemble methods have shown a comparable result in patch antenna, where R ² value is more than 99% in all the methods. Similarly, blending method has proved to be better with more than 99% of R ² value for bowtie antenna as compared to other ensemble methods. It shows that using these methods, more than 99% of the variation in the predicted values is accounted for by the input parameters of the antenna. Also, looking at the performance of the patch antenna and bowtie antenna as compared to the slot antenna, the number of data points has shown great importance in the prediction performance. With larger number of sample size in patch antenna and bowtie antenna as compared to very small sample size in slot antenna, experiment shows that increasing sample size can increase the prediction accuracy. This may be because more sample points provide more information about the data and therefore provide better estimation results with higher probability of convergence while training the model.

Table 5. RMSE and R ² results for all the ensemble models for all our datasets

Table 6 shows the scatter plot using the ensemble methods for all three antennas. Apart from some outliers which are very far away from the regression line, we can see that most of the points are grouped near the regression line showing the accurate prediction of the antenna strength as compared to the actual values from the dataset.

Table 6. Scatter plot for all the ensemble ML algorithms for three antennas

Conclusion

In this paper, we presented the prediction of signal strength of three types of antennas with different machine learning models. Three datasets with various antenna parameters were used to optimize the design process. The three types of antennas are slot antenna, microstrip patch antenna, and bowtie antenna. The one dataset is publicly available generated with HFSS software, and the other two datasets are simulated ones with MATLAB software.

Among these algorithms, the top three regressor models for antenna design are RF, GB, and KNN for slot antennas; RF, BR, and XGB for patch antennas; and RF, BR, and XGB for bowtie antennas. We conclude that these approaches can predict the strength of the antennas accurately based on a set of different design parameters in an insignificant amount of time, replacing the traditional methods which includes multiple iterations and testing. By using the top three performers, we implemented four types of ensemble models for each type of antenna. The four methods for the ensemble modes are averaging, stagging, blending, and bagging. We conclude that the ensemble methods give better results compared with the individual ML algorithms. Overall, we have confirmed the robustness of the selected ML models and approaches.

We have applied Bayesian Optimization algorithms to get the optimized hyperparameters of the machine learning algorithms in our research. As shown in our research, the quality of the dataset also plays a critical role in the algorithm performance. Building datasets with better features, including wider bandwidth and finer antenna structure, either by simulation or from physical experiments, will be our future interest. Also, to investigate more machine learning algorithms and possibly deep learning algorithms like pyramidal deep neural network and convolutional neural network into this and other types of advanced antennas.

Funding statement

This research received no specific grant from any funding agency, commercial, or not-for-profit sectors.

Competing interests

The authors report no conflict of interest.

Sarbagya Ratna Shakya received the B. Eng. in Electronics Engineering from National College of Engineering, Tribhuvan University of Nepal, in 2009; received his M. Eng. in Computer Engineering from Nepal College of Information Technology, Pokhara University of Nepal, in 2014; and received his Ph.D. in Computational Science (Computer Science) from the School of Computer Science and Computer Engineering, University of Southern Mississippi, USA, in 2021. Currently, he is working as an Assistant Professor at Eastern New Mexico University, New Mexico, USA. He has more than 10 years of teaching experience in undergraduate level courses. His research interests include machine learning, deep learning, image processing, Internet of Things, rapid prototyping and wearable devices, and antenna design. He has published papers in academic journals and conference proceedings in different domains on applications of machine learning and deep learning.

Matthew Kube received his bachelor’s degree in Electrical Engineering and Technology from Eastern New Mexico University in the spring of 2023. His research interests primarily include ham radio communications, microwave communication, and computer science topics. He has shown an aptitude for engineering and technology and plans to pursue further education in the field.

Zhaoxian Zhou received the Bachelor of Engineering in Electrical Engineering from the University of Science and Technology of China in 1991; Master of Engineering in Electrical Engineering from the National University of Singapore in 1999; and the PhD degree in Electrical and Computer Engineering from the University of New Mexico in 2005. From 1991 to 1997, he was an electrical engineer in China Research Institute of Radio Wave Propagation. He serves the University of Southern Mississippi as a professor in the School of Computing Sciences and Computer Engineering. His research interests include wireless sensor networks, human activity recognition, face recognition, and image processing; big data analytics, machine learning; rapid prototyping, and wearable devices; computational and ultrawide band electromagnetics; numerical analysis, high performance, and scientific computing; antennas, radio wave propagation, and wireless communications; high power microwave, pulsed power, and plasma science; and engineering education. He has published more than 40 papers in academic journals and conference proceedings.

References

Alieldin, A, Huang, Y, Boyes, SJ, Stanley, M, Joseph, SD, Hua, Q and Lei, D (2018) A triple-band dual-polarized indoor base station antenna for 2G, 3G, 4G and Sub-6 GHz 5G applications. IEEE Access 6, 49209–49216.CrossRef Google Scholar

Pandey, A, Singh, AP and Kumar, V (2023) Design and optimization of circularly polarized dielectric resonator-based MIMO antenna using machine learning for 5G Sub-6 GHz. AEU – International Journal of Electronics and Communications 162, .CrossRef Google Scholar

Mendes, C and Peixeiro, C (2017) A dual-mode single-band wearable microstrip antenna for body area networks. IEEE Antennas and Wireless Propagation Letters 16, 3055–3058.CrossRef Google Scholar

Chen, Y, Elsherbeni, AZ and Demir, V (2022) Machine learning for microstrip patch antenna design: Observations and recommendations. In 2022 United States National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM). 256–257.CrossRef Google Scholar

Liu, B, Aliakbarian, H, Ma, Z, Vandenbosch, GAE, Gielen, GGE and Excell, PS (2014) An efficient method for antenna design optimization based on evolutionary computation and machine learning techniques. IEEE Transactions on Antennas and Propagation 62, 7–18.CrossRef Google Scholar

John, M and Ammann, MJ (2009) Antenna optimization with a computationally efficient multiobjective evolutionary algorithm. IEEE Transactions on Antennas and Propagation 57(1), 260–263.CrossRef Google Scholar

Wu, Q, Cao, Y, Wang, H and Hong, W (2020) Machine-learning-assisted optimization and its application to antenna designs: Opportunities and challenges. China Communications 17, 152–164.CrossRef Google Scholar

El Misilmani, HM and Naous, T (2019) Machine learning in antenna design: An overview on machine learning concept and algorithms. In 2019 International Conference on High Performance Computing & Simulation (HPCS). 600–607.CrossRef Google Scholar

Forrester, AIJ and Keane, AJ (2009) Recent advances in surrogate-based optimization. Progress in Aerospace Sciences 45(1–3), 50–79.CrossRef Google Scholar

Koziel, S and Ogurtsov, S (2014) Antenna Design by Simulation-Driven Optimization. Surrogate-Based Approach. New York: Springer.CrossRef Google Scholar

Zhang, J, Akinsolu, MO, Liu, B and Vandenbosch, GAE (2021) Automatic AI-driven design of mutual coupling reducing topologies for frequency reconfigurable antenna arrays. IEEE Transactions on Antennas and Propagation 69(3), 1831–1836.CrossRef Google Scholar

Shrestha, S, Fu, X and Hong, M (2023) Towards efficient and optimal joint beamforming and antenna selection: A machine learning approach. In ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1–5.CrossRef Google Scholar

Sendrea, RE, Zekios, CL and Georgakopoulos, SV (2022) Multi-fidelity surrogate modeling based on analytical eigenfunction expansions. In IEEE Transactions on Antennas and Propagation.CrossRef Google Scholar

Koziel, S, Belen, MA, Çaliskan, A and Mahouti, P (2023) Rapid design of 3D reflectarray antennas by inverse surrogate modeling and regularization. IEEE Access 11, 24175–24184.CrossRef Google Scholar

Zhang, B, Song, L and Rahmat-Sammi, Y (2023) Textile patch antenna surrogate-based optimization: Kriging surrogate modeling on equivalent circuit components. In 2023 United States National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM). 145–146.CrossRef Google Scholar

Kalayci, H, Ayten, UE and Mahouti, P (2022) Ensemble-based surrogate modeling of microwave antennas using XGBoost algorithm. International Journal of Numerical Modelling: Electronic Networks, Devices and Fields 35(2), .CrossRef Google Scholar

Pietrenko-Dabrowska, A, Koziel, S and Ullah, U (2022) Reduced-cost two-level surrogate antenna modeling using domain confinement and response features. Scientific Reports 12(1), .CrossRef Google Scholar PubMed

Piltan, OC, Kizilay, A, Belen, MA and Mahouti, P (2023) Data driven surrogate modeling of horn antennas for optimal determination of radiation pattern and size using deep learning. Microwave and Optical Technology Letters (accessed 5 April 2023).CrossRef Google Scholar

Florencio, R and Encinar, JA (2013) Enhanced MoM analysis of the scattering by periodic strip gratings in multilayered substrates. IEEE Transactions on Antennas and Propagation 61(10), 5088–5099.CrossRef Google Scholar

Prado, DR, López-Fernández, JA, Arrebola, M and Goussetis, G (2018) Efficient shaped-beam reflectarray design using machine learning techniques. In 2018 15th European Radar Conference (EuRAD). 525–528.CrossRef Google Scholar

Güneş, F (2008) Support vector characterisation of the microstrip antennas based on measurements. Progress In Electromagnetics Research B 5, 49–61.Google Scholar

Gianfagna, C, Swaminathan, M, Raj, PM, Tummala, R and Antonini, G (2015) Enabling antenna design with nano-magnetic materials using machine learning. In 2015 IEEE Nanotechnology Materials and Devices Conference (NMDC). 1–5.CrossRef Google Scholar

Tan, J (2023) Machine learning-assisted method for efficient and accurate antenna modelling, https://etheses.whiterose.ac.uk/32549/ (accessed 15 March 2023).Google Scholar

Tak, J, Kantemur, A, Sharma, Y and Xin, H (2018) A 3-D-printed W-band slotted waveguide array antenna optimized using machine learning. IEEE Antennas and Wireless Propagation Letters 17, 2008–2012.CrossRef Google Scholar

Korkmaz, S, Alibakhshikenari, M and Kouhalvandi, L (2023) A framework for optimizing antenna through genetic algorithm-based neural network. Acta Marisiensis. Seria Technologica 20(1), 49–53.CrossRef Google Scholar

Wang, J, Wang, X and Yi, J (2023) A hybrid particle swarm optimization algorithm with dynamic adjustment of inertia weight based on a new feature selection method to optimize SVM parameters. Entropy 25(3), .Google Scholar PubMed

Gregory, MD, Bayraktar, Z and Werner, DH (2011) Fast optimization of electromagnetic design problems using the covariance matrix adaptation evolutionary strategy. IEEE Transactions on Antennas and Propagation 59(4), 1275–1285.CrossRef Google Scholar

Hoorfar, A (2007) Evolutionary programming in electromagnetic optimization: A review. IEEE Transactions on Antennas and Propagation 55(3), 523–537.CrossRef Google Scholar

Silva, CRM and Martins, SR (2013) An adaptive evolutionary algorithm for UWB microstrip antennas optimization using a machine learning technique. Microwave and Optical Technology Letters 55(8), 1864–1868.CrossRef Google Scholar

Oliveri, G, Gottardi, G, Robol, F, Polo, A, Poli, L, Salucci, M, Chuan, M, Massagrande, C, Vinetti, P, Mattivi, M and Lombardi, R (2017) Codesign of unconventional array architectures and antenna elements for 5G Base Stations. IEEE Transactions on Antennas and Propagation 65(12), 6752–6767.CrossRef Google Scholar

Dai, X and Luk, K-M (2021) A planar wideband millimeter-wave antenna array with low sidelobe using ‘±1’ excitations. IEEE Transactions on Antennas and Propagation 69(10), 6999–7004.CrossRef Google Scholar

Jain, SK (2016) Bandwidth enhancement of patch antennas using neural network dependent modified optimizer. International Journal of Microwave and Wireless Technologies 8(7), 1111–1119.CrossRef Google Scholar

Anuradha, AP and Sinha, SN (2011) Design of custom-made fractal multi-band antennas using ANN-PSO [Antenna Designer’s Notebook]. IEEE Antennas and Propagation Magazine 53(4), 94–101.CrossRef Google Scholar

Chen, XH, Guo, XX, Pei, JM and Man, WY (2017) A hybrid algorithm of differential evolution and machine learning for electromagnetic structure optimization. In 2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC). 755–759.CrossRef Google Scholar

Liu, F, Ban, Y-L and Jay Guo, Y (2020) Synthesis of large unequally spaced planar arrays utilizing differential evolution with new encoding mechanism and cauchy mutation. IEEE Transactions on Antennas and Propagation 68(6), 4406–4416.CrossRef Google Scholar

Bayraktar, Z, Komurcu, M, Bossard, JA and Werner, DH (2013) The wind driven optimization technique and its application in electromagnetics. IEEE Transactions on Antennas and Propagation 61(5), 2745–2757.CrossRef Google Scholar

Ren, Q, Li, M and Han, S (2019) Tectonic discrimination of olivine in basalt using data mining techniques based on major elements: A comparative study from multiple perspectives. Big Earth Data 3, 1–18.CrossRef Google Scholar

Shreya, S (2021) Dataset containing antenna parameters. https://www.kaggle.com/datasets/shreyasinha/dataset-containing-antenna-parameters.Google Scholar

Balanis, CA (2005) Antenna Theory: Analysis and Design. Hoboken, New Jersey: Wiley-Interscience.Google Scholar

Kashyap, S and Sandeep, BS (2012) Design and simulation of micro strip patch array antenna for wireless communications at 2.4 GHz. International Journal of Scientific Engineering and Research 3, 1–5.Google Scholar