This paper presents a reliability analysis method for automated vehicles equipped with adaptive cruise control (ACC) and autonomous emergency braking (AEB) systems to avoid collision with an obstacle in front of the vehicle. The proposed approach consists of two main elements, namely uncertainty modeling of traffic conditions and model-based reliability analysis. In the uncertainty modeling step, a recently developed Gaussian mixture copula (GMC) method is employed to accurately represent the uncertainty in the road traffic conditions using the real-world data, and to capture the complicated correlations between different variables. Based on the uncertainty modeling of traffic conditions, an adaptive Kriging surrogate modeling method with an active learning function is then used to efficiently and accurately evaluate the collision-avoidance reliability of an automated vehicle. The application of the proposed method to the Department of Transportation Safety Pilot Model Deployment database and an in-house built Advanced Driver Assist Systems with ACC and AEB controllers demonstrate the effectiveness of the proposed method in evaluating the collision-avoidance reliability.

## Introduction

Autonomous vehicle (AV) is considered as one of the game-changing technologies that will change the transportation styles of current society. Research shows that AV has promising potential in improving the road safety by eliminating human errors [1], and easing the traffic congestions through optimal control [2,3]. Before the mass production and the deployment of AVs in real traffic environment, the safety and reliability of AVs are recognized as one of the most important factors that need to be assured.

Vehicle collision avoidance system is an essential element of the autonomous driving technology. It plays a vital role in avoiding collision during the driving process and warning the passengers to switch to traditional driving if they feel uncomfortable. Analyzing the risk of collision of AVs in real-world traffic condition is an indispensable step in the development of AVs and has gained much attention in recent years. For instance, European New Car Assessment Program (NCAP) [4] has developed multiple testing scenarios to verify the versatility of the control logic of AVs in the collision avoidance situation. In order to provide accurate warning in a timely manner to the driver, Wu et al. [5] proposed a method for the vehicle motion prediction and collision risk assessment based on the simulation of vehicular cyber physical system. Chang et al. [6] suggested a collision avoidance system based on short range communications. Deng et al. [7] investigated the risk of collision in the car-following scenario under bad weather conditions. Similarly, Zhao et al. [8–10] applied importance sampling approach to test-based collision risk assessment of AVs. They investigated the performance of collision avoidance systems in different traffic conditions, such as car-following scenario and lane-change scenario. Inspired by the application of importance sampling approach to the collision-avoidance performance evaluation of AVs [8], various approaches have been proposed recently including piecewise mixture models-based method [11], application to cut-in scenario [12], and integration of importance sampling approach with kernel method [13].

Even though various approaches have been developed to evaluate the collision risk of vehicles under various traffic conditions, most of the approaches make assumptions on the randomness or uncertainty variables in the collision-avoidance system. To release this assumption and simplification, naturalistic driving data [10] have been extracted to capture the variability in the real-world traffic conditions. The current methods [10,11], however, usually employ a specific statistical model such as exponential distribution or piecewise mixture models. These statistical models cannot accurately capture the complicated correlations between different random variables and thus lead to errors in the collision reliability analysis of AVs. In addition, a major challenge in the collision reliability analysis of AVs is how to effectively reduce the number of function evaluations to the computer simulation model or the number of vehicle tests if testing-based method is employed. The application of importance sampling approach to collision avoidance system performance analysis has shown promising potential in reducing the number of tests and reducing the testing time [8]. The required number of tests, however, is still very high to reach to an accurate estimation of the collision-avoidance reliability. Especially when the collision-avoidance reliability is high, the required computational or testing resources are practically not affordable.

Aims to efficiently perform reliability analysis without sacrificing the accuracy, various adaptive surrogate modeling-based methods have been proposed in recent years. Two most widely used approaches are the efficient global reliability analysis method [14] and adaptive-Kriging Monte Carlo simulation (AK-MCS) approach [15]. Following these two methods, various adaptive surrogate modeling-based reliability analysis methods have been proposed, such as the global sensitivity analysis enhanced method [16], adaptive surrogate-based system reliability analysis [17], and single-loop Kriging for time-dependent problems [18]. Results of numerical case studies of these methods have shown that the adaptive surrogate modeling-based reliability analysis methods can dramatically reduce the number of function evaluations in reliability analysis. It provides a promising direction to overcome the challenges in the collision-avoidance reliability analysis of AVs.

This paper aims to develop a systematic collision-avoidance reliability analysis framework for AVs equipped with adaptive cruise control (ACC) and autonomous emergency braking (AEB) systems. The developed framework consists of two main elements, namely uncertainty modeling and reliability analysis. In the uncertainty modeling module, a recently developed Gaussian mixture copula (GMC) is employed to model the uncertain traffic conditions based on the naturalistic driving data. The Gaussian mixture copula allows for not only accurate modeling of the marginal distributions of various traffic conditions but also accurate representation of the complicated dependence between different variables. In the reliability analysis element, the adaptive surrogate modeling-based reliability analysis method with active learning function is employed to efficiently perform collision-avoidance reliability analysis. The synthesis of these two elements enables us to accurately and efficiently evaluate the collision-avoidance reliability of an AV control system. Based on the developed collision-avoidance reliability analysis framework, the potential application of the proposed method in reliability-based design optimization of controller parameters is discussed. The effectiveness of the developed approaches is demonstrated using the Department of Transportation Safety Pilot Model Deployment database and an in-house built Advanced Driver Assist Systems. The contributions of this research are therefore summarized as: (1) uncertainty modeling of the uncertainty sources in the real-world driving environment using a new Gaussian mixture copula method based on naturalistic driving data; (2) employment of adaptive surrogate modeling approach to reduce the number of simulations/tests in AV collision reliability analysis; and (3) synthesis of these two approaches and demonstration of the developed framework using the Safety Pilot Model Deployment database.

The remainder of this paper is organized as follows: Section 2 provides a background introduction of collision-avoidance reliability analysis of automated vehicles. Section 3 gives the adaptive surrogate modeling-based collision-avoidance reliability analysis method. Section 4 illustrates the effectiveness and the implementation procedure of the proposed method using a case study, and Sec. 5 presents the concluding remarks.

## Collision Avoidance System of an Automated Vehicle

This section briefly introduces the background of collision-avoidance reliability analysis of an automated vehicle.

### Collision Avoidance System of an Automated Vehicle.

Collision avoidance system is usually designed to avoid the collision between an automated vehicle and an obstacle in front of the vehicle (as shown in Fig. 1). The obstacle can be any object in front of the automated vehicle, such as a leading passenger car, a bicyclist, a truck, or any object that appears suddenly on the road and in front of the vehicle. These obstacles can be detected by the sensors equipped on the automated vehicle. The commonly used sensors include ultrasonic, LIDAR, radar, and camera [19]. The detection sensor can determine the distance between itself and the obstacle, as well as the relative speed between the two objects. Based on the information from the detection sensors, the collision-avoidance system will implement certain control strategy to decide if a certain braking system should be activated and thus avoid the collision with the obstacle.

*t*is defined as

*t*.

Note that the TTC is also affected by other vehicle physical parameters and environmental variables such as the friction coefficient between the vehicle and road surface, the road condition, and the weather. More sophisticated simulations or expressions need to be employed to compute the TTC if the physical parameters need to be considered. Since $vA(t)$ is changing over time and is affected by the dynamics of vehicle systems and controllers, vehicle dynamic simulations need to be employed to predict $vA(t)$ and thus compute the TTC over a simulation period. In this paper, the speed of the obstacle $v0$ is assumed to be constant in this short time period of collision-avoidance event. Since the speed of the obstacle at that short time period or time instant is just one realization of the random speed, the collision-avoidance reliability obtained in this paper is only the reliability at that short time period or time instant. The developed method is also applicable in the case that $v0$ is modeled as a time-series model. When the time-series model is used to represent $v0$, the correlation of the speeds between different time instants needs to be considered and the reliability becomes time-dependent. In that case, the time-dependent reliability analysis approaches [18,21,22] need to be employed to consider the correlations of speed over time.

Depending on the TTC at any time instant, the collision-avoidance system will take a specific action according to a predefined control logic. Figure 2 gives a schematic control logic for collision avoidance based on TTC.

As shown in Fig. 2, if $TTC>t1$, the automated vehicle is safe at this moment and it will continue with the normal driving using ACC. If $t2<TTC<t1$, the risk of collision is increasing and a warning will be given to the passengers. The system will let the passengers to make decision in this case. Otherwise, the AEB system will be active to avoid the collision. Even for the AEB system, there are also several braking stages, such as full brake and other braking scenarios [23]. For different companies, the control logic may be different. In this paper, an in-house-built ACC and AEB systems are used to illustrate the proposed collision-avoidance reliability analysis in Sec. 4. The developed framework is applicable to any automated vehicle collision simulation models, such as the commercial software carsim [24].

### Reliability of Collision Avoidance System.

The advanced driver assistant system with adaptive cruise control and autonomous emergency breaking is able to dynamically adjust the vehicle speed to avoid collision. The automated vehicle, however, still has a certain probability of collision with the obstacle due to various uncertainty sources in the traffic conditions and vehicle systems.

where $Pr{\u22c5}$ stands for probability operator, $X$ is a vector of random variables (e.g., $V0$, $VA$, and $ldis$) in the collision-avoidance system, $\beta $ is a vector of controller parameters, and $[0,\u2009t]$ is a time period of interest.

The reliability-based optimization of the controller parameters (i.e., Eq. (4)) enables us to optimize the controller parameters in the early design stage with the consideration of reliability constraints. It can dramatically accelerate the development process of the autonomous vehicles. As an essential step in solving the optimization model, evaluating the reliability constraints (i.e., Eq. (3)), however, is not straightforward. The following two main challenges need to be addressed to accurately and efficiently solve Eq. (3):

First, as inputs of Eq. (3), the random variables $X$ need to be accurately modeled based on the real-world traffic data. The modeling of these random variables requires statistical approaches that are able to accurately model both the marginal probability density function (PDF) of each random variable and the joint PDF of multiple variables by accounting for the complicated dependence between variables. How to accurately model the uncertainty sources in the collision-avoidance system based on data (which could be noisy) is the first challenge that needs to be solved.

Second, collision-avoidance reliability needs to execute the collision-avoidance vehicle simulation numerous times (e.g., greater than 1 × 10

^{5}) at different realizations of random variables [8]. This could be very computationally expensive if sophisticated computer simulation model (e.g., carsim [24] or Hardware-in-the-loop simulations) is employed. How to reduce the required computational/testing effort in collision-avoidance reliability analysis without sacrificing the accuracy is the second challenge that needs to be addressed.

Next, a collision-avoidance reliability analysis framework is developed to assess the reliability of the AV collision-avoidance system by addressing the above two challenges.

## Collision Avoidance Reliability Analysis Based on Adaptive Surrogate Modeling

There are two main elements in the developed collision-avoidance reliability analysis framework, namely: (1) uncertainty modeling and (2) reliability analysis. The uncertainty modeling element, which addresses the first challenge, aims to accurately represent the uncertainty in the traffic conditions such as vehicle speed and relative distance between the automated vehicle and the obstacle (see Fig. 1). Following that, the reliability analysis element evaluates the collision-avoidance reliability by addressing the second challenge. In the subsequent sections (i.e., Secs. 3.1 and 3.2), these two elements are explained in details.

### Uncertainty Modeling.

The objective of uncertainty modeling is to build a probabilistic model using data analytic methods to accurately represent both the marginal and joint PDFs of the random variables in the collision-avoidance system. In the past decades, various data-driven approaches have been developed to approximate the PDF of random variables based on data. The commonly used approaches can be roughly classified into two groups: parametric approach and nonparametric approach.

Parametric approach: This group of approaches fit the data to a specific probability distribution, such as exponential, Gaussian, or Weibull distributions [25–27]. For these specific distributions, methods are then developed to estimate the distribution parameters using maximum likelihood estimation method or Bayesian approach [28–30].

Nonparametric approach: Without assuming a certain distribution type, nonparametric approaches represent any arbitrary distribution shapes (e.g., multimodal distributions) purely based on data using kernel functions or mixture models. The commonly used nonparametric approaches include kernel density smoothing function [31,32], Gaussian mixture model (GMM) [33], and copula functions [34].

The above data-driven uncertainty modeling approaches [28,31,33] have their own advantages and disadvantages. For example, kernel density smoothing method can flexibly represent an arbitrary distribution based on data. However, it has difficulty in handling high-dimensional problems. The same issue exists for copula function-based methods. GMM is the most flexible method among the available methods. The use of Gaussian components in the mixture model, however, adds limitations to the GMM method. To overcome the disadvantages of different approaches in data-driven uncertainty modeling, a GMC method is developed recently in Ref. [35]. The GMC method seamlessly integrates kernel density function and GMM with copula function. It inherits the advantages of these three methods and performs better than all these three approaches for different data sets [35]. In this paper, the GMC method is employed to model the uncertainty sources in the collision-avoidance system based on data. Next, we briefly review the GMC method.

*n*-dimensional distribution function, $F(x1,\u2009x2,\u2009\u2026,\u2009xn)$ is connected with the marginal distributions, $F1(x1),\u2009\u2026,\u2009Fn(xn)$ as follows:

where $C(\u22c5)$ is a copula function and $Fi(xi)=ui$ is the cumulative density function (CDF) of the *i*th random variable $Xi,\u2009\u2200i=1,\u2009\u2026,\u2009n$, and *n* is the number of random variables. $C(u1,\u2009u2,\u2009\u2026,\u2009un)$ is a unique function if $Fi(xi),\u2009\u2200i=1,\u2009\u2026,\u2009n$ are all continuous [34].

*t*copula, and several other bivariate copulas [37]. In many practical applications, the current available copula functions, however, cannot accurately capture the complicated dependences between multiple variables. Inspired by the fact that kernel density function or GMM model can be employed to approximate an arbitrary probability distribution, a GMC model is developed in Ref. [35]. The GMC model preserves the characteristics of the copula function given in Eq. (5) (i.e., the copula function is independent from the marginal distribution). The marginal CDFs (i.e., $F1(x1),\u2009\u2026,\u2009Fn(xn)$) are connected with $F(x1,\u2009x2,\u2009\u2026,\u2009xn)$ in GMC through a generalized copula as follows:

where $\Phi \u22121(\u22c5)$ is the inverse CDF of a standard normal variable, $Cnew(\u22c5,\u2009\u22c5,\u2009\u2026,\u2009\u22c5;\u2009\theta )$ is an unknown new copula function, and $\theta $ is a vector of unknown parameters of the new copula function. Note that if a multivariate Gaussian distribution is employed for $Cnew(\u22c5)$, the new copula will become a Gaussian copula.

where $\varphi (\u22c5)$ is the PDF of a standard normal variable.

where $z=[\Phi \u22121(u1),\u2009\Phi \u22121(u2),\u2009\u2026,\u2009\Phi \u22121(un)]$, *K* is the number of Gaussian components, $\lambda i$, $\mu i$, and $\Sigma i$, which are parameters $\theta $ of the new copula function, are, respectively, the weight, mean, and covariance of the *i*th Gaussian component.

The generalized procedure for uncertainty modeling in collision-avoidance system using GMC can be summarized as below:

- (1)
Collect data of random variables in the collision-avoidance systems of automated vehicles.

- (2)
Approximate the marginal distribution of each random variable using kernel density smoothing function method [31,32].

- (3)
Convert the original data into the standard normal space using kernel density smoothing function and learn a GMM model in the transformed space;

- (4)
Integrate the GMM model with kernel density function to obtain the GMC model.

- (5)
Predict the joint PDF of any uncertainty realization using the learned GMC model.

In Sec. 3.2, we will discuss how to perform collision-avoidance reliability analysis based on the uncertainty modeling.

### Collision-Avoidance Reliability Analysis Based on Adaptive Surrogate Modeling Uncertainty Propagation.

A straightforward way of performing collision-avoidance reliability is to repeatedly run the collision-avoidance simulations or tests at different realizations of random variables using MCS method. The MCS method, however, is computationally prohibitive if high-fidelity simulation model is used or actual tests are performed. As mentioned previously, importance sampling approach has been employed in collision-avoidance analysis in Refs. [8] and [9]. But the required computational effort is still high due to the iterative procedure to determine the optimal proposal density function using the cross-entropy method [38].

Since the development of the first adaptive surrogate modeling-based reliability analysis method in 2008 by Bichon et al. [14], adaptive surrogate modeling has gained much attention in recent year in the reliability analysis field due to its promising potential in overcoming the drawbacks of traditional reliability analysis methods, such as the first-order reliability method [39] and second-order reliability method [40]. It has shown that the adaptive surrogate modeling-based reliability analysis method can dramatically improve the efficiency of reliability analysis while maintaining satisfactory accuracy [41].

The basic idea of adaptive surrogate modeling-based reliability analysis methods is to then adaptively refining the surrogate model in the limit state (i.e., failure boundary) region. The refining of the surrogate model nearing the limit state allows us to efficiently and accurately build a classifier to classify the random samples into fail or safe regions and thus reduce the required computational effort. Figure 3 gives a generalized flowchart of adaptive surrogate modeling-based reliability analysis method. As indicated in the figure, there are two main elements in the flowchart, namely surrogate modeling and learning function. The surrogate model is a data-driven model such as support vector machine [42], polynomial chaos expansion [43], and Gaussian process model [22], which is built using training points to replace the original model. The learning function is a function used to identify new training points near the limit state for the refinement of the surrogate model. In this paper, the adaptive Gaussian process surrogate modeling-based reliability analysis method is employed for the collision-avoidance reliability analysis. Next, we briefly review the Gaussian process surrogate modeling method and various learning functions. Based on that, we discuss the application of the method in the reliability analysis of the collision-avoidance system.

where $\epsilon L=\u22122\sigma L(x)$ and $\epsilon U=2\sigma L(x)$ are user-specified error bounds.

*U*functions have been the most widely used learning functions. But they both have some drawbacks. For instance, it has been pointed out in Ref. [16] that both the EFF and

*U*functions cannot account for the complicated correlations among the Gaussian process predictions when they are utilized to select the new training points. Inspired by the EFF and

*U*functions and with a hope to further improve the efficiency of reliability analysis, various learning functions have been proposed in recent years. For example, the

*H*function [45]

where $fL(x)(l)$ is the PDF of the uncertain prediction.

where $\alpha t$ is the exploration–exploitation tradeoff coefficient that measures the uncertainty in the prediction.

The above learning functions are derived from different perspectives of reliability analysis. For example, the EFF function evaluates how close is a new point to the failure boundary; the *U* function determines the probability of making a classification error; the *H* function is defined according to the Shannon information entropy theory; the ERF function measures the expected risk/probability of making a mistake on the prediction, which is similar to the *U* function; the MCE function accounts for the probability density function in the training points selection; and the EU function includes the exploration and exploitation into the learning function. In addition to these learning functions, efforts have been made to integrate adaptive surrogate modeling methods with importance sampling method [16,51], which further improve the efficiency of reliability analysis.

*U*function performs better than the EFF function [15]. It is also observed that the fundamental idea of the

*H*function is similar to that of the EFF function since they follow a similar derivation procedure [45]. In addition, the ERF, EU, and MCE functions are all similar to

*U*function expect that EU and MCE functions have an additional term to account for the probability density function of input random variables. While EU function has a similar form as MCE, it has more parameters to tune and thus is more complicated for implementation. Based on these observations, the

*U*function and the MCE learning function are employed in this paper as illustrative examples for the collision-avoidance reliability analysis and the convergence criterion given in Ref. [16] is employed to check the convergence. The convergence criterion is defined as $\epsilon max<5%$, where $\epsilon max$ is the maximum potential percentage error of failure probability estimate and is given by [16]

where $N1$ is the number of samples that are classified as fail and with *U* values (i.e., Eq. (17)) greater than 2, $N2$ is the number of samples that are classified as fail and with *U* values less than 2, and $Ne$ is the total number of samples that have *U* values less than 2.

The overall procedure is summarized as follows:

Step 1: Generate initial training points for the random variables in the collision-avoidance system using Latin hypercube sampling (LHS) approach [52].

Step 2: Run the collision-avoidance simulation and obtain the minimum distance ($Lmin$) as the quantity of interest.

Step 3: Construct a surrogate model using Gaussian process surrogate modeling method.

Step 4: Check the convergence using the criterion given in Eq. (22) [16]. If the criterion is satisfied, stop and obtain the reliability estimate. Otherwise, go to the next step.

Step 5: Identify new training point using the

*U*function (i.e., Eq. (17)) or MCE function (i.e., Eq. (20)), and go back to step 3 after performing a new collision-avoidance simulation at the new training point.

In Sec. 4, a case study is used to demonstrate the application of aforementioned uncertainty modeling and reliability analysis framework to collision-avoidance reliability analysis.

## Case Study

A collision-avoidance system equipped with ACC and AEB given in Ref. [53] (as shown in Fig. 4) is adopted as our case study to demonstrate the proposed collision-avoidance reliability analysis framework. The simulation model is a simulink model built according to the control logic presented in Fig. 5. Figure 4 presents an overview of the simulink model. For the purpose of illustration, the simulation model used in this case study is relatively computationally cheap. Each execution of the simulink model [53] takes about 0.2 s. This computationally cheap simulation model allows us to verify the effectiveness of the proposed collision-avoidance reliability analysis approach using the brute-force Monte Carlo simulation. The proposed approach, however, is not limited to the simulation model used in this case study. It is also applicable to actual vehicle tests in the lab or simulation models (e.g., carsim [24], hardware-in-the-loop simulations), which are much more sophisticated and computationally expensive than our in-house built simulation model (i.e., Fig. 4).

In the case study problem, the system has five-level control strategies, which are explained as below and depicted in Fig. 5.

- (1)
Level 1: If $TTC>t4$, the vehicle will drive normally under ACC. No braking action is required.

- (2)
Level 2: If $t3<TTC<t4$, the system will give alerts to the passengers and let the passenger to make decision.

- (3)
Level 3: If $t2<TTC<t3$, the vehicle will brake and the acceleration is −0.3

*g*, where*g*is the gravitational constant. - (4)
Level 4: If $t1<TTC<t2$, the vehicle will brake with an acceleration of −0.6

*g*. - (5)
Level 5: If $0<TTC<t1$ or the relative distance between the vehicle and the obstacle is less than 2 m, the vehicle will brake with an acceleration of −

*g*.

In the above control strategies, the relationship between $ti$ and $ti+1$ is defined as $ti+1=2ti$. $t1$ is therefore a parameter that will affect the performance of the collision-avoidance system. The random variables in the system include the initial speed of the automated vehicle ($vA$), the speed of the obstacle ($v0$), and the relative distance between the vehicle and the obstacle ($Ldis$). Figure 6 presents the simulation results of one collision-avoidance simulation for a given realization of the random variables. Figure 6(a) gives the position of the vehicle and the obstacle with respect to the initial position over the simulation time period of interest. Figure 6(b) depicts the relative distance between these two objects over the simulation time period under the control strategy given in Fig. 5. It shows that the relative distance has a wavy nature due to the influence of the control logic depicted in Fig. 5. When the relative distance is too small, the vehicle will brake through AEB with a certain deceleration determined by the TTC (see Fig. 5). Due to the deceleration, the relative distance will increase. When the TTC meets the safety requirement, the vehicle will then accelerate to travel at a specific speed through ACC. We then perform reliability analysis based on this collision-avoidance simulation model using the methods presented in Sec. 3.

### Uncertainty Modeling of Traffic Conditions.

In order to model the uncertainty in the traffic conditions, we first extract data of $v0$, $vA$, and $Ldis$ from the Department of Transportation Safety Pilot Model Deployment database [54]. The data were collected in Ann Arbor, MI starting in August 2012 and contain various real-world vehicle operation information, such as vehicle speed, and relative distance between a vehicle and an object in front of the vehicle. It contains the naturalistic driving data of over 2800 vehicles equipped with wireless connected vehicle devices for more than 2 years. As of April 2015, 34.9 × 10^{6} miles were logged [8]. Figure 7 gives a snapshot one of the data sets (i.e., DataFrontTargets) [54] from the Safety Pilot Model Deployment database. Even though the data have been collected over a long time period with a large number of vehicles, it may not fully account for all the uncertainty sources in the real-world driving environment. In this paper, it is assumed that the data presented in this database are representative of the uncertainty sources in the driving environment. Including other uncertainty sources such as weather, road condition, and road profile, in the autonomous vehicle reliability analysis is worth pursuing in future.

Based on the extracted driving data, we then build probability models for the random variables. Figure 8 shows the marginal PDFs of the three random variables in the collision-avoidance simulation model. Following that, Fig. 9 gives the joint PDF of different variables. As shown in Figs. 8 and 9, the distributions of vehicle speed and relative distance present multimodal properties. It implies that it is not appropriate to use parametric approaches to model these uncertainty sources.

### Collision-Avoidance Reliability Analysis.

Based on the uncertainty modeling, we then perform collision-avoidance reliability analysis using the adaptive surrogate modeling method discussed in Sec. 3.2. The Gaussian process surrogate model with zero-order trend function is employed as our surrogate model.

We first generate 200 initial training points for the surrogate modeling using the LHS approach. Using the 200 initial training points, we then perform leave-one-out cross validation for Kriging surrogate models with different types of correlation function. The cross-validation mean square errors for the surrogate models with the squared-exponential function (the most commonly used one), the exponential function, and cubic spline function are, respectively, 68.19, 14.33, and 50.73. In this paper, the exponential correlation function is therefore used in Eq. (14).

The initial surrogate model is then refined for the reliability analysis purpose using the learning functions given in Sec. 3.2. Figure 10 shows the convergence history of the failure probability estimate as a function of the number of collision-avoidance simulations obtained using the MCE learning function and the *U* function. Along with the results of adaptive surrogate modeling methods, in this figure, we also plot the failure probability estimate obtained by directly constructing Kriging surrogate model using different numbers of LHS samples. It shows that the adaptive surrogate modeling methods (*U* or MCS learning functions) are much more efficient than the direct surrogate modeling using LHS samples. For this particular example, the *U* function given in Eq. (17) performs much better than the MCE function given in Eq. (20)). We, therefore, employ the *U* function as our learning function for the surrogate model refinement in reliability analysis.

In addition, the figure shows that the proposed collision-avoidance reliability analysis framework only needs to evaluate the collision-avoidance simulation model 1037 times to get a very accurate estimate of the failure probability (i.e., 1—reliability). The MCS result in Fig. 10 is based on 2 × 10^{6} samples. This demonstrates the effectiveness of the proposed framework in improving the accuracy and efficiency of collision-avoidance reliability analysis.

Figure 11 presents the collision failure probability as a function of the controller parameter *t*_{1}. It indicates that the collision failure probability will decrease when *t*_{1} increases. Increasing the value of *t*_{1}, however, will increase the frequency of unnecessary brake actions, which may cause uncomfortable feelings for the passenger and increase the fuel consumption. Based on the curve given in Fig. 11, we can determine an appropriate value for *t*_{1} by solving the optimization model given in Eq. (4). For example, a value of 0. 8 should be employed if we have a reliability requirement of 0.998.

### Discussion and Path Forward.

Even though the adaptive Kriging surrogate model drastically reduces the required number of simulations in collision-avoidance reliability analysis, the number of simulations (i.e., 1037) is still quite high. In the reliability analysis literature [14–16], it is observed that the adaptive Kriging surrogate modeling-based reliability analysis methods usually can reduce the number of simulations/tests to a number close to or less than 100, for problems with three random variables. It implies that the performance of the adaptive Kriging approach in this paper is not as good as expected, even if it is much more efficient than current available methods for autonomous vehicle collision-avoidance reliability analysis [8,11]. In order to explain this phenomenon and provide guidance for future improvements, we plot the minimum relative distance response of this case study. Figures 12–14 show the response function by, respectively, fixing one of the three input variables. We obtain the following two observations from the results of these figures: (1) the response is quite nonsmooth with almost discontinuous response in certain regions (see Fig. 13) and (2) the failure boundary or limit state (i.e., minimum relative distance = 0 m) is quite flat (see Figs. 12 and 14).

The almost discontinuous response in certain regions (i.e., the first observation) poses challenges to the construction of Kriging surrogate model. The reason is that the ordinary Kriging employed in this paper uses a fixed correlation function (i.e., kernel) for the whole design domain. When the response exhibits certain discontinuous characteristics, it is hard to determine an optimal correlation function that is universal to the whole design space. Even though the correlation function is not optimal, we can still construct a Kriging surrogate model. But the required number of function evaluations will be much higher than that for the response without discontinuous regions. The adaptive surrogate modeling approach used in this paper can reduce the influence of the discontinuity on the required effort for Kriging surrogate modeling, since we are only interested in the prediction accuracy near the failure boundary in the adaptive surrogate modeling method. This is also why the adaptive surrogate modeling is much more efficient than the direct surrogate modeling using space-filling LHS samples (as shown in Fig. 10). However, the adaptive surrogate modeling methods cannot completely eliminate the influence of the discontinuity due to the issue of using a fixed correlation function as been discussed previously. This is one of the reasons that the adaptive Kriging surrogate modeling method is not as good as expected. There are several possible ways to address the challenges caused by the discontinuity in the function response.

First, the nonstationary Kriging surrogate modeling methods [55,56] can be employed to tackle the abrupt changes (i.e., discontinuity) in the function response. In nonstationary Kriging surrogate model, a nonstationary covariance function is used in the surrogate modeling rather than a fixed stationary correlation function. The nonstationary covariance function allows us to determine the best correlation structure for different local regions, and thus better capture the statistical relationship between the responses even if there are some discontinuous regions. This type of Kriging surrogate modeling method has been widely used in the geostatistics communities [57] to handle the discontinuity in the soil properties, and been introduced into engineering design field in recent years [58,59].

Second, other types of surrogate modeling-based reliability analysis methods, which can directly handle the discontinuity in the function response, can be investigated for the autonomous vehicle reliability analysis. For instance, the support vector machine-based approach suggested in Ref. [60], and the neural network-based method [61].

For the second observation (i.e., the limit state is quite flat in certain regions), it requires a large number of training points to accurately perform reliability analysis, no matter what type of surrogate model is employed. This is also the main reason that the adaptive Kriging-based method is not as good as expected. This challenge can be addressed by combining the adaptive surrogate modeling method with importance sampling approach or subset simulation method. The integration of these methods will enable us to further concentrate on small regions, which are critical for reliability analysis, and thus further improve the efficiency of the collision-avoidance reliability analysis.

Given the previous observations and discussions, possible approaches that can further improve the efficiency of the autonomous vehicle collision-avoidance reliability analysis method presented in this paper are the syntheses of importance sampling approach or subset simulation method with adaptive sampling method and other advanced surrogate modeling techniques, such as nonstationary Kriging, support vector machine, and neural networks. It is also worth investigating which surrogate modeling method is the best among the suggested options. It should be noted that approaches have been investigated for adaptive surrogate modeling using nonstationary Kriging and support vector machine. But the integration of adaptive sampling strategy with neural networks has rarely been studied.

## Conclusions

Assuring the safety and reliability of autonomous vehicles is one of the important steps in the development of AVs. The collision-avoidance simulations or real-world tests need to be evaluated numerous times in the analysis. This brings significant challenges to the reliability assurance of AVs if computationally expensive simulation models are used or actual field vehicle tests are performed.

Using an in-house built collision-avoidance system as an example, this paper develops a collision-avoidance reliability analysis framework by integrating a recently developed uncertainty modeling approach with adaptive surrogate modeling-based reliability analysis method. A Gaussian copula model is employed in the uncertainty modeling step to accurately represent various uncertainty sources based on real-world traffic data. Based on the uncertainty modeling, the adaptive surrogate modeling method is used to improve the efficiency of reliability analysis without sacrificing the accuracy. Results of a case study show that the developed method can efficiently estimate the failure probability of a collision-avoidance system. The developed method focuses on collision-avoidance reliability analysis of car-following scenario. The developed approach can be extended to other types of collision-avoidance reliability analysis (e.g., lane changing or head-on collision) by replacing the simulation models and data with the corresponding models and data.

In addition to the uncertainty in the traffic conditions, there are also many sources of uncertainty in the physical parameters of AVs in the mass production. Considering the physical uncertainty sources in the collision-avoidance reliability analysis will be investigated in our future study. In this paper, the adaptive surrogate modeling with active learning function is employed for reliability analysis. Integrating the adaptive surrogate modeling with importance sampling or subset simulation can further improve the efficiency of collision- avoidance reliability analysis. In addition, the response function as shown in Figs. 12–14 is quite highly nonlinear and even discontinuous in certain region. Replacing the ordinary Kriging model used in this paper with more advanced nonstationary Kriging models [55,56] or other types of surrogate models (e.g., support vector machine, neural networks) is also an interesting topic that worth pursuing.

## Acknowledgment

This research was partially supported by the Institute for Advanced Vehicle Systems (IAVS) seed funding at the University of Michigan-Dearborn for the third author, and by the Automotive Research Center (ARC) in accordance with Cooperative Agreement W56HZV-04-2-0001 U.S. Army Tank Automotive Research, Development and Engineering Center (TARDEC), Warren, MI, for the fourth author. The support is gratefully acknowledged.

## Funding Data

Automotive Research Center (Award No. W56HZV-04-2-0001; Funder ID: 10.13039/100008192).