Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription

Lei, Humin; Liu, Tao; Li, Deng; Ye, Jikun

doi:10.21595/jve.2017.18146

Journal of Vibroengineering

Browse Journal

Submit article

Published: 31 December 2017

Check for updates

Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription

Humin Lei¹

Tao Liu²

Deng Li³

Jikun Ye⁴

^{1, 2, 4}Air and Missile Defense College, Air Force Engineering University, Xi’an, 710051, P. R. China

³College of Education, Shaanxi Normal University, Xi’an, 710062, P. R. China

³Nanyang No. 2 High School, Nanyang, 473000, P. R. China

Corresponding Author:

Tao Liu

Cite the article Download PDF

Downloads 1648

WoS Core Citations 2

CrossRef Citations 2

Abstract

Direct transcription has been employed to transcribe the optimal control problem into a nonlinear programming problem. This paper presents a trajectory optimization method based on a combination of the direct transcription and mesh refinement algorithm. Hermite-Simpson method has the advantage of reasonable accuracy with highly sparse Hessian matrix and constraint Jacobians, and the pseudospectral method provides spectral accuracy for optimal control problems. The optimal control problem is discretized at a series of Legendre-Gauss-Lobatto points, then the trajectory states are approximated by using local Hermite interpolating polynomials. Thus, the method produces significantly smaller mesh size with a higher accuracy tolerance solution. The derived relative error estimation is then used to trade the number of mesh polynomials degree within each mesh interval with the number of mesh intervals. As a result, the suggested method can produce more small mesh size, requires less computation solution for the same optimal control problem. The simulation experiment results show that the suggested method has many advantages.

1. Introduction

Direct methods have been widely applied for the numerical solution of nonlinear optimal control problems [1-3]. The state and control of the optimal control problems are discretized at a series of suitable points in a direct method, then the continuous-time optimal control is converted into a finite dimensional nonlinear programming problem (NLP), the resulting NLP can be solved by NLP solver software [4].

With the raid development of computer technology, direct method is more and more widely applied to trajectory optimization problems [5-6], however, the low computational performance and accuracy make it difficult to use for real-time calculations. Therefore, the pseudospectral (PS) method has the advantage of high rate of convergence and large convergence radius [7-8], and provides spectral accuracy for smooth problems, but produces much denser constraints Jacobian as compared with other methods. In order to increase the sparsity of constraints Jacobians in PS method, Ross and Fahroo introduce the concepts of the knots for the Legendre PS method [9], Poustini et al. develop a trajectory optimization method based on some combination of the direct optimization method and differential flatness theory [10]. Some researchers combine the PS method and heuristic optimization method to improve trajectory method [11, 12], other researchers improve the adaptive mesh refinement algorithm to obtain higher accuracy solution with less computation time [6, 13-16]. Lei et al. develop an adaptive mesh refinement of hp PS method, the high accuracy and efficiency can be achieved by adaptive mesh refinement strategy [14]. Among the above methods, the Hermite-Simpson method has the advantage of reasonable accuracy with highly sparse Hessian matrix and constraint Jacobians [15-16]. Herman and Conway propose an additional high-order methods, however, when the Hermite interpolating polynomials extended to arbitrary higher orders [3], the framework needs more detailed derivation.

The purpose of this paper is to provide an alternative method to produce an optimal trajectory as based on a combination of the direct transcription and mesh refinement algorithm. The optimal control problem is discretized at a series of Legendre-Gauss-Lobatto (LGL) collocation points, then the state trajectories are approximated by using local Hermite interpolating polynomials, that the method in this paper is referred to as the Hermite-Legendre-Gauss-Lobatto (HLGL) method. It is noted that Williams provides a framework for arbitrary order and arbitrary number of intervals for implementation on digital computers [16]. It can be known from Ref. [16] that better accuracy can be achieved by increasing mesh polynomial degree $n$ for smooth regions, and increasing the number of subintervals for the corresponding nonsmooth regions of the solution. However, it is a fact that smooth regions and nonsmooth regions together exist in one solution of the problem. As the mesh polynomial degree and the number of mesh intervals are preset and the mesh polynomial degree $n$ are the same within each subinterval in Ref. [16], it is difficult to determine the mesh polynomial degree $n$ and the number of mesh intervals for that situation. Motived by the desire to trade the number of mesh polynomial degree $n$ with the number of mesh intervals, we develop an adaptive mesh refinement method based on direct transcription. A key contribution of this paper is that both mesh polynomial degree $n$ and the number of mesh intervals are allowed to vary, and the mesh polynomial degree $n$ within each mesh interval is not necessarily equal. Furthermore, the method also can improve computational efficiency by reducing the size of the mesh.

2. Optimal control problem

Without loss of generality, consider the following optimal control problems with inequality path constraints:

1

J = M (x (- 1), t_{0}, x (+ 1), t_{f}) + \frac{t_{f} - t_{0}}{2} \int_{- 1}^{+ 1} L (x (τ), u (τ), t (τ, t_{0}, t_{f})) d τ .

Subject to the constraints:

2

\frac{d x}{d τ} = \frac{t_{f} - t_{0}}{2} f (x (τ), u (τ), t (τ, t_{0}, t_{f})),

3

C (x (τ), u (τ), t (τ, t_{0}, t_{f})) \leq 0,

4

B (x (- 1), t_{0}, x (- 1), t_{f}) \leq 0,

here the term $x (τ) \in R^{n_{x}}$ denotes the state, and the term $u (τ) \in R^{n_{u}}$ denotes the control. In the Eqs. (1-4), the time domain $τ \in [- 1, + 1]$ is transformed from the time domain $t \in [t_{0}, t_{f}]$ by the following affine transformation:

5

τ = \frac{2 t}{t_{f} - t_{0}} - \frac{t_{f} + t_{0}}{t_{f} - t_{0}}, t = t (τ) = \frac{1}{2} (τ (t_{f} - t_{0}) + t_{0} + t_{f}),

where the terms $t_{0}$ and $t_{f}$ are represent for initial time and terminal time respectively. The basic idea of the approach is based on interpolating functions for state and costate on LGL quadrature nodes [17]. As the LGL nodes points are distributed over the interval [–1, 1], so it will be useful to transform the time interval.

The optimal control problem is described as to find the control variables $u (τ) \in R^{n_{u}}$ that making sure the performance index Eq. (5) is minimized, subject to the state Eq. (2), path constraints Eq. (3) and boundary conditions Eq. (4).

3. Adaptive mesh refinement methodology

3.1. Numerical discretization and approximation

The domain $τ \in [- 1, + 1]$ is divided into $K$ mesh subintervals $S_{k}$ when using mesh refinement method. Then we have:

6

⋃_{k = 1}^{K} S_{k} = [- 1, + 1], S_{k} = [T_{k - 1}, T_{k}] .

The mesh points have the property $- 1 = T_{0} < T_{1} < \dots < T_{k} = + 1$ . The state in the subintervals $S_{k}$ is approximated by the Hermite interpolating polynomial with nth order [9]:

7

x (τ) = a_{0} + a_{1} τ + \dots + a_{n} τ^{n}, τ \in [- 1, + 1] .

For ensure the integration accuracy and interpolation accuracy, the collocation points and nodes are defined as LGL points $υ_{k} (k = 1, . . ., n)$ within each interval. Note that there is no distinction between collocation points $τ_{j}$ and nodes $ς_{j}$ in some PS method [2], whereas the collocation points are used to formulate the residual equations for the NLP, and the nodes are used to form the interpolating polynomial in this paper. Thus, the collocations points and nodes defined according to:

8

τ_{j} = υ_{2 j - 1}, j = 1,2, . . ., m, ς_{j} = υ_{2 j}, j = 1,2, . . ., m - 1, m = (n + 1) / 2 .

Note that the values of the states and states derivatives at the points $τ_{j}$ determine the Hermite interpolating coefficients $a_{k} (k = 1, . . ., n)$ :

9

[\begin{matrix} x (τ_{1}) \\ x (τ_{2}) \\ ⋮ \\ x (τ_{m}) \\ h_{i} f (τ_{1}) \\ ⋮ \\ h_{i} f (τ_{m}) \end{matrix}] = [\begin{array}{l} 1 & τ_{1} & τ_{1}^{2} & \dots & τ_{1}^{n} \\ 1 & τ_{2} & τ_{2}^{2} & \dots & τ_{2}^{n} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & τ_{m} & τ_{m}^{2} & \dots & τ_{m}^{n} \\ 0 & 1 & 2 τ_{1}^{2} & \dots & n τ_{1}^{n - 1} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 1 & 2 τ_{m}^{} & \dots & n τ_{m}^{n - 1} \end{array}] [\begin{matrix} a_{0} \\ a_{1} \\ ⋮ \\ ⋮ \\ a_{n - 1} \\ a_{n} \end{matrix}],

where the interval length is defined by $h_{i} = (t_{i + 1} - t_{i}) / 2$ .

It is noted that the left side of Eq. (9) determined by the number and location of the nodes. Let $a = [a_{0}, a_{1}, . . ., a_{n}]^{T}$ , then $a = A^{- 1} b$ , where $A$ is the matrix on the right side of Eq. (9), and $b$ is the vector of the left side of Eq. (9). Then we have:

10

x (ς_{j}) = (c_{j} A^{- 1}) b = ϕ_{j}^{T} b, c_{j} = [\begin{array}{l} 1 & ς_{j} & \dots & ς_{j}^{n} \end{array}], j = 1,2, . . ., m - 1 .

Differentiating $x (ς_{j})$ in Eq. (10) with respect to $τ$ , we obtain:

11

\frac{d x (ς_{j})}{d τ} = x^{'} = a_{1} + 2 a_{2} ς_{j} + \dots + n a_{n} ς_{j}^{n - 1} = (d_{j}^{T} A^{- 1}) b = {ϕ^{'}}_{j}^{T} b,

where:

12

d_{j} = [\begin{array}{l} \begin{array}{l} 0 & 1 \end{array} & 2 ς_{j} & \dots & n ς_{j}^{n - 1} \end{array}], j = 1,2, . . ., m - 1 .

The system constraints in per interval can be expressed as:

13

△_{i} = [\begin{matrix} x' (ς_{1}) - h_{i} f (x (ς_{1}), u (ς_{1}), t (ς_{1})) \\ x' (ς_{2}) - h_{i} f (x (ς_{2}), u (ς_{2}), t (ς_{2})) \\ ⋮ \\ x' (ς_{m - 1}) - h_{i} f (x (ς_{m - 1}), u (ς_{m - 1}), t (ς_{m - 1})) \end{matrix}] = {Φ'}^{T} b - h_{i} f (Φ^{T} b, u (ς), t (ς)) = 0,

where:

14

Φ = [ϕ_{1}, ϕ_{2}, . . ., ϕ_{m - 1}], Φ^{'} = [{ϕ^{'}}_{1}, {ϕ^{'}}_{2}, . . ., {ϕ^{'}}_{m - 1}] .

The cost function is approximated by Gauss-Lobatto quadrature rule as:

15

J \approx M (x_{1}^{(1)}, t_{0}, x_{N}^{(K)}, t_{f}) + \frac{t_{f} - t_{0}}{2} \sum_{k = 1}^{K} \frac{h_{i}}{2} \sum_{j = 1}^{N_{k}} ω_{j}^{(k)} L (x_{j}^{(k)}, u_{j}^{(k)}, t (τ_{i}^{(k)})),

where the term $ω_{j}^{(k)}$ is the same as the $ω_{j}$ in Ref. [16].

3.2. Approximation of solution error

The estimate method for the relative error is similar to the error estimate obtained for numerically solving a differential equation through using the modified Euler Runge-Kutta scheme. Suppose the NLP of Eqs. (1-4) on a mesh $S_{k}$ , $k = 1, . . ., K$ with $N_{k}$ HLGL points has been solved. The ensuing mesh with $M_{k} = N_{k} + 1$ HLGL points $({\hat{τ}}_{1}^{(k)}, . . ., {\hat{τ}}_{M_{k}}^{(k)})$ .

where:

{\hat{τ}}_{1}^{(k)} = τ_{1}^{(k)} = T_{k - 1}, {\hat{τ}}_{M_{k}}^{(k)} = T_{k} .

Assume further that $(x ({\hat{τ}}_{1}^{(k)}), . . ., x ({\hat{τ}}_{M_{k}}^{(k)}))$ are the values of the state approximation at $({\hat{τ}}_{1}^{(k)}, . . ., {\hat{τ}}_{M_{k}}^{(k)})$ . We then have:

16

{\hat{x}}^{(k)} ({\hat{τ}}_{j}^{(k)}) = {\hat{x}}^{(k)} (τ_{k - 1}) + \frac{t_{f} - t_{0}}{2} \sum_{l = 1}^{M_{k}} (\frac{h_{l}}{2} f ({\hat{x}}^{(k)} ({\hat{τ}}_{j}^{(k)}), u^{(k)} ({\hat{τ}}_{j}^{(k)}), t ({\hat{τ}}_{i}^{(k)}, t_{0}, t_{f}))),

j = 1, . . ., M_{k} .

The absolute error and the relative error approximations at $({\hat{τ}}_{1}^{(k)}, . . ., {\hat{τ}}_{M_{k}}^{(k)})$ of the state are defined, respectively, as:

17

E_{i}^{(k)} ({\hat{τ}}_{l}^{(k)}) = |{\hat{x}}_{i}^{(k)} ({\hat{τ}}_{l}^{(k)}) - x_{i}^{(k)} ({\hat{τ}}_{l}^{(k)})|, e_{i}^{(k)} ({\hat{τ}}_{l}^{(k)}) = \frac{E_{i}^{(k)} ({\hat{τ}}_{l}^{(k)})}{1 + \underset{j \in [1, \dots, N_{k} + 1], k \in [1, \dots, K]}{m a x} |x_{i}^{(k)} (τ_{l}^{(k)})|},

[\begin{array}{l} l = 1, . . ., M_{k} \\ i = 1, . . ., n_{x} \end{array}] .

The maximum relative error in $S_{k}$ is then defined as:

18

e_{m a x}^{(k)} = \underset{i \in [1, \dots, n_{x}], l \in [1, \dots, M_{k} + 1]}{m a x} e_{i}^{(k)} ({\hat{τ}}_{j}^{(k)}) .

3.3. Refining the mesh

If a mesh interval has met the accuracy tolerance, that is $e_{m a x}^{(k)} \leq ε$ , where $ε$ is the desired relative tolerance, then mesh size is reduced by decreasing mesh polynomial degree or merging adjacent mesh interval, otherwise the mesh size need to be modified by increasing points or dividing the mesh interval into several subintervals. Let $κ^{(k)} (τ)$ be the curvature of the $i$ th component of the state in mesh interval $k$ , as:

19

κ^{(k)} (τ) = \frac{|{\ddot{X}}_{i}^{(k)} (τ)|}{|{(1 + {\dot{X}}_{i}^{(k)} (τ)^{2})}^{\frac{3}{2}}|} .

Let ${\bar{κ}}^{(k)}$ and $κ_{m a x}^{(k)}$ be the mean and maximum value of $κ^{(k)} (τ)$ , respectively. Then define $r_{k}$ as the ratio of the maximum to the mean curvature:

20

r_{k} = \frac{κ_{m a x}^{(k)}}{{\bar{κ}}^{(k)}} .

When $e_{m a x}^{(k)} > ε$ , and if $r_{k} < r_{m a x}$ , where $r_{m a x}$ is a user-defined parameter, the curvature is considered uniform in this interval mesh then the number of collocation points should be increased in interval $k$ . Let $N_{k}^{(M)}$ and $N_{k}^{(M + 1)}$ denote the number of collocation points in interval $k$ at mesh $M$ and $M + 1$ respectively, where $M$ is the mesh refinement iteration number. The number of points $N_{k}^{(M + 1)}$ at mesh $M + 1$ is calculated by the equation:

21

N_{k}^{(M + 1)} = N_{k}^{(M)} + P_{k}, P_{k} = 2 c e i l [(l o g_{10} (e_{m a x}^{(k)} / ε)) / 2] .

It is noted in Eq. (21) that the ratio of the maximum to the error tolerance have a direct effect on polynomial degree in mesh interval. An upper limit $N_{m a x}$ is set for the maximum allowable polynomial degree to make sure that the number of collocation points does not grow an unreasonably large value. If $N_{k}^{(M + 1)} > N_{m a x}$ (i.e. $N_{k}^{(M + 1)}$ exceeds the maximum allowable polynomial degree), then the mesh interval $S_{k}$ must be divided into equally spaced subintervals.

3.4. Generation of new mesh segment

Assume $e_{m a x}^{(k)} > ε$ and $r_{k} > r_{m a x}$ , then the $k$ th mesh interval should be refined. The following procedure is the strategy for mesh interval division. Firstly, the predicted polynomial determines the number of all the collocation points in the new subinterval. Secondly, the number of collocation points should be no fewer than the minimum allowable number. In other words, whenever dividing a mesh interval, each interval will contain at least $N_{m i n}$ collocation points. Third, the new number of mesh intervals $B_{k}$ , is given by the equation:

22

B_{k} = c e i l [B_{u} l o g_{10} (\frac{e_{m a x}^{(k)}}{ε})],

where $B_{u}$ is a user-defined positive integer. In this process, it is ensured that the number of new intervals should be at least two. Thus, the number of new subintervals, denoted as $B_{k}$ , can be rewritten as:

23

B_{k} = m a x \{2, c e i l [B_{u} l o g_{10} (e_{m a x}^{(k)} / ε)]\} .

3.5. Reducing the number of collocation points in a mesh interval

The relative error of the mesh interval is less than the desired relative tolerance, and if $r_{k} < r_{m a x}$ , then the number of collocation points should be decreased. The number of points $N_{k}^{(M + 1)}$ at mesh $M + 1$ is calculated by the equation:

24

N_{k}^{(M + 1)} = N_{k}^{(M)} - P_{k}^{'}, P_{k}^{'} = c e i l (l o g_{10} \sqrt{\frac{ε}{e_{m a x}^{(k)}}}) .

3.6. Merging adjacent mesh subintervals

Before the adjacent mesh subintervals merging, it is necessary to decrease the number of each interval according to the method in Section 3.5, then generally estimate the number of mesh interval points. If $N_{k + 1} \neq N_{k}$ , the mesh intervals $S_{k + 1} = [T_{k}, T_{k + 1}]$ and $S_{k} = [T_{k - 1}, T_{k}]$ cannot be merged because highest polynomial order of the two adjacent mesh intervals are not equal. All the matching points of the original two mesh intervals are combined, and the conditions for the merging of the two mesh subintervals are mainly three:

(1) The two mesh subintervals must be adjacent.

(2) The relative error estimations of the two grid intervals are not more than $ε$ .

(3) The relative error of the new mesh interval after the merger is not larger than $ε$ .

3.7. Mesh refinement method

The schematic of adaptive mesh refinement method is shown in Fig. 1. The adaptive mesh refinement method is summarized as follows.

Step 1: Set $M =$ 0 and supply initial mesh, $S = ⋃_{k = 1}^{K} S_{k} = [- 1, + 1]$ , where $⋂_{k = 1}^{K} S_{k} = \emptyset$ .

Step 2: Solve NLP on current mesh $S$ .

Step 3: Compute maximum relative error $e_{m a x}^{(k)}$ in $S_{k}$ , $k = 1, . . ., K$ , if $e_{m a x}^{(k)} \leq ε$ for all $k = 1, . . ., K$ or $M > M_{m a x}$ , then quit. Otherwise, proceed to Step 4.

Step 4: If $e_{m a x}^{(k)} > ε$ , $k = 1, . . ., K$ , proceed to Step 5, otherwise, proceed to Step 6.

Step 5: Compute the ratio between the maximum and the mean curvature $r_{k}$ in $S_{k}$ , if $r_{k} \leq r_{m a x}$ , set the number of collocation points increase by $P_{k}$ , else divide the interval $S_{k}$ into $B_{k}$ subintervals, where $B_{k}$ is given by Eq. (23). Then proceed to Step 7.

Step 6: For the single mesh interval, decrease the number of collocation points. Merge the adjacent mesh interval if they satisfy the conditions of the merger.

Step 7: Set $M \vec{=} M + 1$ , and return to Step 2.

4. Numerical example

The order and intervals of the method in Ref. [16] are fixed in each simulation, while that of the method described in this paper are variable. For the convenience of narration, the mesh refinement method in Ref. [16] is called FOI (fixed order and intervals) method, and the method in section is called VOI (variable order and intervals) method. The term $M$ denotes the mesh refinement iteration, and $M =$ 0 means the mesh initialization, and the term $N$ and term $K$ denote the total collocation points and interval number respectively. The number of collocation points within each intervals of the two method is at least 2. The maximum of all mesh interval allowable error values is $ε$ , where $ε =$ 10^-6. When the mesh is initialization, the whole mesh is divided into 10 intervals, and each interval with a number of 2 collocation points. The value of term $r_{m a x}$ is 1.2, and the maximum number of collocation points with each interval is 12. The simulation of this paper was performed on a 3.4 GHz Intel Core i7 CPU computer and MATLAB Version R2013.

Fig. 1Schematic of adaptive mesh refinement method

4.1. Example 1

Consider the following Bang-Bang optimal control problem from Ref. [18] to illustrate the effectiveness of the method in this paper. Firstly, the method is able to accurately solve the optimal control problem. Secondly, the mesh refinement with elimination of unnecessary mesh points is able to improve the algorithm performance.

Minimize the cost function:

25

\min J = \int_{0}^{1} (x^{2} - \frac{1}{2} u) d t,

26

s . t . \dot{x} = - x + u,

27

x (0) = 1.0,

28

|u (t)| \leq 1.0, t \in [0,1] .

The analytic solution for this optimal control problem is given as:

29

u^{*} (t) = \{\begin{array}{l} - 1, (0 \leq t \leq t c), \\ 1, (t c \leq t \leq 1), \end{array} (t c = l n \frac{e}{2}) .

Fig. 2(a) and Fig. 2(b) and Fig. 2(c) show the exact solutions of the optimal control problem.

Fig. 2a) x(t) vs. t, b) u(t) vs. t, c) λ(t) vs. t

a)

b)

c)

Fig. 3(a) and Fig. 3(b) show the collocation point’s distribution of the solutions obtained using FOI method and VOI method respectively. According to the exact solutions of Eq. (29), we can know that near time $t = t c$ , the state variables and control variables are rather changeable. Fig. 4(a) and Fig. 4(b) show the collocation points are mainly located near $t = t c$ instead of located at both ends of the solution, which is because more number of collocation points are needed to capture the changes at $t = t c$ of the solution. When $M \geq 2$ , the mesh point in Fig. 3(b) does not increase with each mesh iteration, but decreases. This is because the VOI method has the properties of reducing the interval number and the number of mesh points, and the FOI method does not have this kind of property, and mesh points in Fig. 3(a) is fixed. The iteration program is terminated at $M =$ 5 when the accuracy tolerance is satisfied, and the number of mesh points using the VOI method is 60.

Fig. 3a) FOI mesh point history, b) VOI mesh point history

a)

b)

Next, we analyze the approximation ability of solution obtained by the VOI method to the exact solution. Fig. 4(a) and Fig. 4(b) show the state on each mesh refinement iteration alongside the analytic solution using the VOI method. In addition, it is seen from Fig. 4(a) show that the resulting solution gradually converge to the exact solution with each mesh refinement iteration. Fig. 4(b) shows the states near $t = t c$ on each mesh refinement iteration, it is apparent that the difference between Mesh Iteration 1 and Analytic Solution is great, and the Mesh Iteration 2 is much closer to the analytic than Mesh Iteration 1, moreover, the gap between Mesh Iteration 2 and Mesh Iteration 3-5 is very small, which mean that the solutions are gradually converged on the analytic solution with each mesh refinement iteration. The number of mesh points near $t = t c$ is also increasing with the continuous refinement of the mesh, because there are larger state changes near $t = t c$ , thus the precise requirement of the solution is satisfied with more mesh points.

Fig. 4a) x(t) vs. t, b) x(t) vs. t

a)

b)

A comparison of the implementation of the VOI method and FOI method with different higher-order and intervals solutions are given in Table 1. Comparisons are made in terms of computation time, the number of total mesh points $N$ and the number of mesh intervals $K$ and the number of mesh refinement iteration $M$ , and the cost function values. In each case, the initial guesses are randomized controls, with randomized state. A total of 100 samples are used to produce the results in this paper. The terminology VOI (2, 12) refers to the VOI method where the number of mesh points within each interval can vary between 2 and 12, furthermore, the number of mesh intervals in VOI method can vary as well. All the simulations parameters are shown in Table 1, and all the results are shown in blue. As it can be seen from the results listed in Table 1, the VOI method result in the smallest overall times compared with other cases. The reason is that the VOI method has the properties of reducing the unnecessary points and intervals, while FOI method (other cases) doesn’t have this kind of property but only keeps the number of mesh points and intervals fixed until the simulation terminated. It is a fact computation times are mostly depended on the number of mesh refinement iterations and mesh size, while the growth of computation time for case 4, 7, 10 are due mostly to the increase in number of mesh points and mesh intervals. Interestingly, the number of mesh intervals in case 1 is not set parameter but the result from the simulation, where the number of mesh intervals in case 2-10 are set parameters. The case 10 using 9 mesh points with 15 intervals gives a terminal cost function of 0.4572, which is the most optimality one in all cases. The results show, as expected, that using larger mesh size results in improvements in accuracy at the expense of increases in runtime, due to the denser Jacobians. For FOI method, better accuracy can be achieved by increasing mesh points for smooth problems, whereas increasing the number of intervals to achieve better accuracy for nonsmooth problems [16]. However, it is a fact that smooth regions and nonsmooth regions together exist in one solution of the problem, so it is difficult to trade the number of mesh points and the number of mesh intervals when solving a complicated problem. The simulations show that the VOI method can trade the number of mesh points within intervals with the number of mesh intervals, and obtain an accurate solution with a relatively small mesh size.

Table 1Mesh refinement results for example1 using VOI and FOI methods

Case		$n$	$K$	Mean times / s	$N$	$M$	Cost function	Constraint Jacobian density (%)
1	VOI	(2,12)	13	1.76	60	4	0.4574	1.358
2	FOI	5	10	8.39	50	7	0.4589	3.683
3	FOI	5	15	11.3	75	6	0.4583	2.776
4	FOI	5	20	29.5	100	6	0.4577	2.103
5	FOI	7	8	10.3	56	7	0.4581	4.156
6	FOI	7	12	23.2	84	6	0.4575	3.917
7	FOI	7	16	37.2	112	5	0.4583	2.843
8	FOI	9	6	11.5	54	7	0.4582	4.468
9	FOI	9	10	25.4	90	6	0.4577	4.015
10	FOI	9	15	45.8	135	5	0.4572	3.672

Table 2Convergence of VOI method compare with FOI method in Ref. [16]

	VOI (case 1)		FOI (case 2)		FOI (case 7)		FOI (case 10)
	$e_{m a x}$	$e_{e x a c t}$	$e_{m a x}$	$e_{e x a c t}$	$e_{m a x}$	$e_{e x a c t}$	$e_{m a x}$	$e_{e x a c t}$
1	2.08×10⁰	5.07×10^-3	5.61×10^-1	4.28×10^-3	1.33×10⁰	3.11×10^-3	1.89×10^-1	5.40×10^-3
2	5.44×10^-1	3.22×10^-2	3.24×10^-1	3.19×10^-2	2.03×10^-1	2.58×10^-2	9.19×10^-2	7.13×10^-2
3	3.15×10^-3	4.81×10^-2	7.53×10^-2	4.51×10^-2	4.59×10^-3	4.50×10^-2	5.71×10^-3	6.24×10^-3
4	1.36×10^-8	1.02×10^-7	6.45×10^-3	6.77×10^-3	4.78×10^-5	6.26×10^-4	4.55×10^-5	4.12×10^-5
5	–	–	3.05×10^-3	8.05×10^-3	6.84×10^-8	1.13×10^-7	6.50×10^-9	9.87×10^-9
6	–	–	4.85×10^-5	5.83×10^-5	–	–	–	–
7	–	–	9.34×10^-8	1.29×10^-7	–	–	–	–

Next, we analyze the convergence of mesh refinement. Table 2 shows the estimated maximum relative errors and exact relative errors for each mesh refinement iteration by using the VOI method (case 1) and FOI method (case 2, 7, 10). First, it can be seen from the Table 2 that the relative error on final mesh is quite small at $\approx$ 10^-7 for the state. The consistency in the exact relative error and the relative error approximation demonstrates the accuracy of the estimate derived in section 3.2.

4.2. Example 2

Consider the following trajectory optimization problem taken from Ref. [19] of maximizing the downrange of a Maneuverable Research Re-entry Vehicle (MaRRV). Minimize the cost function:

30

J = m i n \{- θ (t_{f})\} .

The state equations for hypersonic vehicle which is commonly used in midcourse guidance systems are listed as followings:

31

\dot{r} = v s i n γ, \dot{ϕ} = \frac{v c o s γ s i n χ}{r c o s θ}, \dot{θ} = \frac{V c o s γ c o s χ}{r}, \dot{v} = - \frac{D}{m} - g s i n γ,

\dot{γ} = \frac{1}{v} [L c o s β - (g - \frac{v^{2}}{r}) c o s γ], \dot{χ} = \frac{L s i n β}{m v c o s γ} + \frac{v}{r} c o s γ s i n χ t a n θ .

It is noted that the model (physical model and wing-body vehicle model) are taken from Ref. [19]. The initial conditions and terminal constraints are listed in Table 3. A typical solution of this problem is shown in Fig. 5(a)-(d) by using the VOI (2, 12) method.

It is seen that the solution to this example is relative smooth, especially the control variable (attack angle) is slowly changing in Fig. 5(d), meaning that it is easier to apply to engineering. As a result, it is possible to achieve an accurate solution with a relatively small number of mesh points when compared with FOI method. Table 3 shows that the terminal constraints are satisfied with error of less than 0.5 % in all parameters. A comparison of the implementation of the VOI method and FOI method solutions are given in Table 4.

Fig. 5a) Altitude vs. downrange, b) velocity vs. time, c) flight path angle vs. time, d) attack angle vs. time

a)

b)

c)

d)

Table 3Conditions and results

State variables	$h$ (km)	$θ$ (deg)	$ϕ$ (deg)	$v$ (km/s)	$γ$ (deg)	$χ$ (deg)
Initial conditions	72	0	0	5.435	–1	0
Terminal constraints	29.390	Index	0	1.500	–5	0
Final conditions	29.385	37.728	0.001	1.548	–5.001	0
Difference	0.005	–	0.001	0.002	0.001	0

It is seen that the VOI (2, 11) and FOI (5, 55) methods [shown in Table 4] are almost the most computationally efficient. FOI (9, 45) method takes the most time (with 185.7 seconds), while FOI (7, 55) produce the most optimal solution with the smallest cost function. As a result, for the vast majority of the solution, the largest decrease in error is achieved by using more mesh intervals and more mesh points in each mesh interval. The reason for more computation time needed to obtain the desired solution is that the number of mesh intervals and mesh points in each mesh interval are fixed when uses FOI methods. The simulations show that the proposed method can efficiently obtain a reliable, accurate solution for re-entry trajectory optimization problem of MaRRV.

Table 4Mesh refinement results for example2 using VOI and FOI methods

Case	Methods	$n$	$K$	Mean times / s	$N$	$M$	Cost function
1	VOI	(2, 11)	69	18.6	233	5	–37.728
2	FOI	5	55	21.5	265	7	–37.713
3	FOI	5	70	41.3	350	6	–37.722
4	FOI	7	50	65.1	350	7	–37.698
5	FOI	7	55	147.8	385	8	–37.740
6	FOI	9	40	108.0	360	8	–37.681
7	FOI	9	45	185.7	405	7	–37.730

5. Conclusions

Trajectory optimization was considered through the combination of the direct transcription and mesh refinement approach in this paper. The suggested method uses the Hermite interpolating polynomials to approximate the trajectory states, which employ the HGL points as collocation and interpolation points. The Hermite interpolating polynomials method can improve the sparsity of the constraint Jacobian. The method employs mesh refinement algorithms that it gets the ability to trade mesh polynomial degree with the number of mesh intervals. The number of mesh interval is increased in nonsmooth regions of the solution, while the mesh points increased in smooth regions of the solution. Furthermore, the mesh size can be decreased either by reducing the mesh points or by combining adjacent mesh intervals which share the same number of mesh points. The method is applied successfully to hyper-sensitive optimal control problem and trajectory optimization problem from the open literature. It is obvious that in terms of the example reviewed, better performance is achieved when compared with other mesh refinement methods. It may be suggested to study the advantages vs. disadvantages for the method in detail in the near future compared with other conventional methods in case of mesh refinement algorithms as well about other related aspects.

References

Betts J. T. Survey of numerical methods for trajectory optimization. Journal of Guidance, Control, and Dynamics, Vol. 21, Issue 2, 1998, p. 193-207.

Publisher
Elnagar G., Kazemi M. A., Razzaghi M. The pseudospectral legendre method for discretizing optimal control problems. IEEE Transactions on Automatic Control, Vol. 40, Issue 10, 1995, p. 1793-1796.

Publisher
Herman A. L., Conway B. A. Direct optimization using collocation based on high-order Gauss-Lobatto quadrature rules. Journal of Guidance, Control, and Dynamics, Vol. 19, Issue 3, 1996, p. 592-599.

Publisher
Gill P. E., Wong E., Murray W., Saunders M. A. User’s guide for SNOPT version 7.4: software for large-scale nonlinear programming. Numerical Analysis Report, Department of Mathematics, University of California, San Diego, 2015.

Search CrossRef
Soler M., Olivares A., Staffetti E. Multiphase optimal control framework for commercial aircraft four-dimensional flight-planning problems. Journal of Aircraft, Vol. 52, Issue 1, 2015, p. 274-286.

Publisher
Arribas D. G., Rivo M. S., Arnedo M. S. Optimization of path-constrained systems using pseudospectral methods applied to aircraft trajectory planning. IFAC-PapersOnLine, Vol. 48, Issue 9, 2015, p. 192-197.

Publisher
Huntington G. T. Advancement and analysis of a Gauss pseudospectral transcription for optimal control problems. Massachusetts Institute of Technology, Cambridge, 2007.

Search CrossRef
Huntington G. T., Benson D., Rao A. V. A comparison of accuracy and computational efficiency of three pseudospectral methods. AIAA Guidance, Navigation and Control Conference and Exhibit, South Carolina, 2007.

Publisher
Ross I. M., Fahroo F. Pseudospectral knotting methods for solving nonsmooth optimal control problems. Journal of Guidance, Control, and Dynamics, Vol. 27, Issue 3, 2004, p. 397-405.

Publisher
Poustini M. J., Esmaelzadeh R., Adami A. A new approach to trajectory optimization based on direct transcription and differential flatness. Acta Astronautica, Vol. 107, 2015, p. 1-13.

Publisher
Su Zikang, Wang Honglun A novel robust hybrid gravitational search algorithm for reusable launch vehicle approach and landing trajectory optimization. Neurocomputing, Vol. 162, 2015, p. 116-127.

Publisher
Ma Lin, Chen Weifeng, Song Zhengyu, Shao Zhijiang A unified trajectory optimization framework for lunar ascent. Advances in Engineering Software, Vol. 94, 2016, p. 32-45.

Publisher
Chai Dong, Fang Yang-Wang, Wu You-li, Xu Su-hui Boost-skipping trajectory optimization for air-breathing hypersonic missile. Aerospace Science and Technology, Vol. 46, 2015, p. 506-513.

Publisher
Lei H. M., Liu T., Li J., Jiang Z. P. Adaptive mesh refinement of hp pseudospectral method using size reduction. Control Theory and Applications, Vol. 33, 8, p. 1061-1067.

Search CrossRef
Ross I. M., Fahroo F. Legendre Pseudospectral Approximations of Optimal Control Problems. Lecture Notes in Control and Information Sciences, Springer-Verlag, Berlin, Vol. 295, 2003, p. 327-342.

Publisher
Paul W. Hermite-Legendre-Gauss-Lobatto direct transcription in trajectory optimization. Journal of Guidance, Control, and Dynamics, Vol. 32, Issue 4, 2009, p. 1392-1395.

Publisher
Garg D., Patterson M. A., Darby C. L., Francolin C., Huntington G. T., Hager W. W., Rao A. V. Direct trajectory optimization and costate Estimation of finite-horizon and infinite-horizon optimal control problems via a Radau pseudospectral method. Computational Optimization and Applications, Vol. 49, Issue 2, 2011, p. 335-358.

Publisher
Hu Y. Q., Liu X. G., Xue A. K. A penalty method for solving inequality path constrained optimal control problems. Acta Automatica Sinca, Vol. 39, 12, p. 1996-2001.

Publisher
Rizvi S. Tauqeer ul Islam, He Linshu, Naseemullah Vehicle performance tradeoff study for a small size lifting reentry vehicle. Proceedings of 10th International Bhurban Conference on Applied Sciences and Technology, Islamabad, Pakistan, 2013.

Search CrossRef

Cited by

An improved adaptive hp mesh refinement method in solving optimal control problems

Changxin Luo | Jiong Li | Chijun Zhou | Humin Lei

(2023)

Trajectory Optimization for High-Speed and Long-Range Interceptor Based on Improved Adaptive hp Pseudospectral Method

Changxin Luo | Chijun Zhou | Jiong Li | Humin Lei | Chuang Liu

(2022)

About this article

Received

29 December 2016

Accepted

01 July 2017

Published

31 December 2017

SUBJECTS

Vibration generation and control

DOI

https://doi.org/10.21595/jve.2017.18146

Keywords

optimal control

mesh refinement

relative error estimation

merge mesh intervals

mesh iteration

Acknowledgements

Supported by National Natural Science Foundation of China (61573374, 61503408).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Previous article in issue Previous Next article in issue Next

Research article

2023 08 01

Minimum-time lane changing problem of vehicle handling inverse dynamics based on adaptive mesh refinement and collocation optimization method

Yingjie Liu, Dawei Cui, Wen Peng

Research article

2023 02 14

An adaptive infrared image denoising method based on two-dimensional empirical mode decomposition for distribution network inspection UAV

Qigang Zhou, Lei Yang, Fengqi Liu, Songyu Li

Research article

2022 12 11

Vehicle state and parameter estimation based on adaptive robust unscented particle filter

Yingjie Liu, Dawei Cui, Wen Peng

Research article

2021 09 30

Mathematical simulation of adaptive vector finite element method for the analysis of electromagnetic vibration spectrum field response

Yiyuan Cheng, Mingyang Su, Ming Hui, Wei Liu, Yangbing Zheng

H. Lei, T. Liu, D. Li, and J. Ye, “Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription,” Journal of Vibroengineering, Vol. 19, No. 8, pp. 6036–6048, Dec. 2017, https://doi.org/10.21595/jve.2017.18146

Copy Extrica

Copied to clipboard!

TY  - JOUR
DO  - 10.21595/jve.2017.18146
UR  - https://doi.org/10.21595/jve.2017.18146
TI  - Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription
T2  - Journal of Vibroengineering
AU  - Liu, Tao
AU  - Lei, Humin
AU  - Li, Deng
AU  - Ye, Jikun
PY  - 2017
DA  - 2017/12/31
PB  - JVE International Ltd.
SP  - 6036-6048
IS  - 8
VL  - 19
SN  - 1392-8716
ER  - 

Copy Ris

Copied to clipboard!

@article{Liu_2017,
	doi = {10.21595/jve.2017.18146},
	url = {https://doi.org/10.21595/jve.2017.18146},
	year = 2017,
	month = {dec},
	publisher = {{JVE} International Ltd.},
	volume = {19},
	number = {8},
	pages = {6036--6048},
	author = {Tao Liu and Humin Lei and Deng Li and Jikun Ye},
	title = {Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription},
	journal = {Journal of Vibroengineering}
}

Copy Bibtex

Copied to clipboard!

[1]T. Liu, H. Lei, D. Li, and J. Ye, “Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription,” Journal of Vibroengineering, vol. 19, no. 8, pp. 6036–6048, Dec. 2017, doi: 10.21595/jve.2017.18146.

Copy IEEE

Copied to clipboard!

Liu, Tao, Humin Lei, Deng Li, and Jikun Ye. “Adaptive Mesh Refinement Method for Optimal Control Based on Hermite-Legendre-Gauss-Lobatto Direct Transcription.” Journal of Vibroengineering 19, no. 8 (December 31, 2017): 6036–48. https://doi.org/10.21595/jve.2017.18146.

Copy Chicago

Copied to clipboard!

Adaptive mesh refinement method for optimal control based on Hermite-Legendre-Gauss-Lobatto direct transcription

Abstract

1. Introduction

2. Optimal control problem

3. Adaptive mesh refinement methodology

3.1. Numerical discretization and approximation

3.2. Approximation of solution error

3.3. Refining the mesh

3.4. Generation of new mesh segment

3.5. Reducing the number of collocation points in a mesh interval

3.6. Merging adjacent mesh subintervals

3.7. Mesh refinement method

4. Numerical example

4.1. Example 1

4.2. Example 2

5. Conclusions

References

Cited by

About this article

Related Articles