Journal of Fuzzy Systems and Control, Vol. 2, No 3, 2024

Model Predictive Control for Rotary Inverted Pendulum: Simulation and Experiment

Phuc-Hoang Huynh 1, Minh-Hanh Nguyen 2, Nguyen-Phat Pham 3, Hoang-Viet-Phuc Duong 4, Huy-Ha Nguyen 5,

Duc-Chung Le 6, Minh-Khoa Nguyen 7, Ngoc-Liem Bui 8, Nguyen-Phi-Long Le 9, Van-Dong-Hai Nguyen 10,*

1, 2, 3, 4, 5, 6, 7, 8, 10 Faculty of Electrical and Electronics Engineering, Ho Chi Minh City University of Technology and Education (HCMUTE), Vietnam

9 Faculty of International Education, Ho Chi Minh City University of Technology and Education (HCMUTE), Vietnam

Email: 1 21151503@student.hcmute.edu.vn, 2 21142525@student.hcmute.edu.vn, 3 21161346@student.hcmute.edu.vn,

4 21161348@student.hcmute.edu.vn, 5 20151357@student.hcmute.edu.vn, 6 20151043@student.hcmute.edu.vn,

7 20142350@student.hcmute.edu.vn, 8 20161038@student.hcmute.edu.vn, 9 20151291@student.hcmute.edu.vn,

10 hainvd@hcmute.edu.vn

*Corresponding Author

Abstract—Rotary Inverted Pendulum (RIP) is one of the simplest nonlinear systems commonly used for validating control algorithms. In this study, two controllers, Model Predictive Control (MPC) and Linear Quadratic Regulation (LQR), are simulated and experimentally validated. These controllers are executed in real-time on a PC, while the STM32F407 chip handles control and data acquisition from the pendulum using a high-speed USB interface. Due to the custom-built nature of this model, there are inaccuracies in the model and parameter identification. However, results show that the MPC controller is better at trajectory tracking and maintaining balance near the set point compared to the LQR controller. On the other hand, the LQR controller responds more robustly to disturbances and external forces, highlighting distinct differences between MPC’s optimization over each prediction horizon and LQR’s single-solution approach for the entire prediction horizon.

Keywords—LQR; MPC; Rotary Inverted Pendulum; STM32F4

Introduction

Among nonlinear systems, RIP is considered an easily constructed plant with a simple mechanical structure but high nonlinearity [1]. Therefore, this system is commonly used in experiments related to identification and control. Various control algorithms have been applied successfully to the inverted pendulum (IP) model, including PID control [2], back-stepping [3], fuzzy control [4], reinforcement learning [5], and optimal control such as LQR [6]. However, MPC is most often applied to SISO or MIMO systems and is less commonly used for SIMO systems such as RIP. The main difficulty on a real model is identifying the system parameters accurately, which the MPC method requires. In [7], a Quanser model is used to test MPC control, and the experiment succeeds thanks to the standardized model of this company. However, that experimental model is expensive, and the processor in that research is a professional board that is not widely accessible. Therefore, an MPC controller that works on a self-made platform based on the STM32F4 board can be a solution. In this paper, we propose applying the MPC controller, a controller used to manage overall processes in industries such as processing plants and oil refineries and in real-time applications [8][9]. Unlike LQR, MPC is an optimal control technique where control actions are calculated to minimize a cost function for a dynamically constrained system over a finite, receding horizon [10]. To highlight the differences between the two algorithms, we compare MPC and LQR controllers on RIP to clarify the strengths and weaknesses of these two control methods.

RIP MODELLING

The model’s kinetic equations

RIP consists of an arm and a pendulum, with a DC motor mounted at the end of the arm. The pendulum is normally stable in a downward position but unstable in an upright position. Therefore, a controller must be designed to keep the pendulum in an upright position and move it along a predefined trajectory. The specific structure is shown in Fig. 1 [1].

Fig. 1. Mathematical IP model

We use the Parameter Estimator toolbox in MATLAB to estimate system parameters [11]. The parameters of the pendulum are shown in Table 1.

Table 1. Parameters of RIP model

Symbol	Description	Value	Unit
	Mass of pendulum	0.24297
	Half-length of pendulum	0.20147
	Length of pendulum arm	0.14902
	Moment of inertia of arm	0.0045556
	Inertia moment of pendulum	0.0017725
	Friction coefficient of arm	0.0063986
	Friction coefficient of pendulum	0.0065929
	Gravitational acceleration constant	9.81
	Pendulum arm angle
	Pendulum angle
	Armature voltage
	Torque constant	0.053344
	Back emf constant	0.28834
	Armature resistance	0.72921
	Moment of inertia of rotor	0.012818
	Viscous friction constant	0.0033158

According to [1], the mathematical equations describing RIP are as follows:

(1)

The control signal here is the torque of the motor. It needs to be converted into voltage to fit the real system. The torque produced by a DC motor is defined as in (2) [12]:

(2)
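For orientation, a commonly used simplified armature model relates motor torque to applied voltage as sketched below. This is an assumption, not necessarily the form of (2): it neglects armature inductance, and the symbols $K_t$, $K_b$, $R_a$, $V$, and $\dot{\alpha}$ are illustrative names for the torque constant, back-EMF constant, armature resistance, armature voltage, and arm angular velocity listed in Table 1.

```latex
\tau = \frac{K_t}{R_a}\left(V - K_b\,\dot{\alpha}\right)
```

The expression used in the paper may include further terms, for example the rotor inertia and viscous friction that also appear in Table 1.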

Combining equations (1) and (2), we obtain the mathematical equation for RIP in (3)

(3)

Linearization at working point

Defining state variables as in (4)

(4)

nonlinear state equations of RIP are listed in (5)

(5)

The requirement is to control the arm to keep the pendulum balanced in the upright position. The working point is that both the pendulum angle and arm angle are stationary, and no voltage is applied to the motor. It is described in (6) below

(6)

By linearizing RIP around this upright equilibrium point (where the deviation angle β is less than 10°), we obtain linearized state equations for the pendulum system in the following form:

(7)

where

(1)

Substituting the parameters from Table 1 into the equations in (7), we obtain the matrices in (8) below.

(8)
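The linearization step can be illustrated numerically. The sketch below is only an illustration under stated assumptions: `f` is a toy single-link inverted pendulum with hypothetical parameters, not the RIP model of (5), and `jacobians` is a hypothetical helper that approximates the matrices A and B by central differences at an equilibrium point.

```python
import math

def f(x, u):
    """Toy inverted pendulum (NOT the RIP model of (5)): x = [angle, rate]."""
    g, l, b = 9.81, 0.2, 0.05  # hypothetical gravity, length, damping
    theta, omega = x
    return [omega, (g / l) * math.sin(theta) - b * omega + u]

def jacobians(f, x0, u0, eps=1e-6):
    """Central-difference A = df/dx and B = df/du evaluated at (x0, u0)."""
    n = len(x0)
    A = [[0.0] * n for _ in range(n)]
    for j in range(n):
        xp, xm = list(x0), list(x0)
        xp[j] += eps
        xm[j] -= eps
        fp, fm = f(xp, u0), f(xm, u0)
        for i in range(n):
            A[i][j] = (fp[i] - fm[i]) / (2 * eps)
    fp, fm = f(x0, u0 + eps), f(x0, u0 - eps)
    B = [(fp[i] - fm[i]) / (2 * eps) for i in range(n)]
    return A, B

# Linearize about the upright equilibrium: zero angle, zero rate, zero input.
A, B = jacobians(f, [0.0, 0.0], 0.0)
print(A, B)
```

For the toy model, the positive entry A[1][0] = g/l reflects the instability of the upright position, which is the property the controllers in the next section must compensate.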

Design Controller

In this section, we configure parameters for two controllers: LQR and MPC. Results are simulated in MATLAB Simulink and experimentally tested on a real model.

LQR controller

According to the LQR control method, to control the pendulum system, we need to design a state feedback controller [1]:

(9)

where: K is the control matrix, and x(t) is the state variable matrix.

Fig. 2. Structure of the LQR controller

The value of matrix K needs to be optimized, meaning that we must find the value of K minimizing performance index J.

The performance index is chosen to be quadratic, with an infinite final time:

(10)

Optimal control theory shows that the gain K that minimizes the performance index (10) is determined by the expression:

(11)

where P is the solution of the algebraic Riccati equation:

(12)

where Q and R are positive definite square matrices used to tune the LQR controller, and the four weights in Q correspond to the four state variables, respectively.
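For reference, the standard continuous-time, infinite-horizon LQR relations are collected below. They are assumed, not confirmed, to be the forms intended by (9) to (12), with A and B denoting the linearized matrices of (7); some texts include a factor of 1/2 in the performance index, which does not change K.

```latex
\begin{aligned}
u(t) &= -K\,x(t) \\
J &= \int_{0}^{\infty}\left(x^{T}Qx + u^{T}Ru\right)dt \\
K &= R^{-1}B^{T}P \\
A^{T}P + PA - PBR^{-1}B^{T}P + Q &= 0
\end{aligned}
```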

Designed MPC controller

Predicting future trajectories

MPC is an optimal control algorithm that considers system constraints, such as its physical limits. As shown in Fig. 3, MPC uses a discrete-time linear model to predict future outputs of the system [13][14]:

(13)

Where:

x(k) is the state vector, y(k) is the output vector observed from the RIP, and u(k), v(k), and nd(k) are the dimensionless manipulated variables, measured disturbances, and unmeasured input disturbances, respectively. Discrete state-space matrices A, B, C, Bu, Bv, Bd, Dv, Dd are computed from the continuous linear model using a discrete sampling time Ts.

Consider the problem of predicting future trajectories of the model performed at time k=0. Set nd(i)=0 for all prediction instants i, and obtain [15] as

(14)

The solution to the equation (14) is:

(15)

Where:

Fig. 3. Structure of MPC controller
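The prediction of future outputs from the discrete-time model can be sketched as a simple rollout. The example below is a minimal sketch under assumptions: it takes a single manipulated variable and a single output, sets the measured and unmeasured disturbances to zero, and uses a hypothetical double-integrator model rather than the RIP matrices. The function name `predict_outputs` is illustrative.

```python
def predict_outputs(A, Bu, C, x0, u_prev, du_seq):
    """Roll x(i+1) = A x(i) + Bu u(i), y(i) = C x(i) forward from time 0.

    u(i) is built from the previous input u(-1) plus the accumulated
    input increments du(0), ..., du(i), as in an increment-based MPC model.
    """
    n = len(x0)
    x, u, ys = list(x0), u_prev, []
    for du in du_seq:
        u += du  # u(i) = u(i-1) + du(i)
        x = [sum(A[i][j] * x[j] for j in range(n)) + Bu[i] * u for i in range(n)]
        ys.append(sum(C[j] * x[j] for j in range(n)))
    return ys

# Hypothetical double integrator with unit sample time (position, velocity).
A = [[1.0, 1.0], [0.0, 1.0]]
Bu = [0.5, 1.0]
C = [1.0, 0.0]
ys = predict_outputs(A, Bu, C, x0=[0.0, 0.0], u_prev=0.0, du_seq=[1.0, 0.0, 0.0])
print(ys)  # predicted positions y(1|0), y(2|0), y(3|0)
```

Stacking these predictions for all steps of the horizon gives a linear map from the current state and the input increments to the output trajectory, which is what makes the resulting optimization a quadratic program.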

Optimization Variables

Let m be the number of free control moves, and let z = [z0; ...; zm–1]. Then, it yields

(16)

where JM depends on the choice of blocking moves

Standard cost function

Based on predicted states, MPC calculates optimal control sequences to minimize cost function:

(17)

Where:

LSy and LSu are diagonal matrices of output and MV scaling factors, respectively.

: Tuning weight for the jth plant output at the ith prediction horizon step (dimensionless).

: Tuning weight for the jth MV at the ith prediction horizon step (dimensionless)

: Tuning weight for the jth MV movement at the ith prediction horizon step (dimensionless)

εk : Slack variable at the control interval k (dimensionless).

ρε : Constraint violation penalty weight (dimensionless).
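To make the role of the tuning weights concrete, the sketch below evaluates a cost of the usual sum-of-weighted-squares form for one output and one manipulated variable. It assumes that (17) combines output tracking, MV tracking, MV movement, and slack penalty terms in this way; the function name `mpc_cost`, the argument names, and the numbers are illustrative, and weights are held constant over the horizon for brevity.

```python
def mpc_cost(r, y, u, u_prev, u_target, wy, wu, wdu,
             sy=1.0, su=1.0, eps=0.0, rho=1e5):
    """Quadratic MPC cost for a single output and a single MV.

    r, y, u, u_target: sequences over the prediction horizon.
    wy, wu, wdu: output, MV, and MV-movement weights (constant here).
    sy, su: output and MV scale factors; eps, rho: slack and its penalty.
    """
    p = len(y)
    J_y = sum((wy / sy * (r[i] - y[i])) ** 2 for i in range(p))
    J_u = sum((wu / su * (u[i] - u_target[i])) ** 2 for i in range(p))
    J_du, prev = 0.0, u_prev
    for i in range(p):
        J_du += (wdu / su * (u[i] - prev)) ** 2  # penalize input movement
        prev = u[i]
    return J_y + J_u + J_du + rho * eps ** 2

cost = mpc_cost(r=[1.0, 1.0], y=[0.5, 1.0], u=[2.0, 2.0], u_prev=0.0,
                u_target=[0.0, 0.0], wy=1.0, wu=0.0, wdu=0.1)
print(cost)
```

Raising the movement weight relative to the output weight trades tracking speed for a smoother control signal, which is the trade-off tuned for the controllers compared later.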

Constraints

When a system contains physical limitations, such as motor voltage V, MPC accounts for these limitations in optimization problems using hard constraints.

(18)

Optimization Variables

Let m be the number of free control moves, and let z = [z0; ...; zm–1]. Then

(19)

where JM depends on the choice of blocking moves. Together with the slack variable ɛ, the vectors z0, ..., zm–1 constitute the free optimization variables of the optimization problem.
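The effect of the blocking matrix can be sketched for the simplest case. The example below assumes a single manipulated variable and the common convention that the first m moves are free and all later moves are zero, so that the input is held constant after step m. The names `blocking_matrix` and `expand_moves` and the values in z are illustrative; p = 50 and m = 3 are the horizons used in the experiments of this paper.

```python
def blocking_matrix(p, m):
    """p x m matrix J_M: identity rows for the m free moves, zero rows after."""
    return [[1.0 if i == j else 0.0 for j in range(m)] for i in range(p)]

def expand_moves(J_M, z):
    """Map the free moves z to the full increment sequence du = J_M z."""
    return [sum(row[j] * z[j] for j in range(len(z))) for row in J_M]

p, m = 50, 3                 # prediction and control horizons
J_M = blocking_matrix(p, m)
z = [0.5, -0.2, 0.1]         # hypothetical free moves
du = expand_moves(J_M, z)

# Accumulate the increments to obtain the input over the horizon.
u, level = [], 0.0
for d in du:
    level += d
    u.append(level)
print(du[:5], u[:5])
```

With this structure the optimizer searches over only m = 3 variables (plus the slack variable) instead of p = 50, which keeps the QP small enough for real-time use.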

QP Solvers

Model predictive controller QP solver converts a linear MPC optimization problem to the general form QP problem [16]:

(20)

Subject to linear inequality constraints

(21)

where

x is the solution vector.
H is the Hessian matrix. This matrix is constant when the prediction model and tuning weights do not change at run time.
A is a matrix of linear constraint coefficients. This matrix is constant when the prediction model does not change at run time.
b and f are vectors.
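A small example helps to show what the QP solver does. The sketch below is not the solver used in this work: it handles only box constraints lo ≤ x ≤ hi, which are a special case of linear inequality constraints (A = [I; −I], b = [hi; −lo]), and it uses projected gradient descent as a simple stand-in for an active-set or interior-point method. The name `solve_box_qp` and the test problem are illustrative.

```python
def solve_box_qp(H, f, lo, hi, iters=2000):
    """Minimize 0.5 x^T H x + f^T x subject to lo <= x <= hi.

    H must be symmetric positive semidefinite. The step 1/L uses the
    largest absolute row sum of H as an upper bound L on its top eigenvalue.
    """
    n = len(f)
    L = max(sum(abs(v) for v in row) for row in H)
    x = [min(max(0.0, lo[i]), hi[i]) for i in range(n)]
    for _ in range(iters):
        grad = [sum(H[i][j] * x[j] for j in range(n)) + f[i] for i in range(n)]
        # Gradient step followed by projection onto the box.
        x = [min(max(x[i] - grad[i] / L, lo[i]), hi[i]) for i in range(n)]
    return x

H = [[2.0, 1.0], [1.0, 2.0]]
f = [-3.0, -3.0]
x_free = solve_box_qp(H, f, lo=[-10.0, -10.0], hi=[10.0, 10.0])
x_box = solve_box_qp(H, f, lo=[-0.5, -0.5], hi=[0.5, 0.5])
print(x_free, x_box)
```

With wide bounds the result matches the unconstrained minimizer, while tight bounds push the solution onto the constraint, which is how a voltage limit enters the control computation.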

Control Results and Discussion

In this section, we simulate RIP tracking the set-point signal at arm angle. The experiment will use Simulink Real-Time Target with STM32F407VE control chip. The sampling time for the system is Ts=0.01s, with control parameters as follows:

Because we use a microcontroller with a sampling time Ts = 0.01 s, the discrete-time matrices A, B and the weighting matrices for the LQR controller have the following values:

	(22)
	(23)
	(24)
	(25)
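Discrete-time matrices of this kind are commonly obtained from the continuous model by zero-order-hold discretization. The sketch below assumes that method (the paper does not state which one was used) and computes it with a truncated Taylor series of the matrix exponential, which is adequate when the product of the matrix norm and Ts is small. The helper names and the double-integrator example are illustrative, not the RIP matrices.

```python
def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def expm_series(M, terms=20):
    """Truncated Taylor series of exp(M); suitable for small-norm M only."""
    n = len(M)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in matmul(term, M)]  # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def c2d_zoh(A, B, Ts):
    """Zero-order hold: exp([[A, B], [0, 0]] * Ts) = [[Ad, Bd], [0, I]]."""
    n, m = len(A), len(B[0])
    M = [[A[i][j] * Ts for j in range(n)] + [B[i][j] * Ts for j in range(m)]
         for i in range(n)]
    M += [[0.0] * (n + m) for _ in range(m)]
    E = expm_series(M)
    Ad = [row[:n] for row in E[:n]]
    Bd = [row[n:] for row in E[:n]]
    return Ad, Bd

# Hypothetical double integrator sampled at Ts = 0.01 s.
Ad, Bd = c2d_zoh([[0.0, 1.0], [0.0, 0.0]], [[0.0], [1.0]], 0.01)
print(Ad, Bd)
```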

From (22) to (25) we obtain the matrix K:

(26)
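A gain of this kind can be computed from discrete-time matrices by iterating the discrete Riccati recursion until it converges. The sketch below is one way to do this under assumptions: it handles a single input (so R is a scalar), uses plain fixed-point iteration rather than a dedicated Riccati solver, and is demonstrated on a hypothetical double integrator instead of the RIP matrices in (22) to (25). All function names are illustrative.

```python
def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def gain_from_P(A, B, P, R):
    """K = (R + B^T P B)^-1 B^T P A for a single input (scalar R)."""
    BtP = matmul(transpose(B), P)
    s = R + matmul(BtP, B)[0][0]
    return [[v / s for v in matmul(BtP, A)[0]]]

def dlqr_single_input(A, B, Q, R, iters=5000, tol=1e-12):
    """Iterate P <- Q + A^T P (A - B K) until P stops changing."""
    n = len(A)
    P = [row[:] for row in Q]
    for _ in range(iters):
        K = gain_from_P(A, B, P, R)
        BK = matmul(B, K)
        A_cl = [[A[i][j] - BK[i][j] for j in range(n)] for i in range(n)]
        APA = matmul(matmul(transpose(A), P), A_cl)
        P_next = [[Q[i][j] + APA[i][j] for j in range(n)] for i in range(n)]
        change = max(abs(P_next[i][j] - P[i][j])
                     for i in range(n) for j in range(n))
        P = P_next
        if change < tol:
            break
    return gain_from_P(A, B, P, R), P

# Hypothetical double integrator sampled at 0.1 s, with Q = I and R = 1.
A = [[1.0, 0.1], [0.0, 1.0]]
B = [[0.005], [0.1]]
K, P = dlqr_single_input(A, B, [[1.0, 0.0], [0.0, 1.0]], 1.0)
print(K)
```

The control law is then u(k) = −K x(k), computed once offline, which is why the LQR gain matrix depends only on the number of states.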

The configuration of the system's input and output measurements is shown in Fig. 4, along with the controller parameters as follows:

Fig. 4. Configuration of MPC controller for RIP

Sample time: Ts = 0.01s
Prediction horizon: P = 50
Control horizon: m = 3

Results in Simulation

We use MATLAB/Simulink to simulate the output responses of two controllers. The simulation diagram is shown in Fig. 5. Simulation results of MPC and LQR controllers with white noise (Noise power = [0.000001]) and arm angle tracking set-point signal f=1 (rad) are shown in Fig. 6.

Simulation results show that the output responses of the arm and pendulum under both LQR and MPC controllers are similar. However, the control signal of the LQR controller oscillates with a larger amplitude, within [−40, 40] (V), whereas the control signal of the MPC controller stays within [−12, 12] (V).

The results indicate that the control quality of the MPC controller is better. It also ensures that the operating voltage threshold of the motor is maintained even under significant disturbances affecting the system.

Simulation results of two controllers with white noise (Noise power = [0.0000001]) and arm angle tracking set-point signal f=sin(0.2πt) (rad) are shown in Fig. 7.

Simulation results show that the output response at the pendulum angle is similar for both controllers. However, the response at the arm angle is better with the MPC controller compared to the LQR controller.

Fig. 5. Simulation diagram of MPC and LQR controllers

Fig. 6. Output response of RIP tracking a step signal

Fig. 7. Output response of RIP tracking a sine wave signal

Results in Experiment

The RIP system model is shown in Fig. 8. We set the reference signal for the arm angle of the MPC and LQR controllers from f=0 (rad) to f=1 (rad) at the 20th second. Experimental results for LQR and MPC controller are shown from Fig. 9 to Fig. 11.

Fig. 8. Hardware platform of RIP

Fig. 9. Arm response under LQR and MPC controllers

Fig. 10. Pendulum response under LQR and MPC controllers

Fig. 11. Control voltage of the LQR and MPC controllers

The performance of two controllers, when the arm angle tracks set-point signal f=1 (rad), is shown in Table 2 and Table 3.

Table 2. Quality of the RIP

	Average pendulum angle (rad)			Average arm angle (rad)	Control voltage (V)	(s) (5%)	(%)
			Range		Control voltage (V)	(s) (5%)	(%)
LQR	1.15	0.15	[1; 1.2]	0.002	[1.6; -1.5]	38.7	23.5
MPC	0.87	0.13	[0.9; 0.84]	0.002	[0.9; -0.7]	7.5	47.7
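Step-response indicators of the kind reported above can be extracted from logged data as sketched below. The example assumes that the last two columns of the table are the 5% settling time and the percent overshoot, and it uses made-up samples rather than the experimental logs. The name `step_metrics` is illustrative, and time is assumed to be measured from the instant the set-point changes.

```python
def step_metrics(t, y, ref, band=0.05):
    """Return (settling_time, overshoot_percent) for a step to `ref`.

    Settling time is the first sample time after which the response stays
    within +/- band * |ref| of the reference; None if it never settles.
    """
    overshoot = max(0.0, (max(y) - ref) / abs(ref) * 100.0)
    tol = band * abs(ref)
    last_out = -1
    for i, yi in enumerate(y):
        if abs(yi - ref) > tol:
            last_out = i  # remember the last sample outside the band
    settling = None if last_out == len(y) - 1 else t[last_out + 1]
    return settling, overshoot

t = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [0.0, 1.2, 0.9, 1.04, 0.98, 1.0]   # made-up response to a 1 rad step
settling, overshoot = step_metrics(t, y, ref=1.0)
print(settling, overshoot)
```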

Table 3. Quality of RIP according to the Root Mean Square Error criterion

	Arm angle (rad)	Pendulum angle (rad)	Control voltage (V)
LQR	0.2020	0.8795	0.9950
MPC	0.1560	0.8634	0.9214
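The RMSE criterion can be computed from sampled signals as in the following minimal sketch. The function name `rmse` and the sample values are illustrative, not the experimental data.

```python
import math

def rmse(reference, measured):
    """Root mean square error between two equally long sample sequences."""
    if len(reference) != len(measured) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    total = sum((r - m) ** 2 for r, m in zip(reference, measured))
    return math.sqrt(total / len(reference))

# Made-up samples: three exact matches and one error of 2 rad.
value = rmse([1.0, 1.0, 1.0, 1.0], [1.0, 1.0, 1.0, 3.0])
print(value)
```

Because the errors are squared before averaging, a few large deviations, such as those during the transient, weigh more heavily in this criterion than many small steady-state errors.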

Experimental results show that:

At steady state, MPC tracks the set-point signal at the arm angle better than LQR, with an oscillation amplitude about three times smaller, a settling time about five times shorter, and a control voltage fluctuation about half as large.
From the RMSE values in Table 3, the MPC controller tracks the arm angle reference better than the LQR controller (0.1560 rad versus 0.2020 rad) and the pendulum angle slightly better (0.8634 rad versus 0.8795 rad), while the control voltage RMSE values of the two controllers are close (0.9214 V versus 0.9950 V).
During the transient period, LQR provides an arm angle response that is about half as large and a control voltage that is about four times smaller.

From experimental results, we observe that the arm and pendulum oscillate along a curve around the reference signal because:

The dynamic equation in (3) describes the ideal model. However, the experimental model inevitably contains fabrication errors, so the mathematical equation describes the physical characteristics of the experimental model only approximately.
The encoder signal wire attached to the pendulum unintentionally creates an additional resistance force, which can be regarded as an input disturbance.
The control system, originally designed as SISO, is applied to a SIMO system, which makes stabilizing two variables simultaneously challenging.

These three reasons cause the controller to attempt to reduce reference signal error, leading to curve oscillation.

Conclusions

After evaluating the simulation and experimental results of MPC and LQR controllers under different operating conditions on RIP, we find that MPC frequently computes new solutions, whereas LQR uses the same single (optimal) solution for the entire time horizon [17]. For this reason, in terms of control quality, MPC performs well for trajectory tracking and handling system constraints, while LQR provides strong responses to system disturbances, external forces, and unforeseen system changes. Additionally, MPC is characterized by smooth changes in the control signal, whereas LQR produces rapid changes in the control signal, which is a significant drawback due to its substantial impact on actuator wear.

Regarding controller processing, the size of the control matrix for the LQR controller depends only on the number of internal states of the system. In contrast, the MPC controller sets up a control matrix for the entire prediction horizon. Therefore, the size of the MPC control matrix not only depends on the number of internal states but also increases proportionally with the extension of the prediction horizon and the reduction in sampling time [18]. This results in an increasing number of calculations required to generate a control signal, which limits potential applications of the MPC algorithm and makes it dependent on the capabilities of the controller hardware used.

At each step, an MPC controller receives or estimates the current state of the plant. It then calculates a sequence of control actions that minimize cost over the horizon by solving a constrained optimization problem that relies on an internal plant model and depends on the current system state. The controller then applies only the first computed control action to the plant, disregarding subsequent ones. The process repeats in the following time step [13]. Therefore, an accurate mathematical model is necessary, considering the uncertainties [19][20] and disturbance rejection [21].
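The receding-horizon procedure described above can be sketched on a deliberately simple plant. The example assumes a scalar integrator x(k+1) = x(k) + Ts·u(k) instead of the RIP model, a single free move held over the whole prediction horizon, and a symmetric input limit; under these assumptions the constrained QP has one variable, so clipping the unconstrained minimizer gives the exact solution. All names and numbers are illustrative.

```python
def mpc_move(x, r, Ts, p, lam, u_max):
    """Best single move u, held for p steps, for x(k+1) = x(k) + Ts * u(k).

    Minimizes sum_i (r - x(k+i))^2 + lam * u^2 subject to |u| <= u_max.
    """
    gains = [i * Ts for i in range(1, p + 1)]  # x(k+i) = x + gains[i-1] * u
    u_free = sum(g * (r - x) for g in gains) / (sum(g * g for g in gains) + lam)
    return max(-u_max, min(u_max, u_free))    # exact for a one-variable QP

Ts, p, lam, u_max = 0.01, 50, 1e-3, 2.0
x, r = 0.0, 1.0
xs, us = [], []
for k in range(300):
    u = mpc_move(x, r, Ts, p, lam, u_max)  # re-solve from the current state
    x = x + Ts * u                         # apply only the first move
    xs.append(x)
    us.append(u)
print(us[0], xs[-1])
```

The input saturates at its limit while the error is large and then decays smoothly as the state approaches the reference, because the optimization is repeated at every sample with the latest measurement.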

ACKNOWLEDGMENT

This paper belongs to the project for students in HCMUTE for the year 2025. It is funded by HCMUTE. We, the authors, are grateful for this support. The operation of the system is shown in the link: https://www.youtube.com/watch?v=woHq5wdWdEM

References

[1] S. N. Vassilyev, A. Y. Kelina, Y. I. Kudinov, and F. F. Pashchenko, "Intelligent control systems," Procedia Computer Science, vol. 103, pp. 623-628, 2017, https://doi.org/10.1016/j.procs.2017.01.088.
[2] N. V. Đ. Hải and N. V. Thuyên, "PID-neuron controller design for rotary inverted pendulum system," JTE, vol. 7, no. 4, pp. 37–43, 2012, https://jte.edu.vn/index.php/jte/article/view/824.
[3] M. T. Vo, "Back-stepping control for rotary inverted pendulum," JTE, vol. 15, no. 4, pp. 93–101, 2020, https://jte.edu.vn/index.php/jte/article/view/110.
[4] Haiyan Wang and Yu Bai, "Application of fuzzy control in the inverted pendulum," Proceedings of 2013 2nd International Conference on Measurement, Information and Control, pp. 1354-1357, 2013, https://doi.org/10.1109/MIC.2013.6758210.
[5] J.-B. Kim, H.-K. Lim, C.-M. Kim, M.-S. Kim, Y.-G. Hong, and Y.-H. Han, "Imitation Reinforcement Learning-Based Remote Rotary Inverted Pendulum Control in OpenFlow Network," IEEE Access, vol. 7, pp. 36682-36690, 2019, https://doi.org/10.1109/ACCESS.2019.2905621.
[6] Y.-J. K. Minho Park, Ju-Jang Lee, "Swing-up and LQR stabilization of rotary inverted pendulum," The Sixteenth International Symposium on Artificial Life and Robotics, vol. 16, pp. 94–97, 2011, https://doi.org/10.1007/s10015-011-0897-9.
[7] K. R. S. K N Deepak, T Ananthan, "Model Predictive Control for rotary inverted pendulum using LabVIEW," IOP Publishing Ltd, 2019, https://doi.org/10.1088/1757-899X/577/1/012113.
[8] D. M. M. Ganga G, "MPC controller for trajectory tracking control of quadcopter," International Conference on Circuit, Power and Computing Technologies (ICCPCT), pp. 1-6, 2017, https://doi.org/10.1109/ICCPCT.2017.8074380.
[9] S. J. Qin and T. A. Badgwell, "A survey of industrial model predictive control technology," Control Engineering Practice, vol. 11, no. 7, pp. 733-764, 2003, https://doi.org/10.1016/s0967-0661(02)00186-7.
[10] J. B. Rawlings, "Tutorial: model predictive control technology," Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251), pp. 662-676 vol.1, 1999, https://doi.org/10.1109/ACC.1999.782911.
[11] G. V. Troshina, A. A. Voevoda and K. M. Bobobekov, "The parameters determination of the inverted pendulum model in the automatic control system," 2016 13th International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE), pp. 180-182, 2016, https://doi.org/10.1109/APEIE.2016.7807049.
[12] N.-C. Tran, "Double Rotary Inverted Pendulum LQR," Journal of Fuzzy Systems and Control, vol. 2, pp. 104–10, 2024, https://doi.org/10.59247/jfsc.v2i2.212.
[13] A. Bemporad, "Model Predictive Control Design: New Trends and Tools," Proceedings of the 45th IEEE Conference on Decision and Control, pp. 6678-6683, 2006, https://doi.org/10.1109/CDC.2006.377490.
[14] T.-D. Chu and C.-K. Chen, "Design and Implementation of Model Predictive Control for a Gyroscopic Inverted Pendulum," Applied Sciences, vol. 7, no. 12, 2017, https://doi.org/10.3390/app7121272.
[15] C. Gambella, B. Ghaddar, and J. Naoum-Sawaya, "Optimization problems for machine learning: A survey," European Journal of Operational Research, vol. 290, no. 3, pp. 807-828, 2021, https://doi.org/10.1016/j.ejor.2020.08.045.
[16] C. Schmid and L. T. Biegler, "Quadratic programming methods for reduced hessian SQP," Computers & chemical engineering, vol. 18, no. 9, pp. 817-832, 1994, https://doi.org/10.1016/0098-1354(94)E0001-4.
[17] L. Wang, Model predictive control system design and implementation using MATLAB, vol. 3. London: Springer, 2009, https://link.springer.com/book/10.1007/978-1-84882-331-0.
[18] A. Jezierski, J. Mozaryn, and D. Suski, "A Comparison of LQR and MPC Control Algorithms of an Inverted Pendulum," in Polish Control Conference, pp. 65-76, 2017, https://doi.org/10.1007/978-3-319-60699-6_8.
[19] M. M. J. Grimm G, Tuna S E, and Teel A R, "Nominally robust model predictive control with state constraints," IEEE transactions on Automatic control, vol. 52, pp. 1856-1870, 2007, https://doi.org/10.1109/TAC.2007.906187.
[20] S. M. M. a. R. S. V. Mayne D Q, "Robust model predictive control of constrained linear systems with bounded disturbances," Automatica, vol. 41, pp. 219-224, 2005, https://doi.org/10.1016/j.automatica.2004.08.019.
[21] Z. F. a. G. Y. P, "A simplified predictive control algorithm for disturbance rejection," ISA transactions, vol. 44, pp. 187-198, https://doi.org/10.1016/S0019-0578(07)60177-3.
