Full Terms & Conditions of access and use can be found at
https://www.tandfonline.com/action/journalInformation?journalCode=tcfm20
Engineering Applications of Computational Fluid
Mechanics
ISSN: (Print) (Online) Journal homepage: https://www.tandfonline.com/loi/tcfm20
Research on a method for predicting the
underflow concentration of a thickener based on
the hybrid model
Dong Xiao , Hongfei Xie , Longqiang Jiang , Ba Tuan Le , Jichun Wang ,
ChongMin Liu & Hongzong Li
To cite this article: Dong Xiao , Hongfei Xie , Longqiang Jiang , Ba Tuan Le , Jichun Wang , ChongMin Liu & Hongzong Li (2020) Research on a method for predicting the underflow
concentration of a thickener based on the hybrid model, Engineering Applications of Computational Fluid Mechanics, 14:1, 13-26, DOI: 10.1080/19942060.2019.1658228
To link to this article: https://doi.org/10.1080/19942060.2019.1658228
© 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
Published online: 07 Nov 2019.
Submit your article to this journal Article views: 1214
View related articles View Crossmark data
2020, VOL. 14, NO. 1, 13–26
https://doi.org/10.1080/19942060.2019.1658228
Research on a method for predicting the underflow concentration of a thickener
based on the hybrid model
Dong Xiaoa,b, Hongfei Xiea,b, Longqiang Jiangc, Ba Tuan Lea,d, Jichun Wange, ChongMin Liua,band Hongzong Lia,b
aInformation Science and Engineering School, Northeastern University, Shenyang, People’s Republic of China;bLiaoning Key Laboratory of Intelligent Diagnosis and Safety for Metallurgical Industry, Northeastern University, Shenyang, People’s Republic of China;cZhongxing Telecommunication Equipment Corporation, Shenzhen, People’s Republic of China;dInstitute of Research and Development, Duy Tan University, Da Nang, Vietnam;eScience School, Shenyang Jianzhu University, Shenyang, People’s Republic of China
ABSTRACT
As a method for extracting metals and their compounds, hydrometallurgy has the advantages of high comprehensive metal recovery rate, low environmental pollution, and easier production process. The intensive washing process is a key process in the hydrometallurgical process, and the underflow concentration is a key indicator for measuring the quality of the concentrated washing process. In this paper, after analyzing the characteristics of the thick washing process, the hybrid model combin-ing mechanism modelcombin-ing and error compensation model based on EDO-TELM (three hidden layers Extreme Learning Machine optimized with Entire Distribution Optimization algorithm) is used to achieve accurate measurement of the underflow concentration in the dense washing process. The hybrid model uses the improved EDO-TELM algorithm as an error compensation model to compen-sate the error of the un-modeled part of the mechanism model, and gives a reasonable estimate of the uncertain part of the model, which theoretically reduce the prediction error of the model. The Matlab simulation results show that the prediction error of the hybrid model is significantly lower than that of the mechanism model and the data model, and can be adapted to the measurement needs of the industrial site.
ARTICLE HISTORY Received 23 April 2019 Accepted 18 August 2019 KEYWORDS Hydrometallurgy; underflow concentration; mechanism model; EDO; TELM; hybrid model
1. Introduction
With the large-scale, centralized and continuous pro-duction of hydrometallurgical industry, an efficient and stable automatic production line is urgently required. The overall level of automation of hydrometallurgical pro-duction in China is low, and its automation technology greatly restricts the development of hydrometallurgical industry in China (Gui, Yang, Chen, & Wang, 2013). The hydrometallurgical process mainly includes typical processes such as leaching, dense washing, and extrac-tion/replacement (Wang, He, & Chang,2012). The dense washing process is a key process in the hydrometallur-gical process. Since in industrial production, the solid materials are usually subjected to wet sorting, and the selected products are in the form of a liquid-solid two-phase fluid, solid-liquid separation is required in most cases. The solid-liquid separation process using a thick-ener is commonly referred to as a ‘washing’ process. The thickener has the characteristics of large process-ing capacity, large bufferprocess-ing capacity, large inertia, poor controllability, and continuous operation. Therefore, the
CONTACT Dong Xiao [email protected]
concentrated washing process is the key link of the hydrometallurgical mass production process, and it is also the key to ensure the comprehensive recovery rate of valuable metals and maintain the volume balance of the production process (Miao, Wang, Wang, & Mao,2009). At present, the concentration of the underflow of a con-centrate in Shandong is difficult to detect, and the oper-ator relies on the production experience to carry out the ore mining, resulting in a sharp fluctuation in the down-stream filter press process, and the water content of the filter cake product is difficult to reach the standard. The tailings thickener is controlled by the experience of the operator, and it is arbitrarily large. Therefore, it is nec-essary to establish a thickener underflow concentration prediction model that meets the needs of industrial sites. At present, researches on various aspects of the pro-duction of thickeners have been carried out at home and abroad. In 1952, Kynch (Bürger, Concha, Karlsen, & Narváez,2006; Bürger, Diehl, Farås, Nopens, & Torfs,
2012; Kynch, 1952) proposed a mathematical analysis method, which can obtain the relationship between the
© 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
underflow concentration and the sedimentation velocity only by performing a sedimentation experiment. Fred-Schoenbrunn and TomToronto (Ichiro Tetsuo,1986) pro-posed an expert control system to achieve the control of thickeners, and were well applied in the CortezCold project in Nevada, using advanced fuzzy control for the prediction of the underflow concentration of the thick-ener; Raimund Bürger (Bürger, Diehl, & Nopens,2011) established a consistent modeling approach for complex practical processes in wastewater treatment. The actual process was approximated by mathematical models, and the established mathematical techniques were integrated into wastewater treatment modeling.
In the Meishan Concentrator (Gui & Zhao,2007), an experiment considering the tailings concentration was conducted by using a high-efficiency deep cone thick-ener; the underflow discharge was controlled automat-ically, the algorithm of combining double input single output fuzzy logic with PID was used, and the under-flow concentration was detected by the door-ray den-sitometer. The control system was distributed control, which can ensure the normal operation of the ener. Regarding the automatic uniformity of the thick-ener for a cyanide plant in Shandong Gold Mining Co., Ltd. (Ma & Cui, 2013), the ore drawing problem was studied and reformed, and a nuclear concentration meter was used as the detection system to realize the real-time detection of the underflow concentration of the thickener in the cyanide workshop; good results were obtained. The Beijing General Institute of Mining and Metallurgical Research performed research on the thick-ening machine of the Dexing Copper Mine (Ma,2004) Jingwei Plant. According to the test data of material balance and settlement, the load forecasting model and the optimal control model of the thickener were estab-lished, and the optimal operating values of flocculant addition and underflow discharge speed were given to ensure the optimal mud storage and stable production state of thickener. Aiming at the mathematical mod-eling of the thickener control system, Yang and Chen (2002) proposed to establish the dynamic balance rela-tionship between ore feeding and ore drainage by the thickener. The underflow concentration is a function of the ore supply and the underflow rate. Liu (2007) pro-posed that the underflow concentration of the thickener is determined by the compression time of the material in the thickener and the height of the compressed layer. For a relatively fixed production process, the compres-sion time is basically fixed, and the factor affecting the underflow concentration is the height of the compressed layer. It is considered that the underflow concentration can be predicted by controlling the underflow rate of the thickener.
In this article, we propose a hybrid model by com-bining mechanism model with data-driven modeling method. The mechanism model provides a priori knowl-edge of the data model, which saves training samples and reduces the need for sample data.
On the basis of solving fluid mechanics related prob-lems, the method based on mechanism model mod-eling is widely used. Ramezanizadeh, M studied the thermal performance of thermosiphon by three con-centrations of NI/glycerol-water nanofluid experiments, and designed a heat exchanger based on thermosiphon (Ramezanizadeh, Nazari, Ahmadi, & Chau,2019). Gha-landari, M. used computational fluid dynamics (CFD) to simulate the flow of Ag/water nanofluids in the root canal and evaluated the effect of injection height and nanofluid concentration on the shear stress of the root canal wall (Ghalandari, Elaheh, Fatemeh, Shahabbodin, & Chau,2019). In recent years, many studies have applied intelligent optimization algorithms or neural network related knowledge to the study of fluid mechanics for modeling and simulating fluid mechanics related prob-lems. In 2015, Taormina, R. developed a variable selec-tion method based on the rainfall-runoff simulaselec-tion experiment using binary coded discrete full informa-tion particle swarm optimizainforma-tion algorithm and ELM (Taormina & Chau, 2015). In 2018, Ardabili, S.F. ana-lyzed the accuracy of intelligent computing methods in hydrogen production modeling (Ardabili et al.,2018). In 2019, Yaseen, Z.M used the EELM algorithm to model potential disturbances in tropical river flow patterns and their physical spaces (Yaseen, Sulaiman, Deo, & Chau,
2019).
The data-driven model can complement the unmod-eled features of the mechanism model, so that the model not only has local approximation characteristics, but also has global approximation characteristics.
2. Mechanism model implementation of thick process
2.1. Establishment of mechanism model
The thickening process is based on gravity sedimenta-tion. Obviously, the slurry concentration depends on the settling time and the height of the space. Therefore, the slurry concentration can be expressed as C(z, t), where the z-axis is in the positive direction and t is the dense process time, as shown in Figure1. We make rational-ization assumptions, assuming that the settlement pro-cess is one-dimensional. Since gravity settlement and compression are essentially dimensional, the one-dimensional settlement model can capture the basic char-acteristics of the process well. The mass conservation
Figure 1.Internal working space distribution of thickener. relationship of the settlement process can be described by the partial differential equation of Equation (1) (Li,2006).
∂C ∂t +
∂
∂z(Cvs) = 0 (1)
where vs≥ 0 is the sedimentation rate of the slurry. The
equation contains two unknowns, slurry concentration C and sedimentation rate vs. Therefore, the solution to
this equation requires the establishment of a constitutive relationship between C and vs.
In a unit time, the growth of the arbitrary interval (z1, z2) is equal to the flow rate (|z=z1) of the z1height inflow minus the flow rate of the z2height flow(|z=z2) plus the flow generated in the interval, and the expression is as shown in Equation (2). d dt z2 z1 AC(z, t)dz = A(|z=z1− |z=z2) + z2 z1 Qf(t)Cf(t)δ(z)dz (2)
where Qfis the feed flow rate; A is the cross-sectional area
of the thickener; Cf is the feed concentration;δ(z) is the
δ function, δ(z) = 1 only at the feed layer, and at other heights,δ(z) = 0;
Flow is expressed as: C,∂C ∂z, z, t = F(C, z, t) − (γ (z)dcomp(C) + ddisp(z))∂C∂z (3) F(C, z, t) = ⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ −Qe(t)C/A z< −H vhs(C)C − Qe(t)C/A −H <z <0 vhs(C)C + Qu(t)C/A 0 < z < B +Qu(t)C/A z> B (4) γ (z) = ⎧ ⎨ ⎩ 1 −H ≤ z ≤ B 0 z< −Horz > B (5)
The layering idea (Guo, Zhang, Xu, & Chen,2009) is used to subdivide the thickener into n layers, so the height of each layer is z = (B + H)/n. The position of the boundary line between the layers is as shown in Figure2. The height of each boundary line can be calculated by the formula (6).
z= iz − H, i = 0, . . . N (6) The overflow layer z0 and the underflow layer zn fall
on the boundary line; the overflow layer z0= −H, the underflow layer zn = B, and the feeding inlet z = 0 is in
the interval of (zm−1, zm]; and the corresponding m-th
layer is the feeding layer. In the simulation scheme, the corresponding overflow zone and the underflow zone are respectively added with two layers at the top and bottom of the equation, the top two layers simulate the overflow
zone, and the bottom two layers simulate the under-flow zone. The overunder-flow turbidity Cetakes the 0-th layer
concentration. The underflow concentration Cn is the
n+ 1th layer concentration. Therefore, the calculation area consists of n+ 4 intervals of length z.
For each layer, Equation (2) can be rewritten as a simplified version of the mass conservation equation as follows: dCi dt = − Finum− Fnumi−1 z + 1
z(Jdisp,inum − Jdisp,inum−1+ Jcomp,inum − Jcomp,inum −1)
+QfCf
Azδi,m. i= −1, . . . , n + 2 (7) Jdisp(zi, t) ≈ Jdisp,inum := Jdisp,iCi+1− Ci
z (8)
Jcomp(zi, t) ≈ Jcomp,inum := γ (zi)
Dnumi+1 − Dnumi
z (9)
where Dnumi is the compression factor.
Since each term of Equation (7) does not exist in every layer, it is necessary to layer to establish more detailed machine differential equations. In the subsidence zone, the i-th (i= 2 . . . , n−1) layers are given by
dCi dt = Qe Az(Ci+1− Ci) − Gi− Gi−1 z + ddisp,i(Ci+1− Ci) − ddisp,i−1(Ci− Ci−1) +Dnum i+1 − 2Dnumi + Dnumi−1 (z)2 When i= m, the layer is the feed layer:
dCm dt = Qf Az(Cf − Cm) − Gm− Gm−1 z + ddisp,m(Cm+1− Cm)−ddisp,m−1(Cm−Cm−1) +Dnum m+1− 2Dnumm + Dnumm−1 (z)2 For the underflow layer:
dCn+1 dt = − Qu Az(Cn+1− Cn) +zGn + (Dnumn+1− Dnumn ) (z)2
where Cf is the feed concentration; Dnumi is the
disper-sion coefficient, n is the number of stratified layers; z is the height of the thickener; zf is the feed height; vsis the
settling velocity; C is the concentration of the pulp; and
Giis given by formula (10). Gi = Gi(Ci, Ci+1) = ⎧ ⎪ ⎨ ⎪ ⎩ min Ci≤C≤Ci+1 Cvhs(C) Ci ≤ Ci+1 max Ci>C>Ci+`1Cvhs(C) Ci > Ci+1 (10)
2.2. Application of Bernoulli principle
Based on the limitations of the field conditions, the flow velocity of the fluid cannot be measured by the instru-ment. Therefore, the Bernoulli equation (Zhang,2016) of fluid mechanics is used to convert the pressure data measured by the field instrument into the flow veloc-ity suitable for the mechanism model for subsequent data processing; this approach is of great convenience. At the same time, the gravitational potential energy of the fluid at the same level is ignored, so the formula for the flow rate and pressure of the fluid can be expressed as Equation (11)
v=
2p
ρ (11)
In order to carry out the next simulation analysis, Table1
gives the model parameters of the dense washing process mechanism model.
2.3. Auxiliary variable selection
Auxiliary variables include the type of variable, the num-ber of variables, and the location of the monitoring point. The choice of auxiliary variables is the first step in estab-lishing a process data model. This step determines the input of the soft-measurement model and thus has a direct impact on the model structure, which is critical to the success of the model. The selection of auxiliary vari-ables is generally through the mechanism analysis, clari-fying the task of the soft-measurement model, determin-ing the dominant variables, and selectdetermin-ing the main influ-encing factors from the measurable variables affecting the dominant variables. The task of the mechanism model in this paper is to predict the underflow concentration of the key variables in the dense washing process to determine the dominant variable of the soft-measurement model as the underflow concentration. Variables that have a large impact on the underflow concentration include top vol-ume flow, feed flow, underflow bulk density, and fluid flow rate. These four variables can be measured by the inspection device at the industrial site, so we choose them as input variables of the model, and choose underflow concentration as output variable of the model. Table2is the auxiliary variable.
Table 1.Parameter set for the mechanism model of the thickener.
Model
parameter Parameter meaning
Parameter
values Unit A Thickener cross section 1962.5 [m2]
z Thickener height 5.95 [m]
Zf Feed layer height 2.06 [m]
r Free settlement velocity undetermined parameter 1
1.3359 [−] k0 Free settlement velocity
undetermined parameter 2
0.0325 [m/h] n0 Free settlement velocity
undetermined parameter 3
0.0001785 [m3/kg] k1 Undetermined parameters of
the velocity of interference settlement 1
6.79 [m/h]
n1 Undetermined parameters of
the velocity of interference settlement 2
0.000425 [m3/kg]
k2 Pressure settlement velocity
undetermined parameter 1
70.9 [m/h]
n2 Pressure settlement velocity
undetermined parameter 2 0.000875 [m3/kg] β Settlement function parameter 1 4.00 [kg/m3] α Settlement function parameter 2 4.00 [Pa] α1 Diffusion function parameter 1 0.001 [m−1] α2 Diffusion function parameter 2 0.0026 [s/m2] CC ritical concentration 6.00 [kg/m3] g acceleration of gravity 9.81 [m/s2] [kg/m3] ρs Solid density 1050 [kg/m3]
ρ Density difference between solid and liquid
50 [kg/m3]
Table 2.Variable tables.
Order number Variable name Variable meaning
1 x1 Top volume flow
2 x2 Underflow bulk density
3 x3 Feed flow
4 x4 Fluid flow rate
5 y Underflow concentration
2.4. Data preprocessing
In the actual measurement data, there are often individ-ual measurement data that obviously exceed the general range of measurement data, that is, the measurement value is too far from the average level of the remain-ing measurement values; such data are usually set as an abnormal value. Outliers can usually be processed using the 3σ principle. In general, for a sample set x, if there is only a random error in the sample, the normal distri-bution law of the random error is statistically calculated, and the data with the absolute value of the deviation greater than 3σ are regarded as abnormal data and are eliminated. The specific implementation method is:
For the measurement data [x1, x2, x3, x4], the average value is first calculated according to formula (12), and the estimated value of the standard deviation is calculated
Figure 3.Mechanism model parameter identification optimiza-tion model. according to Equation (13). ¯x = 1 n n i=1 xi (12) σ = 1 n n i=1 (xi− ¯x)2 (13)
It is assumed that for any data point xd, if|xd− ¯x| > 3σ
is satisfied, the data is regarded as an outlier according to the 3σ principle, and xdshould be excluded from the
measurement data. After the xdis removed, theσ value is
recalculated for the retained data, the outlier detection is repeated, and the iterative operation is repeated until all outliers are eliminated.
2.5. Parameter identification and simulation of mechanism model
Figure3 shows the recursive least squares (RLS) opti-mized thickener mechanism model.
Improve the parameters of the thickener mechanism model, ⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ dCi dt = g1ADzQe (Ci+1− Ci)− Gi−Gi−1 Dz
+ddisp,i(Ci+1−Ci)−ddisp,i−1(Ci−Ci−1)+Dnumi+1−2Dnumi +Dnumi−1 (Dz)2 dCm dt = g2 Qf ADz(Cf − Cm)− Gm−Gm−1 Dz + ddisp,m(Cm+1− Cm)− ddisp,m−1(Cm− Cm−1) +Dnum m+1− 2Dnumm + Dnumm−1 (Dz)2 dCn+1 dt = −g3ADzQu (Cn+1− Cn)+ g4GDzn +(Dnumn+1−Dnumn ) (Dz)2 + g5
where A is the cross-sectional area of the thickener, H is used to describe the height of the clarification zone, B is the depth of the settlement zone,z = (B + H)/n is the height of each layer, Qd is the top volume flow
rate, Qf is the feed flow, Qu is the underflow bulk
den-sity, Dnumi is the compressibility factor that contains the pulp concentration and density, Gicontains the settling
Table 3.Results of RLS parameter identification.
Parameter γ1 γ2 γ3 γ4 γ5
Identification result 0.2327 0.0293 0.2585 0.7427 0.5155
Figure 4.Prediction results of mechanism model. (a) Prediction error of underflow concentration. (b) Prediction effect of under-flow concentration.
velocity model v, andγ1,γ2,γ3,γ4,γ5are undetermined parameters.
Fifty sets of data were randomly selected from the 190 sets of data measured in the field to identify the parameters of the RLS identification thickener mecha-nism model. Table3shows the parameters of the mecha-nism model after identification.
Figure4shows the simulation results of the mecha-nism model after the RLS parameter identification.
From Figure4(a), it can be seen that the error of the mechanism model after RLS parameter identification is a bit large. So further modification is needed to make
the predicted value of the mechanism model closer to the actual value. Figure 4(b) shows that although the mechanism model of parameter identification has a good predictive effect on the trend of the process, the actual industrial process is often complicated, some factors in the process are constantly changing, and its changes are difficult to accurately describe, leads to large deviations in the prediction results of the model. Thus, the data cannot be applied to the real-time monitoring of industrial field data. As a result, further correction is required to obtain a more accurate prediction effect.
This chapter establishes a mechanism model of slurry concentration distribution in dense washing process based on solid flux theory and mass conservation theory. Then, the Bernoulli principle of fluid mechanics is used to convert the pressure data measured in the field into the flow velocity suitable for the mechanism model, which brings great convenience to the subsequent data pro-cessing. Finally, the parameter identification mechanism model of RLS algorithm is adopted. By simulating the mechanism model of parameter identification, the main factors affecting the dense washing process are found out. The mechanism model for measuring the underflow con-centration is established, which provides a direction for subsequent optimization.
3. Data model establishment based on TELM 3.1. Introduction of ELM
Extreme Learning Machine (ELM) is a learning algorithm for a single hidden layer feedforward neural networks (SLFNs). In the ELM model, the weights and threshold parameters of the network are given randomly before training, and no iteration correction is needed. It is only to manually select the number of hidden layer nodes. After the least squares operation, the only optimal solu-tion of the ELM network can be obtained without falling into the local optimum.
With the increasing popularity of neural networks, the research and innovation of the algorithm have devel-oped rapidly because of the characteristics of the ELM algorithm (Huang, 2014; Huang, Zhu, & Siew, 2004,
2006). At the same time, there are many studies dedicated to improving ELM. The incremental extreme learning machine (Huang & Chen,2007,2008), the error mini-mum extreme learning machine (Feng & Huang,2009), the reconstructed hidden layer node extreme learning machine (Lan, Soh, & Huang,2010), the extreme learning machine for optimal middle-layer node selection (Miche, Sorjamaa, & Bas, 2010), the extreme learning machine with regularization term (Deng, Zheng, & Chen,2009),
Figure 5.The network structure of the ELM.
the extreme learning machine using integrated learn-ing ideas (Liu & Wang,2010), and the extreme learning machine combined with deep learning ideas (Ma,2015). Continuous improvement and development have allowed the ELM algorithm to be successfully applied to all walks of life, especially in the fields of classification, regression, and pattern recognition.
Figure5shows the ELM network structure in which neurons are connected together in adjacent network lay-ers in the network. The input layer have n number of neuron nodes, and there are n dimensions correspond-ing to one input data sample. The number of hidden layer neuron nodes is l, which are randomly set by experience. The output layer has m neurons nodes.
Equation (14) gives the connection weight matrix w between the input layer and the hidden layer.
w= ⎡ ⎢ ⎢ ⎢ ⎣ w11 w12 · · · w1n w21 w22 · · · w2n .. . ... wl1 wl2 · · · wln ⎤ ⎥ ⎥ ⎥ ⎦ l×n (14)
where wjirepresents the connection weight between the
j-th neuron in the hidden layer and the i-thneuron in the input layer.
Let the connection weightβ between the output layer and the hidden layer be:
β = ⎡ ⎢ ⎢ ⎢ ⎣ β11 β12 · · · β1m β21 β22 · · · β2m .. . ... βl1 βl2 · · · βlm ⎤ ⎥ ⎥ ⎥ ⎦ l×m (15)
whereβjkrepresents the connection weight between the
k-th neuron of the output layer and the j-th neuron of the hidden layer.
The hidden layer threshold b is given by Equation (16)
b= ⎡ ⎢ ⎢ ⎢ ⎣ b1 b2 .. . bl ⎤ ⎥ ⎥ ⎥ ⎦ l×1 (16)
Suppose the input matrix with Q training set samples is X(n × Q) and the marking matrix is Y(m × Q).
X= ⎡ ⎢ ⎢ ⎢ ⎣ x11 x12 · · · x1Q x21 x22 · · · x2Q .. . ... xn1 xn2 · · · xnQ ⎤ ⎥ ⎥ ⎥ ⎦ Y= ⎡ ⎢ ⎢ ⎢ ⎣ y11 y12 · · · ymQ y21 y22 · · · ymQ .. . ... ym1 ym2 · · · ymQ ⎤ ⎥ ⎥ ⎥ ⎦ (17) ELM output is T=t1 t2 · · · tQ m×Q tj = ⎡ ⎢ ⎢ ⎢ ⎣ t1j t2j .. . tmj ⎤ ⎥ ⎥ ⎥ ⎦ m×1 = ⎡ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ l i=1βi1g(wi xj+ bi) l i=1 βi2g(wixj+ bi) .. . l i=1 βimg(wixj+ bi) ⎤ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ m×1 j= 1, 2, . . . , Q (18) where wi= [wi1 wi2 . . . win], xj = [x1j x2j . . . xnj]T g(x) is activation function.
Equation (18) can also be expressed as
Hβ = T (19)
where H is the output matrix of the ELM hidden layer,
H= ⎡ ⎢ ⎣ g(w1x1+ b1) · · · g(wlx1+ bl) .. . . .. ... g(w1xQ+ b1) · · · g(wlxQ+ bl) ⎤ ⎥ ⎦ Q×l and Tis the transpose of the tag matrix T.
Then the output weight can be obtained from Equation (20).
β = H+T (20)
where H+is the Moore-Penrose generalized inverse of the output matrix H.
3.2. Selection of activation function and the number of hidden layer nodes
In artificial neural networks, the activation function has a huge impact on the model. Since the data set processed by the neural network may have low nonlinearity and affect the training of the model, it is necessary to add an activa-tion funcactiva-tion to improve the nonlinearity of the model. When the activation function is a linear function, the final output and the input must also be linear, so that the hidden layer in the network does not reflect the value of its hidden layer, so the activation function needs to use nonlinear function. At the same time, when the expres-sion ability of linear data is weak, the expresexpres-sion ability of the model can be improved by adding nonlinear factors by adding an activation function.
A suitable activation function can improve perfor-mance of the ELM. The activation function must sat-isfy conditions such as nonlinearity, differentiability, and monotonicity.
Some commonly used activation functions and their mathematical expressions are as follows:
Sigmoid function: g(x) = 1/(1 + e−x) Linear function: g(x) = kx + c ReLU: g(x) = max(x, 0)
Sinusoidal function: g(x) = sin(x) Log function: g(x) = ln(x)
Hyperbolic tangent function: g(x) = eexx−e+e−x−x Hardlim function: g(x) = 1, x≥ 0 0, x< 0 Polynomial function: g(x) = 0.1 × (ex+ x2cos(x2) + x2) Radbas function:g(x) = e−n2 Satlin function: g(x) = ⎧ ⎪ ⎨ ⎪ ⎩ 1, x≥ 1 x, 0< x < 1 0, x< 0
At present, there is no definitive method to select the number of neuron nodes in the hidden layer of the artificial network. Most scholars have conducted exper-iments based on their own experience and selected the best results. We also determined the number of neuron nodes in the hidden layer by the method of ‘optimizing from multiple training’.
3.3. Introduction of TELM
The limit learning machine of the three hidden layer net-work structure (TELM) (Xiao, Li, & Mao, 2017) adds two hidden layers on the basis of the classical extreme learning machine, and each layer of neurons is in a fully
Figure 6.TELM flow and structure. (a) The work flow of the TELM. (b) The structure of the TELM.
connected state. At the same time, the algorithm inherits the advantage of the ELM algorithm to randomly ini-tialize the first hidden layer parameter, and introduces a method to obtain the parameters of the other hidden layer. The method passes the layer-by-layer transmission of network parameters between different hidden layers, and the final output is closer to the actual result than the traditional ELM model, and also inherits the advantages of the traditional extreme learning machine.
Figure6is the network structure and algorithm flow of TELM.
Suppose a given input training set sample is{X, T} = {xi, ti}(i = 1, 2, . . . , Q), where X is the input sample and
T is the flag sample. According to the TELM algorithm, we first consider the three hidden layers in the TELM net-work as two hidden layers (the latter two hidden layers as an implicit layer). The weight matrix of the first hidden layer and the threshold parameters are randomly initial-ized. It can be known from the ELM algorithm that the expected output of the third hidden layer can be obtained by Equation (21).
whereβnew+ is the generalized inverse matrix ofβnew.
Next, the third hidden layer of the network is added. The predicted output H3of the third hidden layer is given by Equation (22).
H3= g(W2H2+ B2) (22)
where W2is the weight matrix between the second hid-den layer and the third hidhid-den layer, B2is the threshold of the third hidden layer, and H2is the output matrix of the hidden layer and is given here as the second implicit output matrix of the layer.
Make the predicted output of the third hidden layer infinitely close to the expected output for H3= H3∗.
We assume the matrix WHE1= [B2W2]; thus, the
weight W2and threshold B2of the third hidden layer can be solved by Equation (23).
WHE1= g−1(H3∗)HE1+ (23)
where HE1+ is the generalized inverse matrix of matrix HE1= [1 H2]T, 1 represents a vector with Q elements
and each element is 1, and g−1(x) is the inverse of activa-tion funcactiva-tion g(x).
The predicted output of the third hidden layer can be updated by Equation (24)
H4= g(W2H2+ B2) = g(WHE1HE1) (24)
Thenβnewcan be updated by Equation (25).
βnew1= H4+T (25)
The TELM neural network outputf(x) can be finally obtained by Equation (26).
f(x) = H4βnew1 (26)
3.4. Data model establishment and simulation
Data modeling is to find out the relationship between input and output by performing data processing and sta-tistical analysis on the historical data of the controlled process. Data modeling does not require knowing the specific process to avoid complex mechanism analysis. It only needs to determine the input and output of the model, so it is convenient.
The data collected from the industrial site is processed, and 140 sets of data are randomly selected from the 190 sets of data as training set data, 50 sets of data are used as test set data. Use the TELM algorithm to build a data model.
Figure7(a) shows that the error of the predicted value generated by the simple data model still has a certain deviation from the actual value; thus, the predicted data
Figure 7.Prediction results of data model. (a) Prediction error of TELM test set. (b) TELM test set output comparison.
don’t meet the actual measurement requirements of the industrial site, and the data model needs further opti-mization. Figure7(b) shows the output of the data model. A certain deviation between the predicted output of the data model and the actual value is found. Because the data model only depends on the process data, the information source is single and cannot reflect the process character-istics. The structure of the data model has great subjec-tivity, resulting in poor generalization, and the model is not explanatory; moreover, it is easy to cause over-fitting phenomenon and predictive output inaccuracy.
In this chapter, after pre-processing the data collected at the process site, a data model is established. Through simulation analysis, it can be seen that the establishment of the data model avoids the analysis of complex mecha-nisms, and only needs to determine the input and output
of the model and the solution of the model is very con-venient. However, because it only relies on process data and the data source is single, the simulation results still have certain deviations from the actual ones, which can not meet the measurement requirements of the industrial site, but provide a clear direction for the establishment and optimization of the next hybrid model.
4. Establishment of a hybrid model
The advantage of mechanism modeling is to reflect the law of the process, high reliability, good extrapolation and interpretability. The disadvantage is that the modeling process is cumbersome and relies on prior knowledge. For some complex processes, reasonable assumptions are needed to obtain a simplified model of the controlled pro-cess. However, the accuracy of the simplified mechanism model cannot be guaranteed. The advantage of data mod-eling is that the process model can be directly established according to the input and output data of the process, without the prior knowledge of the process object, and avoiding the analysis of complex mechanisms. The dis-advantage is that the model has poor performance, is not interpretable, easily causes over-fitting, and may even fit in the noise, resulting in instability of the model. In summary, the use of a mechanistic model alone or a sepa-rate data model has significant drawbacks. Therefore, this paper proposes a method of combining mechanism mod-eling and data-driven modmod-eling, so that the mechanism model and the data model can complement each other.
The data compensation model based on the EDO-TELM algorithm combined with the mechanism model of the thick washing process constitutes a hybrid model of the dense washing process, as shown in Figure8. The model uses a mechanism model to describe the overall characteristics of the dense washing process. As the error compensation model of the mechanism model, the data model establishes the relationship between the deviation of the mechanism model and the process measurable variables. The error generated by the mechanism model
Figure 8.Hybrid model structure of dense washing process.
is taken as the output of the data model, and the input data is used as an input sample to train the compensator, that is, the EDO-TELM model. The mechanism model is added to the predicted value of the compensator as the estimated value of the model. Therefore, EDO-TELM is used to compensate the deviation of the mechanism model, that is, EDO-TELM is used to compensate the error of the un-modeled part, and the uncertain part of the model is given a reasonable estimate, which greatly reduces the model error. The prediction accuracy of the model is improved.
The input and output relationship of the hybrid model can be expressed as follows:
Yt= Ym(x1, x2, x3, x4) + YEDO−TELM(x1, x2, x3, x4)
(27) In the formula, x1, x2, x3, x4represents four measurable auxiliary variables; function Ym(x1, x2, x3, x4) represents
the predicted output of the mechanism model; function YEDO−TELM(x1, x2, x3, x4) represents the compensation
value of the EDO-TELM compensation model for the output error of the mechanism model; Ytrepresents the
predicted output of the hybrid model, that is, the model estimate.
4.1. Introduction of EDO
The entire distribution optimization algorithm (Xue, Liu, & Xiao,2015) (EDO) is a new optimization algorithm derived from the PSO (Chen,2017) algorithm, which is based on summarizing the population distribution law of the PSO algorithm. Compared with the PSO algorithm, it has the characteristics of simple implementation, fast convergence and strong robustness. Figure9shows the specific calculation process.
4.2. TELM optimized with EDO
In the process of optimizing the three-layer extreme learning machine (EDO-TELM) in the overall distribu-tion optimizadistribu-tion algorithm, the posidistribu-tion of each particle in the EDO algorithm corresponds to the input weight and offset vector of the TELM. The dimension of the par-ticle is determined by the number of input layer and first hidden layer neurons, and the training set output error is used as the fitness value. The particle swarm searches for input weights and offset vectors that minimize the TELM output error by moving the search within the weight space.
The steps of EDO optimization algorithm optimizes the TELM as follows:
Figure 9.Program flow chart of EDO.
In this paper, the specific implementation steps of optimizing the TELM by using the EDO algorithm are as follows:
Step 1: Initialize the TELM: S Set the number of input and output neurons in the network and select the appro-priate activation function.
Step 2: Initialize the EDO: Randomly generate the swarm within the entire domain while initializing the Cauchy distribution to cover a radius of 0.5 times the entire domain. Set the following parameters: Cauchy distribution scale parameter r= 0.1, swarm diameter decline rate a= 0.93, number of stagnation times b = 9, maximum iteration number 10,000 or swarm diameter scale less than 0.000001, and swarm size 70.
Step 3: Calculate the fitness value: For all the particles, using the TELM model, calculate their respective out-put values and finally obtain the sample error; the sample error is the fitness of each particle.
Determine whether the maximum number of itera-tions is reached, or whether the fitness value meets the requirements, if the condition is satisfied, go to the step 6, otherwise continue to execute downward;
Step 4: Update the global extremum and the individual extremum of each particle: the best individual of this time is compared with the last best individual; if it is better than the last best individual, then replace the last optimal
individual, As the optimal individual of this time, the swarm diameter remains unchanged; if it is worse than the last optimal individual, then the last optimal indi-vidual is retained as the optimal indiindi-vidual of this time, and the number of stagnation times is reduced by one. If the number of stagnation times is 0, then the swarm diameter is reduced to 0.93 of the original diameter, and the number of stagnation times is set to 9. If the number of stagnation times is not 0, then the original diameter remains unchanged. The number of iterations is reduced by 1.
Step 5: Focus on the location of the best individual found, and then generate a new swarm with the Cauchy distribution.
Step 6: When the maximum number of iterations is reached or the scale of the swarm diameter is less than 0.000001, the algorithm stops iteratively, and the input weight and offset vector of the TELM corresponding to the global extremum are the optimal solution of the prob-lem. An excellent solution and input test samples for forecasting are obtained.
4.3. Simulation and analysis of hybrid models
The field measured input variables x1, x2, x3, x4(that is Qε, Qf, Qu, Qv) are selected to compare the predicted
out-put of the mechanism model, the data model and the hybrid model. The predicted result is the underflow con-centration, and compare the prediction errors. The fol-lowing three tables are numerical comparisons of selected data for the experimental results.
It can be seen from Table4(Panel A) that the error of the mechanism model after parameter identification is large, among them, the maximum error rate is more than 10%, indicating that the pure mechanism model is not suitable for complex industrial processes and needs to be combined with other modeling methods. Table4(Panel B) shows that the prediction error of the data model is also relatively large and cannot be adapted to the needs of industrial field measurement. The data in Table4(Panel C) shows that the predicted value of the hybrid model is approaching the actual value, and the error rate is about 5%, which is 5 percentage points higher than the accuracy of the mechanism model, which meets the measurement needs of complex industrial sites.
In order to verify the prediction effect of the hybrid model, 190 data generated by the mechanism model was used for simulation. Among them, 130 groups were used as training samples and 60 groups were used as test samples. Figure10shows the simulation results.
Figure10(a) shows the prediction error of the hybrid model. the prediction error is below 2% and the prediction accuracy is significantly improved. Figure10(b)
Table 4.Comparison of predicted and actual values under three models.
Panel A. Mechanism model predictive output compared with actual values (unit:kg/m3)
Actual value 783 796 619 615 568 576 633 666 678 646 568 547 666 564
Predicted value 823.6 841.9 654.2 650.8 613.3 616.6 89.6 733.5 712.3 690.1 613.3 584.5 733.5 615.1 Error 40.6 45.9 35.2 35.8 45.3 40.6 56.6 67.5 34.3 44.1 45.3 37.5 67.5 51.1 Error rate 0.052 0.058 0.057 0.058 0.08 0.071 0.089 0.101 0.051 0.068 0.074 0.064 0.092 0.083
Panel B. Data model predictive output compared with actual values (unit:kg/m3)
Actual value 783 796 619 615 568 576 633 666 678 646 568 547 666 564
Predicted value 823.6 841.9 654.2 650.8 613.3 616.6 689.6 733.5 712.3 690.1 613.3 584.5 733.5 615.1 Error 40.6 45.9 35.2 35.8 45.3 40.6 56.6 67.5 34.3 44.1 45.3 37.5 67.5 51.1 Error rate 0.052 0.058 0.057 0.058 0.08 0.071 0.089 0.101 0.051 0.068 0.074 0.064 0.092 0.083
Panel C. Hybrid model predictive output compared with actual values (unit:kg/m3)
Actual value 709 646 619 611 622 718 666 712 736 696 604 529 622 596
Predicted value 732.4 668.1 641.5 603.0 630.9 720.4 656.6 717.4 746.9 712.7 596.9 556.6 630.9 605.0 Error 23.4 22.1 22.5 −8.0 8.9 2.4 −9.4 5.44 10.9 16.7 −7.1 27.6 8.9 9.0 Error rate 0.033 0.034 0.036 −0.01 0.014 0.003 −0.01 0.008 0.015 0.024 −0.01 0.052 0.014 0.015
Figure 10.Prediction results of hybrid model. (a) Prediction error of underflow concentration. (b) Predictio of underflow concentra-tion.
is the output of the predicted value of the hybrid model. It can be seen that the predicted and actual values of the hybrid model are very close. The hybrid model can
compensate for the deviation of the mechanism model, and the prediction effect is better and the precision is higher, which can meet the actual production require-ments.
In this section, the EDO algorithm is used to opti-mize the parameters of the first hidden layer of TELM. A method combining the mechanism model and the mod-eling based on EDO-TELM data compensation model is proposed. The experimental results show that the hybrid model achieves an accurate prediction of the underflow concentration of the thickener, which can meet the mea-surement requirements of the underflow concentration of the thickener in complex industrial production.
5. Conclusion
For the prediction of the underflow concentration in the thick washing process, this paper uses the hybrid mod-eling method to establish a soft measurement model of the dense washing process, which is composed of the optimized mechanism model and the data compensa-tion model. The mechanism model is used to describe the overall trend of the dense washing process and reduce the amount of calculation of the model. The data model is used to compensate for errors in the mechanism model. Considering the nonlinear charac-teristics of the thick washing process, the mechanism model is parameterized to improve the accuracy of the mechanism model modeling. The data compensation model uses the improved EDO-TELM algorithm. In this paper, the mechanism model, mathematical model and hybrid model are simulated respectively. Simula-tion results prove that compared with the mechanism model and the data model, the hybrid model with higher prediction accuracy.
Disclosure statement
Funding
This research work is supported by National Natural Science Foundation of China [grant numbers 41371437, 61203214, 61473072, 61773105, 61374147 and 61733003]; Fundamen-tal Research Funds for the Central Universities [grant num-bers N180404012, N180704013, N182608003, N150402001, N160404008 and N172410005-7]; National Key Research and Development Plan [grant number 2016YFC0801602]; National Training Program of Innovation and Entrepreneurship for Undergraduates [grant number 201810145161]. National Twelfth Five-Year Plan for Science and Technology Support (2015BAB15B01) PR China. Fundamental Research Funds for Liaoning Natural Science Foundation (Grant No. 2019-MS-120).
References
Ardabili, S. F., Najafi, B., Shmshirband, S., Bidgoli, M. B., Deo, R. C., & Chau, K. (2018). Computational intelligence approach for modeling hydrogen production: A review.
Engineering Applications of Computational Fluid Mechanics, 12(1), 438–458.
Bürger, R., Concha, F., Karlsen, K., & Narváez, A. (2006). Numerical simulation of clarifier-thickener units handling of the ideal suspensions with a flux density function having two inflection points. Mathematical and Computer Modelling, 44, 255–275.
Bürger, R., Diehl, S., Farås, S., Nopens, I., & Torfs, E. (2012). A consistent modeling methodology for secondary settling tanks: A reliable numerical method. Water Research, 46(2), 2254–2278.
Bürger, R., Diehl, S., & Nopens, I. (2011). A consistent mod-elling methodology for secondary settling tanks in wastew-ater treatment. Wwastew-ater Research, 45, 2247–2260.
Chen, J. Q. (2017). Compression perception based on improved particle swarm optimization. Signal Processing Device, 33(4), 488–495.
Deng, W. Y., Zheng, Q. H., & Chen, L. (2009). Regularized
extreme learning machine. 2009 IEEE symposium on
compu-tational intelligence and data mining, Nashville, TN, USA. Feng, G., & Huang, G. B. (2009). Error minimized extreme
learning machine with growth of hidden nodes and incre-mental learning. IEEE Transactions on Neural Networks,
20(8), 1352–1357.
Ghalandari, M., Elaheh, M. K., Fatemeh, M., Shahabbodin, S., & Chau, K. (2019). Numerical simulation of nanofluid flow inside root canal. Engineering Applications of Computational
Fluid Mechanics, 13(1), 254–264. 2019.
Gui, W. H., Yang, C. H., Chen, X. F., & Wang, Y. L. (2013). Problems and challenges in modeling and optimization of non-ferrous metallurgical processes. Acta Automatica Sinica,
39(3), 197–207.
Gui, J. L., & Zhao, F. G. (2007). Application of high-efficiency deep cone thickener in Meishan concentrator. Mining
Express, 3, 70–71.
Guo, L. S., Zhang, D. J., Xu, D. Y., & Chen, Y. (2009). An experimental study of low concentration sludge settling velocity under turbulent condition. Water Research, 43(3), 2383–2390.
Huang, G. B. (2014). An insight into extreme learning
machines: Random neurons, random features and kernels.
Cognitive Computation, 6(3), 376–390.
Huang, G. B., & Chen, L. (2007). Convexin cremental extreme learning machine. Neurocomputing, 70(3), 3056–3062. Huang, G. B., & Chen, L. (2008). Enhanced random search
based incremental extreme learning machine.
Neurocomput-ing, 71(3), 3060–3068.
Huang, G. B., Zhu, Q. Y., & Siew, C. K. (2004). Extreme learning machine: A new learning scheme of feed forward neural net-works. 2004 IEEE International Joint Conference on Neural
Networks, 2(2), 985–990.
Huang, G. B., Zhu, Q. Y., & Siew, C. K. (2006). Extreme learning machine: Theory and applications. Neurocomputing, 70(1), 489–501.
Ichiro Tetsuo, etc. (1986). Water treatment engineering theory
and application. Beijing: China Building Industry Press. (pp.
60–100).
Kynch, G. J. (1952). A theory of sedimentation. Transactions of
the Faraday Society, 48(7), 166–176.
Lan, Y., Soh, Y. C., & Huang, G. B. (2010). Constructive hidden nodes selection of extreme learning machine for regression.
Neurocomputing, 10(16), 5–22.
Li, Z. L. (2006). One-dimensional flux model of secondary
set-tling tank and its application. Chongqing: Chongqing
Uni-versity.
Liu, X. D. (2007). Application of interface detector in mud layer of sedimentation tank. Automatic Instrument and
Instru-ment, 1, 52–53.
Liu, N., & Wang, H. (2010). Ensemble based extreme learning machine. IEEE Signal Processing Letters, 17(8), 754–757. Ma, M. M. (2015). Research on algorithm of ultimate learning
machine based on deep learning. Shandong: ocean University
of China.
Ma, X. X. (2004). Study on the treatment of acidic water in high-concentration mines with high-efficiency thickener. China
Mine Engineering, 33(5), 17–19.
Ma, S. J., & Cui, S. T. (2013). Application of mine automatic thickening machine automatic equalizing system. Science &
Technology Information, 2, 77.
Miao, T. Y., Wang, X., Wang, Q. K., & Mao, X. (2009). Research and application of automatic control of thickener production process in wastewater treatment process. Digital Technology
and Application, 9(5), 18–20.
Miche, Y., Sorjamaa, A., & Bas, P. (2010). OP-ELM: Optimally Pruned Extreme Learning Machine. IEEE Transactions on
Neural Networks, 21(1), 158–162.
Ramezanizadeh, M., Nazari, M. A., Ahmadi, M. H., &
Chau, K. (2019). Experimental and numerical analysis
of a nanofluidic thermosyphon heat exchanger.
Engineer-ing Applications of Computational Fluid Mechanics, 13(1),
40–47.
Taormina, R., & Chau, K. W. (2015). Data-driven input vari-able selection for rainfall-runoff modeling using binary-coded particle swarm optimization and extreme learning Machines. Journal of Hydrology, 529(3), 1617–1632. Wang, F. L., He, D. K., & Chang, Y. Q. (2012). 2012
imple-mentation report of comprehensive intelligent integrated opti-mization control technology for hydrometallurgy. Shenyang,
China: Northeastern University.
Xiao, D., Li, B. J., & Mao, Y. C. (2017). A multiple hidden lay-ers extreme learning machine method and its application.
Mathematical Problems in Engineering, 2017, 10.
Xue, C. P., Liu, J. Y., & Xiao, D. (2015). Based on greedy strat-egy overall distribution optimization algorithm solving 0-1
knapsack problem. Journal of Northeast Normal University,
2015(2), 53–57.
Yang, H., & Chen, W. (2002). Automatic control of 50m large thickener. Metal Mines, 318(12), 38–40.
Yaseen, Z. M., Sulaiman, S. O., Deo, R. C., & Chau, K. (2019). An enhanced extreme learning machine model for river flow
forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction.
Journal of Hydrology, 569, 387–408.
Zhang, L. (2016). Talking about the application of Bernoulli’s
equation in fluid mechanics. Education Education