House Price Prediction Analysis using Machine
Learning
Swathi B
1, Shravani V
21, 2New Horizon College of engineering Bangalore
Abstract: Land is the least straightforward industry in our biological system. Lodging costs continue changing all day every day and now and then are advertised instead of being founded on valuation. Anticipating lodging costs with genuine elements is the principle essence of our exploration venture. Here we plan to make our assessments dependent on each fundamental parameter that is considered while deciding the cost. We utilize different relapse systems in this pathway, and our outcomes are not sole assurance of one method rather it is the weighted mean of different procedures to give most exact outcomes. The outcomes demonstrated that this methodology yields least blunder and most extreme precision than individual calculations connected. We additionally propose to utilize ongoing neighbourhood subtleties utilizing Google maps to get accurate genuine valuations. House costs increment consistently, so there is a requirement for a framework to anticipate house costs later on. House value expectation can enable the engineer to decide the selling cost of a house and can assist the client with arranging the correct time to buy a house. There are three factors that impact the cost of a house which include physical conditions, idea and area.
Keywords: Linear regression, Machine Learning, Prediction, Parameters, Boosted Regression, Forest Regression, Neural Network.
I. INTRODUCTION
Land Property isn't just the essential need of a man however today it likewise speaks to the wealth and glory of an individual. Interest in land by and large is by all accounts productive in light of the fact that their property estimations don't decrease quickly. Changes in the land cost can influence different family financial specialists, investors, strategy producers and many. Interest in land part is by all accounts an alluring decision for the ventures. In this way, foreseeing the land esteem is a significant financial index. India positions second on the planet in number of families as indicated by 2011 evaluation with various 24.67 crore. India is likewise the quickest developing real economy in front of China with previous' development rate as 7% this year and anticipated to be 7.2% in the following year. As per the 2017 rendition of Emerging Trends in Real Estate Asia Pacific, Mumbai and Bangalore are the top-positioned urban communities for speculation and improvement. Costs of the land property are identified with the monetary states of the state [1]. Regardless of this, we are not having appropriate institutionalized approaches to gauge the land property values. Generally the property estimations ascend as for time and its assessed esteem should be determined. This assessed esteem is required amid the clearance of property or while applying for the credit and for the attractiveness of the property. These evaluated qualities are determined by the expert appraisers. Nonetheless, disadvantage of this training is that these appraisers could be one-sided due to offered interests from purchasers, vender’s or home loans. In this manner, we require a robotized forecast model that can foresee the property estimations with no predisposition. This mechanized model can help the first run through purchasers and less experienced clients to comprehend whether the property rates are exaggerated or underrated. Now, Property costs rely upon different parameters in the economy and society. Nonetheless, past examinations demonstrate that house costs are unequivocally subject to the extent of the house and its land area then we has connected these parameter esteems to two distinctive AI calculations. We have considered straight relapse model and bolster vector relapse model to anticipate the value estimation of the house and looked at their output. In this paper, we are foreseeing house value esteems utilizing two models for example Linear regression, bolster vector relapse, Forest Regression and Decision Tree .
II. LITRETURE SURVEY
A. A House Price Affecting Factors
significant factor in forming the cost of a house. This is on the grounds that the area decides the common land price[4]. Moreover, the area additionally decides the simple entry to open offices, for example, schools, grounds, emergency clinics and wellbeing focuses, just as family diversion offices, for example, shopping centres, culinary visits, or even offer a delightful scenery[5] [6]. In most recent two decades estimating the property estimation has turned into a significant field. Ascend in the interest for property and unusual conduct of economy force analysts to discover a way that anticipate the land costs with no predispositions. Accordingly, it is a test for specialists to discover all the moment factors that can influence the expense of property and make a prescient model by contemplating every one of the components. Building a prescient model for land value valuation requires exhaustive information regarding the matter. Numerous scientists have taken a shot at this issue and imparted their examination work. Most of this exploration work is enlivened from [7]. The creator has scratched the lodging informational index from Centris.ca and duProprio.com. Their dataset comprises of roughly 25,000 models and 130 elements. Around 70 highlights were scratched from the above sites and real home organizations, for example, RE/MAX, Century 21, and Sutton, and so on. Other 60 highlights were socio demo graphic dependent on where the property is found. Afterward, creator actualized Principal Component Analysis to lessen the dimensionality. The creator utilized four relapse procedures to foresee the value estimation of the property. The four methods are Linear Regression, Support Vector Machine, K-Nearest Neighbours’ (KNN) and Random Forest Regression and a troupe approach by consolidating KNN and Random Forest Technique. The outfit approach anticipated the costs with least blunder of 0.0985. Be that as it may, applying PCA did not improve the expectation error. A part of inquires about have been done on Artificial Neural Networks. This has helped numerous scientists concentrating on land issue to settle utilizing neural systems. In [8], the creator has thought about epicurean value model and ANN model that anticipate the house costs. Decadent value models are fundamentally used to figure the cost of any ware that are reliant on inner attributes just as outer qualities. The decadent model fundamentally includes relapse strategy that considers different parameters, for example, region of the property, age, number of rooms, etc. The Neural Network is prepared at first and the loads and inclinations of the edges and hubs individually are viewed as utilizing experimentation technique. Preparing the Neural Network model is a discovery technique. Be that as it may, the R-Squared an incentive for Neural Network model was more prominent contrasted with epicurean model and the RMSE estimation of Neural Network model was generally lower. Thus it is reasoned that Artificial Neural Network performs predominant than Hedonic model. Some scientists like that in [9] have utilized classifiers to anticipate the property estimations. The writer in research article [9] has gathered the information from Multiple Listing Service (MLS), verifiable home loans rates and government funded school evaluations. Land Data was acquired from Metropolitan Regional Information Systems (MRIS) database. The creator separated around 15,000 records from these three sources which included 76 factors. In this way, t-test was utilized to choose 49 factors as a primer screening.
III. METHODOLOGY
Approach speaks to a portrayal about the system that is embraced. It comprises of different achievements that should be accomplished so as to satisfy the target. We have embraced distinctive information mining and AI ideas. The accompanying diagram speaks to step-wise assignments that should be finished:
System Architecture
A. Data Collection
The dataset utilized in this task was an open source dataset from Kaggle Inc[10]. It comprises of 3000 records with 80 parameters that have the likelihood of influencing the property costs. Anyway out of these 80 parameters just 37 were picked which will undoubtedly influence the lodging costs. Parameters, for example, Area in square meters, Overall quality which rates the general condition and completing of the house, Location, Year in which house was fabricated, Numbers of Bedrooms and washrooms, Garage region and number of autos that can fit in carport, pool region, selling year of the house and Price at which house is sold. Selling cost is a needy variable on a few other free factors. A few parameters had numerical qualities and some were evaluations. These evaluations were changed over to numerical qualities. Following Table 1 speak to a concise portrayal about most significant parameters that influence the selling cost of the house.
Data
Collection
Pre-
Processing
Data
Analysis
Application
of
Algorithms
B. Data Pre-processing
It is a process of transforming the raw, complex data into systematic understandable knowledge. It involves the process of finding out missing and redundant data in the dataset. Entire dataset is checked for Nan and whichever observation consists of Nan will be deleted. Thus, this brings uniformity in the dataset. However in our dataset, there was no missing values found meaning that every record was constituted its corresponding feature values.
C. Data Analysis
Before applying any model to our dataset, we have to discover qualities of our dataset. Consequently, we have to dissect our dataset and concentrate the diverse parameters and connection between these parameters. We can likewise discover the exceptions present in our dataset. Anomalies happen because of some sort of trial blunders and they should be barred from the dataset. From the investigation we discovered that there exist one or two exceptions. The general pattern available to be purchased cost over various parameters.' GrLivArea' and 'TotalBsmtSF' appear to be directly related with 'Sale Price'. The general nature of the house and Area raises the deal cost of the house rises as well! In any case, Overall quality and number of restrooms are non-related and are free of one another. All out Basement Area and Ground Living Area are related to one another. There exists an outlier in every one of the diagrams of Total Basement Area. This outlier could be available because of exploratory mistakes and consequently that perception can be maintained a strategic distance from.
D. Application of Algorithms
When the information is perfect and we have picked up bits of knowledge about the dataset, we can apply a suitable AI model that accommodates our dataset. We have chosen four calculations to anticipate the reliant variable in our dataset. The calculations that we have chosen are essentially utilized as classifiers yet we are preparing them to foresee the ceaseless qualities. The four calculations are Logistic Regression, Support Vector Machine, Lasso Regression Technique and Decision Tree. These calculations were actualized with the assistance of python's SciKit-learn Library [10]. The anticipated yields got from these calculations were saved in comma isolated esteem document. This document was created by the code at run time.
E. Regression Analysis
The expectation model used in this research is libertine evaluating, the reasonable model utilizing relapse, with the standard recipe as appeared (1). The needy variable symbolized as Y is NJOP price and autonomous variables with image x1-x14consist of year, building zone, land territory, NJOP land price (IDR/m2), NJOP building price (IDR/m2), separation to focal point of the city, sum number of campuses, sum number of restaurants, sum number of health offices, sum number of amusement parks, sum number of educational offices, sum number of traditional markets, sum number of worship places, and ease to public transportations is appeared. Relapse is a major task in measurements and incorporates methods for demonstrating and examining a few factors at any given moment. Relapse investigation is utilized for clarifying the connection between a needy variable, typically signified by Y, and various autonomous factors, X1, X2,...., Xp. The autonomous factors are otherwise called indicator or informative variables.In most relapse examinations, the factors are thought to be constant. In straightforward relapse, there is just a single autonomous variable. Be that as it may, most genuine applications include more than one variable which impact the result variable. The model for Multiple Linear Regression can be spoken to as:
Given a data set of n statistical units, a linear regression model assumes that the relationship between the dependent variable y and the p-vector of regressors x is linear. This relationship is modeled through a disturbance term or error variable ε — an unobserved random variable that adds "noise" to the linear relationship between the dependent variable and regressors. Thus the model takes the form.
where β0is called intercept and βj are called slopes or regression coefficients. The difference between the predicted and the actual
where Yiis the actual value and εiis the error for the it observation. We write Xi,j for the jth predictor variable measured for the it
observation. The main assumptions for the errors εiis that E(εi) = 0 and var(εi) = ζ2.Also the εi are randomly distributed. The predicted value is also denoted. The various errors are given as:
SSE is the Sum of Squares of Error, SSR is the Sum of Squares of Regression, and SST is the Sum of Squares Total. R-square is the square of the correlation between the response values and the predicted response values. It is also called the square of the multiple correlation coefficients and the coefficient of multiple determinations. The coefficient of determination is the overall measure of the usefulness of a regression. The value of R2can range between 0 and 1, a higher value indicates a better model. In terms of the sample, the estimate of the population total variance (SST) is denoted by Mean Sum of Squares Total (MST). MST is obtained as SST/(n-1) where n is the sample size. Similarly, the estimated residual or error is called Mean Squared Error (MSE) and is calculated as, MSE= SSE/ (n-p-1) where n is the sample size, and p is the number of exploratory variables. A better estimate of the coefficient of determination is made by the Adjusted-R squared statistic: The F-testing one way Analysis of Variance (ANOVA) is also used as a statistic to find the goodness of fit of the model.
F. Artificial Neural Network
Architecture of Artificial Neural Network with three Layers
G. Decision Tree
Choice trees are viewed as the best and most generally utilized regulated learning calculation. This model can foresee the yield with at most accuracy and strength. It is utilized to foresee any sort of issues, for example, grouping or relapse. In any case, for our situation we need to foresee constant target esteem consequently our concern is of relapse type. In this model, the accessible dataset can be nonstop or clear cut. We utilize double tree that will recursively parcel the indicator vector into various subsets with the end
goal that our objective esteem ܡis progressively homogenous. represents the vector of indicators x=x1,x2,x3,...,xn.A choice tree with terminal hubs is utilized for imparting the grouping choice. A parameterᵠ=(ᵠ1,ᵠ2,ᵠ3,....ᵠt)associates the parameter esteem
ᵠi(i=1,2,3,...,t )with the it terminal hub. The parcelling system seeks through all estimations of indicator factors (vector of indicators) to locate the variable x that gives best segment into youngster hubs [13]. The best segment will be the one that limits the weighted variance. However one of the key difficulties in choice trees is over fitting. In the most pessimistic scenario, it will consider leaf hub for each esteem and in this manner give 100% exactness. In order to avert over fitting we can set imperatives on the extent of the tree or pruning the tree. The accompanying chart speaks to values anticipated by choice tree for our dataset:
H. Boosted Regression
Slope boosting is an AI procedure for relapse and characterization issues, which delivers an expectation model as a group of frail forecast models, normally choice trees. It constructs the model in a phase savvy style like other boosting techniques do, and it sums them up by permitting advancement of a self-assertive differentiable misfortune work.
The possibility of angle boosting started in the perception by Leo Breiman that boosting can be deciphered as a streamlining calculation on a reasonable expense function.[12] Explicit relapse slope boosting calculations were in this manner created by Jerome H. Friedman,[13][14] at the same time with the more broad useful angle boosting point of view of Llew Mason, Jonathan Baxter, Peter Bartlett and Marcus Frean.[15][16] The last two papers presented the perspective on boosting calculations as iterative utilitarian slope plummet calculations. That is, calculations that enhance a cost capacity over capacity space by iteratively picking a capacity (powerless speculation) that focuses in the negative angle course. This practical angle perspective on boosting has prompted the advancement of boosting calculations in numerous regions of AI and insights past relapse and order.
In many supervised learning problems one has an output variable y and a vector of input variables x described via a joint probability distribution. Using a training set of known values of x and corresponding values of y, the goal is to find an approximation to a function that minimizes the expected value of some specified loss function :
The gradient boosting method assumes a real-valued y and seeks an approximation in the form of a weighted sum of functions from some class , called base (or weak) learners:
In accordance with the empirical risk minimization principle, the method tries to find an approximation that minimizes the average value of the loss function on the training set, i.e., minimizes the empirical risk. It does so by starting with a model, consisting of a constant function, and incrementally expands it in a greedy fashion:
Unfortunately, choosing the best function h at each step for an arbitrary loss function L is a computationally infeasible optimization problem in general. Therefore, we restrict our approach to a simplified version of the problem. The idea is to apply a steepest descent step to this minimization problem. If we considered the continuous case, i.e. where is the set of arbitrary differentiable functions on , we would update the model in accordance with the following equations .
I. Forest Regression
Decision trees are a popular method for various machine learning tasks. Tree learning "come[s] closest to meeting the requirements for serving as an off-the-shelf procedure for data mining", say Hastie et al., "because it is invariant under scaling and various other transformations of feature values, is robust to inclusion of irrelevant features, and produces inspect able models. However, they are seldom accurate”. In particular, trees that are grown very deep tend to learn highly irregular patterns: they over fit their training sets, i.e. have low bias, but very high variance. Random forests are a way of averaging multiple deep decision trees, trained on different parts of the same training set, with the goal of reducing the variance. This comes at the expense of a small increase in the bias and some loss of interpretability, but generally greatly boosts the performance in the final model. The training algorithm for random forests applies the general technique of bootstrap aggregating, or bagging, to tree learners. Given a training set X = x1, ..., xn with responses Y = y1, ..., yn, bagging repeatedly (B times) selects a random sample with replacement of the training set and fits trees to these samples:
For b = 1, ..., B:
1) Sample, with replacement, n training examples from X, Y; call these Xb, Yb.
2) Train a classification or regression tree fb on Xb, Yb.
After training, predictions for unseen samples x' can be made by averaging the predictions from all the individual regression trees on
x':
or by taking the majority vote in the case of classification trees.
This bootstrapping procedure leads to better model performance because it decreases the variance of the model, without increasing the bias. This means that while the predictions of a single tree are highly sensitive to noise in its training set, the average of many trees is not, as long as the trees are not correlated. Simply training many trees on a single training set would give strongly correlated trees (or even the same tree many times, if the training algorithm is deterministic); bootstrap sampling is a way of de-correlating the trees by showing them different training sets. Additionally, an estimate of the uncertainty of the prediction can be made as the standard deviation of the predictions from all the individual regression trees on x':
The number of samples/trees, B, is a free parameter. Typically, a few hundred to several thousand trees are used, depending on the size and nature of the training set. An optimal number of trees B can be found using cross-validation, or by observing the out-of-bag error: the mean prediction error on each training sample xᵢ, using only the trees that did not have xᵢ in their bootstrap sample. The training and test error tend to level off after some number of trees has been fit.
IV. CONCLUSION
collaborating way utilizing increased reality [17].Also; a learning framework can be made which will assemble clients input and history with the goal that the framework can show the most appropriate outcomes to the client as indicated by is inclinations. We have utilized AI calculations to foresee the house costs. We have referenced the well ordered methodology to investigate the dataset and finding the connection between's the parameters. Subsequently we can choose the parameters which are not connected to one another and are autonomous in nature. These lists of capabilities were then given as a contribution to four calculations and a csv document was produced comprising of anticipated house costs. Henceforth we determined the execution of each model utilizing distinctive execution measurements and looked at them dependent on these measurements. We found that Decision Tree over fits our dataset and gives the most elevated exactness of 84.64%. Rope gives minimal exactness of 60.32%. Strategic Regression and Support Vector Regression giving a precision of 72.81% and 67.81% respectively Thus we infer that we actualized classifiers to the issue of relapse to check how well would classifier be able to fit to relapse problem [18].For future work, we suggest that chipping away at vast dataset would yield a superior and genuine picture about the model. We have embraced just couple of Machine Learning calculations that are really classifiers however we have to prepare numerous other classifiers and comprehend their anticipating conduct for ceaseless qualities as well. By improving the blunder esteems this examination work can be valuable for improvement of utilizations for different separate urban areas. The non-direct connection between house characteristics and house value, the absence of some ecological traits, and insufficient number of test size could be the reason for the poor execution of the libertine value models. In any case, it ought to be noticed that the ideal counterfeit neural system model is made by a preliminary and error strategy. Without this methodology, the outcomes may not show prevalence of the neural system model (Lenik et al., 1997).There is, in any case, a few restrictions in this paper. Initially, the house cost utilized isn't the real deal cost however the assessed cost because of the trouble in getting the genuine information from the market. Besides, this paper considered just the present year's data of the houses. The time impact of the house value, which could conceivably affect the assessed outcomes was disregarded (a similar house ought to have diverse cost in various years, expecting that age factor is consistent). At long last, the house cost could be influenced by some other financial elements, (for example, conversion scale and loan fee) are excluded in the estimation.
inclinations coming about because of the split in train and test subsets. Since we comprehend that the parameterization of the distinctive techniques can drive huge varieties in the execution, we have recognized a portion of the possibly most in fluencing parameters and tried diverse setups for those. We have likewise announced outcomes on the use of the calculations after information normalization. After preparing and assessing the models, we have completely considered the results, revealing discoveries on how extraordinary setups sway the execution. Results demonstrate that outperforming models are dependably those comprising of groups of relapse trees. In quantitative terms, we have
Appl. Sci.2018,8, 232122 of 24found that the littlest mean outright blunder is 338,715 Euros, and the best middle supreme mistake is94,850 Euros. These mistakes can be viewed as high inside the extent of budgetary venture, yet are relatively little under the way that information include just resources with qualities more than one million euros.In reality, when the mean and middle total blunders are contrasted and the mean and middle of the distribution of costs, relative blunders of 16.80% and 5.71% are acquired, individually. These error sare essentially littler than the ones given by a traditional direct relapse model, therefore highlighting the upside of progressively complex AI algorithms. In this sense, it merits referencing that the reality of the mean being a lot bigger than the median can be disclosed because of the nearness of anomalies. For instance, the most costly resource in the datasets estimated 90 million dollars, and, as indicated by the portrayal, it is a loft of 473 square meters with five rooms and five restrooms. This value is by all accounts extreme for a loft of such characteristics, implying that either the cost is a grammatical error and the benefit was sold at an a lot littler price, or the depiction has been recorded incorrectly. In either case, further investigation of forecast mistakes, notwithstanding including manual appraisal of experts, is left for future work which can serve for improving the nature of the database and accordingly of the trained AI models. In expansion, the assessment of a model constructed using k-closest neighbours with various neighbours smaller than five was left for future work. Also, further examination on the effect of normalization in the execution of the multi-layer perception can be of enthusiasm, since results revealed in this paper seems to repudiate the basic conviction that standardization counteracts numerical shakiness and can ease quicker combination. At long last, the effect of various models in both relapse blunder and time are worth exploring. Another field for potential research includes the utilization of profound taking in methods for extracting relevant highlights from normal language portrayals. Up until now, these information have not been given toes, however all adverts have a portrayal presented by the dealer depicting the home at deal. A feature vector separated from this content, for instance by utilizing convolution neural systems with temporal components, could increase the value of the highlights definitely known. Pertinent profound learning techniques for this reason have been as of late overviewed by Liu et al. [19].Another potential future research line lies in the utilization of grouping methods all together to differentiate between market portions. At the end of the day, in a greater example, the conjunction, inside the same test, of a few market fragments could determine in inconsistent evaluations in the event that they are completely neglected. On those cases, a two-advance procedure would be suitable in which, first, we portion the sample and, at long last, we apply the relapse models as proposed in this work. Additionally, in this paper, we have moved toward the issue of distinguishing investment opportunities as a relapse issue comprising of the estimation of the genuine evaluation of the assets. Nonetheless, in the event that this valuation were done physically, at that point the issue could be handled as a twofold arrangement issue, where the goal is decide if the advantage itself is an speculation opportunity; for instance, if the deal cost were littler than the valuation price. Finally, in this work, we have compelled to the examination of a preview of the land marketing a six-month time frame. Be that as it may, it could be fascinating to think about the demonstrating of time arrangement for prediction, since, now and again, it has been demonstrated that worldly data can improve substantially the expectation execution [20]. Investigating this examination line is likewise proposed as a future work.
REFERENCES
[1] R. J. Shiller, “Understanding recent trends in house prices and home ownership,” National Bureau of Economic Research, Working Paper 13553, Oct. 2007. DOI: 10.3386/w13553. [Online]. Available: http://www.nber.org/papers/w13553.
[2] R. A. Rahadi, S. K. Wiryono, D. P. Koesrindartotoor, and I.B. Syamwil, ―Factors influencing the price of housing in Indonesia,‖ Int. J. Hous. Mark.
Anal., vol. 8, no. 2, pp. 169–188, 2015.
[3] V. Limsombunchai, ―House price prediction: Hedonic price model vs. artificial neural network,‖ Am. J. ..., 2004.
[4] D. X. Zhu and K. L. Wei, ―The Land Prices and Housing Prices ——Empirical Research Based on Panel Data of 11 Provinces and Municipalities in Eastern China,‖ Int. Conf. Manag. Sci. Eng., no. 2009, pp. 2118–2123, 2013.
[5] S. Kisilevich, D. Keim, and L. Rokach, ―A GIS-based decision support system for hotel room rate estimation and temporal price prediction: The hotel brokers’ context,‖ Decis. Support Syst., vol. 54, no. 2, pp. 1119–1133, 2013.
[6] C. Y. Jim and W. Y. Chen, ―Value of scenic views: Hedonic assessment of private housing in Hong Kong,‖ Landsc. Urban Plan., vol. 91, no. 4, pp. 226–234, 2009.
Economics Society Conference. 2004.
[8] Park, Byeonghwa, and Jae Kwon Bae. "Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data."Expert Systems with Applications 42.6 (2015): 2928-2934.
[9] Bhuriya, Dinesh, et al. "Stock market predication using a linear regression." Electronics, Communication and Aerospace Technology (ICECA), 2017 International conference of.Vol. 2.IEEE, 2017.
[10] https://www.kaggle.com/ohmets/feature-selection-for-regression/data
[11] Pedregosa, Fabian, et al. "Scikit-learn: Machine learning in Python." Journal of machine learning research 12.Oct (2011): 2825-2830. [12] Breiman, L. (June 1997). "Arcing The Edge" (PDF). Technical Report 486. Statistics Department, University of California, Berkeley. [13] Friedman, J. H. (February 1999). "Greedy Function Approximation: A Gradient Boosting Machine" (PDF).
[14] Friedman, J. H. (March 1999). "Stochastic Gradient Boosting" (PDF).
[15] Mason, L.; Baxter, J.; Bartlett, P. L.; Frean, Marcus (1999). "Boosting Algorithms as Gradient Descent" (PDF). In S.A. Solla and T.K. Leen and K. Müller (ed.). Advances in Neural Information Processing Systems 12. MIT Press. pp. 512–518.
[16] Mason, L.; Baxter, J.; Bartlett, P. L.; Frean, Marcus (May 1999). "Boosting Algorithms as Gradient Descent in Function Space" (PDF). [17] R. T. Azuma et al.. “A survey of augmented reality,” Presence, vol. 6, no. 4, pp. 355-385, 1997.
[18] Torgo,Luis, and Joao Gama. "Regression using classification algorithms." Intelligent Data Analysis 1.4 (1997): 275-292.
[19] Liu, W.; Wang, Z.; Liu, X.; Zeng, N.; Liu, Y.; Alsaadi, F. A Survey of Deep Neural Network Architectures andtheir Applications.Neurocomputing2017,243, 11–26.