Essay 3: Combined Markov and hidden Markov model in stock price movement
5.2 Future works
5.2.2 Analysis of high–frequency financial data using non–linear models
returns and, in Essay 2, we utilised weekly data to build a trading strategy or portfolio. The main reason that we used daily/weekly data is that we needed to decrease the trade number to account for the trading fees.
News information and stock prices are high–frequency data. Recent research has used ANNs or HMMs to analyse such high–frequency financial data. For instance, Lahmiri (2014) presents a forecasting model that integrates the discrete wavelet transforming and backpropagation neural networks for predicting financial time series. The model uses low– and high–frequency components, obtained through the decomposition of the financial time series data by discrete wavelet transformation, as input variables for forecast future stock prices. Arévalo et al. (2016) use deep neural networks to forecast the next one–minute average price. The deep neural networks model is trained on the current time (hour and minute) as well as the n–lagged one–minute
124
pseudo–returns, price standard deviations and trend indicators. Cartea and Jaimungal (2013) employ an HMM to examine how the intraday dynamics of the stock market have changed and how to use this information to develop trading strategies at high frequencies.
These existing studies use high–frequency financial data, only considering stock prices to predict stock price movement. In our future work, we will consider high–frequency news information and stock prices to build a prediction model based on neural networks or HMMs.
125
Appendix: RavenPack News Analytics (RPNA)
In this research, we use data on public news from RPNA, Edition 3.0. RavenPack (http://www.ravenpack.com/) is one of the most well–known providers of news analytics data. Another well–known provider is Thomson Reuters News Analytics (TRNA). News analytics is a relatively new tool based on AI and designed to improve the understanding of news events.RPNA collect corporate news items (from the year 2000) from all public sources, including the Dow Jones, Barron’s and the Wall Street Journal. There are two categories of editions: Global Equities and Global Macro. Global Equities editions contain only entities that are classified as companies. These companies include over 40,000 listed stocks from the world’s equity markets, which spread across the Americas, Europe and the Asia–Pacific. Global Macro editions contain all entities that are not classified as companies. RavenPack analyses news on over 200 economies, delivering data on more than 138,000 places, 2,500 financially relevant organisations, 155 currencies and 82 commodities. In this thesis, we use the Global Equities editions dataset.
There are 39 fields used to describe each news item in the Global Equities editions (and 27 fields per news item in the Global Macro editions); these include the time stamp, the entity ID, the company name, the relevance of the news, the event category, the event sentiment, the novelty of the news, the composite sentiment score of the news, the story event count and the story ID. Some of these fields, such as event category and event sentiment, are generated by RPNA through content analysis.
126
The ‘time stamp’ is used to record the UTC at which the news occurred. RavenPack uses the ‘relevance’ variable to differentiate between news items where the corporation is the main object of the original news source and news items where the name of the corporation is mentioned only tangentially. The relevance variable attributes values from 0 to 100 and a score of 100 indicates that an article is highly relevant.
For a news story with a relevance score of 100, the ENS represents its novelty value. The first story to report a categorised event will receive a novelty score of 100. The novelty scores of subsequent stories about the same event will follow a decay function (i.e., 100, 75, 56, 42, 32, 24, 18, 13, 10, 8, 6, 4, 3, 2, 2, 1, 1, 1, 1, 0 …).
The ESS measures whether a particular news item contains favourable or unfavourable information about the underlying corporation. This variable represents the news sentiment for a given entity, ranging from 0 to 100, where 0 indicates extremely negative news, 50 indicates neutral news and 100 indicates extremely positive news.
In terms of the sentiment, RavenPack uses a proprietary computational linguistic analysis algorithm to quantify positive and negative perceptions of facts and opinions reported in the textual content of the news (Shi et al., 2016a). The core of this algorithm can be divided into two steps. First, a group of financial experts manually tag a set of stories and build up a historical database of words, phrases, combinations and other word–level definitions that have affected the target company, market or asset class. Then, the text in the specific news story is compared with the historical database and the
127
sentiments score is generated by automated computer classification using a Bayes Classifier.
Many researchers use RPNA to analyse the financial market. For instance, Smales (2014) examines the market reaction of leading Australian stocks to stock–specific news flow over an extended period. The study concurs with previous literature that news items are critically relevant to identifying significant effects. Akbas et al. (2016) find that high short interest is predictive of negative public news based on RPNA. Shi et al. (2016a) analyse how the hourly return volatility of S&P100 stocks from 2000 to 2010 are linked to the various linguistics–based sentiment scores of the news releases.
128
Bibliography
ABU-MOSTAFA, Y. S. & ATIYA, A. F. 1996. Introduction to financial forecasting. Applied Intelligence, 6, 205-213.
AGA, M. & KOCAMAN, B. 2008. Efficient market hypothesis and emerging capital markets: empirical evidence from Istanbul stock exchange. International Research Journal of Finance and Economics, 13, 131-144.
AGHABOZORGI, S. & TEH, Y. W. 2014. Stock market co-movement assessment using a three-phase clustering method. Expert Systems with Applications, 41,
1301-1314.
AKBAS, F., BOEHMER, E., ERTURK, B. & SORESCU, S. 2016. Short Interest, Returns, and Unfavorable Fundamental Information. Financial Management.
AKTAS, N., DE BODT, E. & VAN OPPENS, H. 2008. Legal insider trading and market efficiency. Journal of Banking & Finance, 32, 1379-1392.
AL-ANASWAH, N. & WILFLING, B. 2011. Identification of speculative bubbles using state-space models with Markov-switching. Journal of Banking & Finance, 35, 1073-1086.
ALAM, M. M. & UDDIN, M. G. S. 2009. Relationship between interest rate and stock price: empirical evidence from developed and developing countries. International journal of business and management, 4, 43.
ALANYALI, M., MOAT, H. S. & PREIS, T. 2013. Quantifying the relationship between financial news and the stock market. Sci. Rep, 3, 3578.
ALIZADEH, A. & NOMIKOS, N. 2004. A Markov regime switching approach for hedging stock indices. Journal of Futures Markets, 24, 649-674.
ALLEN, D. E., MCALEER, M. J. & SINGH, A. K. 2015. Machine news and volatility: the Dow Jones industrial average and the TRNA real-time high-frequency sentiment series. The Handbook of High Frequency Trading, 327-344.
AMIHUD, Y. & WOHL, A. 2004. Political news and stock prices: The case of Saddam Hussein contracts. Journal of banking & Finance, 28, 1185-1200.
ANDREÃO, R. V., DORIZZI, B. & BOUDY, J. 2006. ECG signal analysis through hidden Markov models. IEEE Transactions on Biomedical engineering, 53,
1541-1549.
ANG, A. & BEKAERT, G. 2002. Regime switches in interest rates. Journal of Business & Economic Statistics, 20, 163-182.
ANN, W. K., SEQUEIRA, J. M. & MCALEER, M. 2005. Modelling the information content in insider trades in the Singapore exchange. Mathematics and
129
Computers in Simulation, 68, 417-428.
ANTWEILER, W. & FRANK, M. Z. 2004. Is all that talk just noise? The information content of internet stock message boards. The Journal of Finance, 59, 1259- 1294.
ARÉVALO, A., NIÑO, J., HERNÁNDEZ, G. & SANDOVAL, J. High-frequency trading strategy based on deep neural networks. International conference on intelligent computing, 2016. Springer, 424-436.
ASAFU-ADJAYE, J. 2000. The relationship between energy consumption, energy prices and economic growth: time series evidence from Asian developing countries. Energy economics, 22, 615-625.
ASGHARIAN, H., HOLMFELDT, M. & LARSON, M. 2011. An event study of price movements following realized jumps. Quantitative Finance, 11, 933-946.
ATSALAKIS, G. S. & VALAVANIS, K. P. 2009. Surveying stock market forecasting techniques–Part II: Soft computing methods. Expert Systems with Applications, 36, 5932-5941.
AWOKUSE, T. O. 2007. Causality between exports, imports, and economic growth: Evidence from transition economies. Economics Letters, 94, 389-395.
BAHRAMMIRZAEE, A. 2010. A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems. Neural Computing and Applications, 19, 1165-1195.
BALKE, N. S. & WOHAR, M. E. 2002. Low-frequency movements in stock prices: A state-space decomposition. Review of Economics and Statistics, 84, 649-667.
BARBERIS, N., SHLEIFER, A. & VISHNY, R. 1998. A model of investor sentiment. Journal of financial economics, 49, 307-343.
BEALE, M. H., HAGAN, M. T. & DEMUTH, H. B. 2015. Neural network toolbox user’s guide.
BENEISH, M. D. & GARDNER, J. C. 1995. Information Costs and Liquidity Effects from Changes in the Dow Jones Industrial Average List. Journal of Financial and Quantitative Analysis, 30, 135-157.
BERA, A. K. & PARK, S. Y. 2008. Optimal portfolio diversification using the maximum entropy principle. Econometric Reviews, 27, 484-512.
BESSEMBINDER, H. & CHAN, K. 1995. The profitability of technical trading rules in the Asian stock markets. Pacific-Basin Finance Journal, 3, 257-284.
BETTMAN, J., HALLETT, A. & SAULT, S. 2010. Rumortrage: Can Investors Profit on Takeover Rumors on Internet Stock Message Boards? Social Science Research Network Working Paper Series.
130
management: assumptions and unintended consequences. Journal of Sustainable Finance & Investment, 3, 17-37.
BILMES, J. A. 1998. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models. International Computer Science Institute, 4, 126.
BOLLEN, J., MAO, H. & ZENG, X. 2011. Twitter mood predicts the stock market. Journal of Computational Science, 2, 1-8.
BOLLEN, N. P. 1998. Valuing options in regime-switching models. The Journal of Derivatives, 6, 38-49.
BOLLEN, N. P., GRAY, S. F. & WHALEY, R. E. 2000. Regime switching in foreign exchange rates:: Evidence from currency option prices. Journal of Econometrics, 94, 239-276.
BORGES, M. R. 2010. Efficient market hypothesis in European stock markets. The European Journal of Finance, 16, 711-726.
BOUBAKER, S., FARAG, H. & NGUYEN, D. K. 2015. Short-term overreaction to specific events: evidence from an emerging market. Research in international business and finance, 35, 153-165.
BRAV, A., GECZY, C. & GOMPERS, P. A. 2000. Is the abnormal return following equity issuances anomalous? Journal of Financial Economics, 56, 209-249.
BRIEC, W., KERSTENS, K. & JOKUNG, O. 2007. Mean-variance-skewness portfolio performance gauging: a general shortage function and dual approach. Management science, 53, 135-149.
BROTHERSON, W. T., EADES, K. M., HARRIS, R. S. & HIGGINS, R. C. 2015. 'Best Practices' in Estimating the Cost of Capital: An Update. Journal of Applied Finance, 23, 19.
BUFFINGTON, J. & ELLIOTT, R. J. 2002. American options with regime switching. International Journal of Theoretical and Applied Finance, 5, 497-514.
CALAFIORE, G. C. 2008. Multi-period portfolio optimization with linear control policies. Automatica, 44, 2463-2473.
CAO, Q., LEGGIO, K. B. & SCHNIEDERJANS, M. J. 2005. A comparison between Fama and French's model and artificial neural networks in predicting the Chinese stock market. Computers & Operations Research, 32, 2499-2512.
CARTEA, Á. & JAIMUNGAL, S. 2013. Modelling asset prices for algorithmic and high-frequency trading. Applied Mathematical Finance, 20, 512-547.
CELIKYURT, U. & ÖZEKICI, S. 2007. Multiperiod portfolio optimization models in stochastic markets using the mean–variance approach. European Journal of Operational Research, 179, 186-202.
131
CERVELLÓ-ROYO, R., GUIJARRO, F. & MICHNIUK, K. 2015. Stock market trading rule based on pattern recognition and technical analysis: Forecasting the DJIA index with intraday data. Expert Systems with Applications, 42, 5963- 5975.
CHAN, K. C., GUP, B. E. & PAN, M.-S. 1997. International Stock Market Efficiency and Integration: A Study of Eighteen Nations. Journal of Business Finance & Accounting, 24, 803-813.
CHAN, W. S. 2003. Stock price reaction to news and no-news: drift and reversal after headlines. Journal of Financial Economics, 70, 223-260.
CHANG, P.-C., WANG, D.-D. & ZHOU, C.-L. 2012. A novel model by evolving partially connected neural network for stock price trend forecasting. Expert Systems with Applications, 39, 611-620.
CHANG, T.-S. 2011. A comparative study of artificial neural networks, and decision trees for digital game content stocks price prediction. Expert Systems with Applications, 38, 14846-14851.
CHARLES, A. & DARNÉ, O. 2014. Large shocks in the volatility of the Dow Jones Industrial Average index: 1928–2013. Journal of Banking & Finance, 43, 188- 199.
CHAWLA, G. K. 2014. Estimating Cost of Capital in Today's Economic Environment. Journal of Business and Behavior Sciences, 26, 102.
CHEN, L., DA, Z. & ZHAO, X. 2013. What Drives Stock Price Movements? Review of Financial Studies, 26, 841-876.
CHEN, M., MAO, S. & LIU, Y. 2014. Big Data: A Survey. Mobile Networks and Applications, 19, 171-209.
CHIOU-WEI, S. Z., CHEN, C.-F. & ZHU, Z. 2008. Economic growth and energy consumption revisited—evidence from linear and nonlinear Granger causality. Energy Economics, 30, 3063-3076.
CHRISTIE, A. A. 1982. The stochastic behavior of common stock variances: Value, leverage and interest rate effects. Journal of financial Economics, 10, 407-432.
CHUNG, S.-L., HUNG, C.-H. & YEH, C.-Y. 2012. When does investor sentiment predict stock returns? Journal of Empirical Finance, 19, 217-240.
CLEMENTS, M. P. & KROLZIG, H. M. 1998. A Comparison of the Forecast Performance of Markov‐switching and Threshold Autoregressive Models of US GNP. The Econometrics Journal, 1, 47-75.
COHEN, G. & CABIRI, E. 2015. Can technical oscillators outperform the buy and hold strategy? Applied Economics, 47, 3189-3197.
CREAMER, G. G. 2015. Can a corporate network and news sentiment improve portfolio optimization using the Black–Litterman model? Quantitative Finance,
132
15, 1405-1416.
CUTLER, D. M., POTERBA, J. M. & SUMMERS, L. H. 1989. What moves stock prices? The Journal of Portfolio Management, 15, 4-12.
DACCO, R. & SATCHELL, S. 1999. Why do regime‐switching models forecast so badly? Journal of Forecasting, 18, 1-16.
DAHLQUIST, M. & GRAY, S. F. 2000. Regime-switching and interest rates in the European monetary system. Journal of International Economics, 50, 399-419.
DE OLIVEIRA, F. A., NOBRE, C. N. & ZÁRATE, L. E. 2013. Applying Artificial Neural Networks to prediction of stock price and improvement of the directional prediction index–Case study of PETR4, Petrobras, Brazil. Expert Systems with Applications, 40, 7596-7606.
DEMIGUEL, V., GARLAPPI, L. & UPPAL, R. 2009. Optimal versus naive diversification: How inefficient is the 1/N portfolio strategy? Review of Financial Studies, 22, 1915-1953.
DEMPSTER, A. P., LAIRD, N. M. & RUBIN, D. B. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the royal statistical society. Series B (methodological), 1-38.
DICKEY, D. A. & FULLER, W. A. 1979. Distribution of the estimators for autoregressive time series with a unit root. Journal of the American statistical association, 74, 427-431.
DUEKER, M. J. 1997. Markov switching in GARCH processes and mean-reverting stock-market volatility. Journal of Business & Economic Statistics, 15, 26-34.
DURLAND, J. M. & MCCURDY, T. H. 1994. Duration-dependent transitions in a Markov model of US GNP growth. Journal of Business & Economic Statistics, 12, 279-288.
ELLIOTT, R. J., SIU, T. K. & BADESCU, A. 2010. On mean-variance portfolio selection under a hidden Markovian regime-switching model. Economic modelling, 27, 678-686.
ELTON, E. J. & GRUBER, M. J. 1997. Modern portfolio theory, 1950 to date. Journal of Banking & Finance, 21, 1743-1759.
ENGEL, C. 1994. Can the Markov switching model forecast exchange rates? Journal of International Economics, 36, 151-165.
ESTES, J. E., SAILER, C. & TINNEY, L. R. 1986. Applications of artificial intelligence techniques to remote sensing. The Professional Geographer, 38,
133-141.
FAMA, E. F. 1965. Random walks in stock market prices. Financial Analysts Journal, 21, 55-59.
133
FAMA, E. F. 1970. Efficient capital markets: A review of theory and empirical work. Journal of Finance, 25, 383-417.
FAMA, E. F. 1991. Efficient capital markets: II. The journal of finance, 46, 1575-1617.
FAN, K., SHEN, Y., SIU, T. K. & WANG, R. 2016. Pricing dynamic fund protection under hidden Markov models. IMA Journal of Management Mathematics, dpw014.
FERNANDEZ-RODRıGUEZ, F., GONZALEZ-MARTEL, C. & SOSVILLA-
RIVERO, S. 2000. On the profitability of technical trading rules based on artificial neural networks:: Evidence from the Madrid stock market. Economics letters, 69, 89-94.
FIESCHI, M. 2013. Artificial intelligence in medicine: Expert systems, Springer.
FLANNERY, M. J. & JAMES, C. M. 1984. The effect of interest rate changes on the common stock returns of financial institutions. The Journal of Finance, 39,
1141-1153.
FRANKS, J. R. & HARRIS, R. S. 1989. Shareholder Wealth Effects of Corporate Takeovers: The U.K. Experience 1955-1985. Journal of Financial Economics, 23 (2), 225-249
GILLAM, R. A., GUERARD, J. B. & CAHAN, R. 2015. News volume information: Beyond earnings forecasting in a global stock selection model. International Journal of Forecasting, 31, 575-581.
GÖÇKEN, M., ÖZÇALıCı, M., BORU, A. & DOSDOĞRU, A. T. 2016. Integrating
metaheuristics and artificial neural networks for improved stock price prediction. Expert Systems with Applications, 44, 320-331.
GOOGLE INC. 2015. Investor Relations.
GOZBASI, O., KUCUKKAPLAN, I. & NAZLIOGLU, S. 2014. Re-examining the Turkish stock market efficiency: Evidence from nonlinear unit root tests. Economic Modelling, 38, 381-384.
GRANGER, C. W. 1969. Investigating causal relations by econometric models and cross-spectral methods. Econometrica, 37, 424-438.
GRANGER, C. W. 1988. Some recent development in a concept of causality. Journal of econometrics, 39, 199-211.
GRANGER, C. W., HUANGB, B.-N. & YANG, C.-W. 2000. A bivariate causality between stock prices and exchange rates: evidence from recent Asianflu. The Quarterly Review of Economics and Finance, 40, 337-354.
GRAY, S. F. 1996. Modeling the conditional distribution of interest rates as a regime- switching process. Journal of Financial Economics, 42, 27-62.
134
mean–variance portfolio optimization. European Journal of Operational Research, 183, 981-1000.
GUPTA, A. Stock market prediction using Hidden Markov Models. 2012 Students Conference on Engineering and Systems (SCES), 16-18 March 2012 2012.
GURESEN, E., KAYAKUTLU, G. & DAIM, T. U. 2011. Using artificial neural network models in stock market index prediction. Expert Systems with Applications, 38, 10389-10397.
HAAPANEN, R. & EK, A. R. 2001. Software and instructions for kNN applications in forest resources description and estimation.
HAERI, A., HATEFI, S. M. & REZAIE, K. 2015. Forecasting about EURJPY exchange rate using hidden Markova model and CART classification algorithm. Journal of Advanced Computer Science & Technology, 4, 84.
HAJIZADEH, E., ARDAKANI, H. D. & SHAHRABI, J. 2010. Application of data mining techniques in stock markets: A survey. Journal of Economics and International Finance, 2, 109.
HALDRUP, N. & NIELSEN, M. Ø. 2006. A regime switching long memory model for electricity prices. Journal of econometrics, 135, 349-376.
HAMILTON, J. D. 1989. A new approach to the economic analysis of nonstationary time series and the business cycle. Econometrica: Journal of the Econometric Society, 357-384.
HAN, E.-H. S., KARYPIS, G. & KUMAR, V. Text categorization using weight adjusted k-nearest neighbor classification. Pacific-asia conference on knowledge discovery and data mining, 2001. Springer, 53-65.
HAN, Y., YANG, K. & ZHOU, G. 2013. A new anomaly: The cross-sectional profitability of technical analysis. Journal of Financial and Quantitative Analysis, 48, 1433-1461.
HARRIS, L. 1991. Stock price clustering and discreteness. Review of financial studies, 4, 389-415.
HASSAN, M. R. & NATH, B. Stock market forecasting using hidden Markov model: a new approach. 5th International Conference on Intelligent Systems Design and Applications (ISDA'05), 2005. IEEE, 192-196.
HASSAN, M. R., NATH, B. & KIRLEY, M. 2007. A fusion model of HMM, ANN and GA for stock market forecasting. Expert Systems with Applications, 33, 171- 180.
HENRIKSEN, P. N. 2011. Pricing barrier options by a regime switching model. Quantitative Finance, 11, 1221-1231.
HESTON, S. L. & SINHA, N. R. 2017. News vs. Sentiment: Predicting Stock Returns from News Stories. Financial Analysts Journal, 73, 1-17.
135
HIEMSTRA, C. & JONES, J. D. 1994. Testing for Linear and Nonlinear Granger Causality in the Stock Price‐Volume Relation. The Journal of Finance, 49,