Further developments - Latent Markovian Modelling and Clustering for Continuous Data Sequences

The implementation of an universal R package able to treat and cluster all kind of longitudinal data using any specification of the HMTD model (including HMM, DCMM, MTD, Mixture models etc.) will be a very useful tool. As this thesis has shown, HMTD may be a very good alternative to GMM and the other clustering methods. The package for continuous data will be released soon.

On the basis of preliminary trials (that are not included in this thesis), the HMTD model seems to copes well with discrete data too. However, it would be interesting to compare more extensively the performance of the HMTD with DCMM (R package MARCH) when clustering discrete data. Even though DCMM is a model that is specif-ically designed for discrete data, chances are that HMTD could be as good as it for this type of data, because many discrete distributions can be approximated by continuous ones.

The estimation procedure remains demanding in terms of computational time. Ac-celerating the convergence of the model in its full form could open new possibilities for treating larger datasets with higher number of covariates.

Another interesting point is the further study of the di↵erent flexible clustering possibilities, as well as their application using latent level covariates together with the visible ones. The combination of simultaneous clustering and modelling may be very attractive for many social studies involving di↵erent life course trajectories as discussed before. The possibility to identify general groups of persons that evolve di↵erently by simultaneously estimating and modelling di↵erently their latent trajectories, may represent an innovative and useful tool in various domains.

Finally, it would be interesting to see how the HMTD model performs in various

7.6. FURTHER DEVELOPMENTS 171 other fields and for di↵erent purposes. An interesting application may be to identify sequences whose distribution may not be appropriate according to the nature of the phenomenon of interest. As a small illustration in social sciences, we can imagine a longitudinal study that concerns a sensible subject to which the respondents may be afraid to answer and prefer to conform to the norm. In this case, one cluster of the model may capture the trajectories whose auto-dependence structure deviates from the others (randomly or incorrectly answered questions). In other fields, one may similarly identify errors due to the person in charge of collecting survey data, or a faulty measurement tool for instance. Besides ordinary clustering, these are only a few of the many possible applications of the HMTD model.

Bibliography

[1] Abbott A, Tsay A. (2000) Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect Sociological Methods Research, August Vol. 29 No. 1, 3-33

[2] Aghabozorgi, S., Shirkhorshidi, A. S., & Wah, T. Y. (2015). Time-series clustering:

A decade review. Information Systems, 53, 16-38. Chicago

[3] Aitkin, M. (2001). Likelihood and Bayesian analysis of mixtures.Statistical Mod-elling, 1(4), 287-304.

[4] Altman, R. M. (2007). Mixed hidden Markov models: an extension of the hidden Markov model to the longitudinal data setting.Journal of the American Statistical Association, 102(477), 201-210.

[5] Antoniadis, A., Brossat, X., Cugliari, J., & Poggi, J. M. (2013). Clustering func-tional data using wavelets.Internafunc-tional Journal of Wavelets, Multiresolution and Information Processing, 11(01), 1350003.

[6] Bank, J., & Cole, B. (2008). Calculating the Jaccard similarity coefficient with map reduce for entity pairs in wikipedia.Wikipedia Similarity Team, 1-18.

[7] Barrense-Dias Y, Berchtold A, Akre C, Sur´ıs JC (2015) The relation between In-ternet use and overweight among adolescents: a longitudinal study in Switzerland.

International Journal of Obesity 40: 45-50.

[8] Bauer, D., and Curran, P. (2003) ”Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes.” Psychological methods 8.3: 338.

[9] Begleiter R., El-Yaniv R., Yona G. (2004) On Prediction Using Variable Order Markov Models Journal of Artificial Intelligence Research 22 p.385-421

173

[10] Bendtsen, C. (2012) pso: Particle swarm optimization. R package version 1.0.3.

Available onhttps://cran.r-project.org/web/packages/pso/index.html.

[11] Berchtold A. (1995) Autoregressive Modelling of Markov Chains. In Proceedings of the 10th International Workshop on Statistical Modelling,Springer-Verlag, New York pp. 19-26

[12] Berchtold A. (1998) Chaˆınes de Markov et Mod`eles de Transition: Applications aux Sciences Sociales.Editions HERMES,Paris

[13] Berchtold A. (1999) The Double Chain Markov Model. Communications in Statis-tics: Theory and Methods, 28(11), 2569-2589.

[14] Berchtold A. (2001) Estimation in the mixture transition distribution Model. Jour-nal of Time Series AJour-nalysis 22(4): 379-397.

[15] Berchtold, A. (2003) Mixture transition distribution (MTD) modelling of het-eroscedastic time series. Computational statistics and data analysis41(3): 399-411.

[16] Berchtold, A., Jeannin, A., Akre, C., Michaud, P.-A. & Surs, J.-C. (2010) First use of multiple substances: Identification of meaningful patterns. Journal of Substance Use 15: 118-130.

[17] Berchtold, A. and Raftery, A. (2002) The mixture transition distribution model for high-order Markov chains and non-Gaussian time series. Statistical Science 17(3):

328-356.

[18] Berchtold A., Sackett G. (2002) Markovian Models for the Developmental Study of Social Behavior. American Journal of Primatology, 58 (3), 149-167.

[19] Berchtold A., Sur´ıs J.-C., Meyer T. and Taushanov Z. (2017) Development of somatic complaints among adolescents and young adults in Switzerland. Accepted for publication in the Swiss Journal of Sociology.

[20] Bertoletti, M., Friel, N., & Rastelli, R. (2015). Choosing the number of clusters in a finite mixture model using an exact integrated completed likelihood criterion.

Metron, 73(2), 177-199.

[21] Biernacki, C., Celeux, G., & Govaert, G. (2000). Assessing a mixture model for clustering with the integrated completed likelihood. IEEE transactions on pattern analysis and machine intelligence, 22(7), 719-725.

BIBLIOGRAPHY 175 [22] Biernacki, C., Celeux, G., & Govaert, G. (2010). Exact and Monte Carlo calcu-lations of integrated likelihoods for the latent class model. Journal of Statistical Planning and Inference, 140(11), 2991-3002.

[23] Bishop, C. (2007). Pattern Recognition and Machine Learning (Information Science and Statistics), 1st edn. 2006. corr. 2nd printing edn.Springer, New York.

[24] Bolano D. and Berchtold A. (2016) General framework and model building in the class of Hidden Mixture Transition Distribution models. Computational Statistics and Data Analysis 93: 131-145.

[25] Boussau B., Gueguen L. et Gouy M. (2009)A Mixture Model and a Hidden Markov Model to Simultaneously Detect Recombination Breakpoints and Reconstruct Phy-logenies, Evolutionary Bioinformatics, 5 p.67-79

[26] B¨uhlmann P., Wyner A.J. (1999) Variable Length Markov Chains. The Annals of Statistics, 27, 480-513.

[27] Byrd, R. H., Lu, P., Nocedal, J. and Zhu, C. (1995) A limited memory algorithm for bound constrained optimization. SIAM Journal on Scientific Computing 16(5):

1190-1208.

[28] Celeux, G., Chauveau, D., & Diebolt, J. (1995). On Stochastic Versions of the EM Algorithm. [Research Report] RR-2514,INRIA. 1995. inria-00074164

[29] Celeux, G., & Govaert, G. (1995). Gaussian 9parsimonious clustering models Pat-tern recognition, 28(5), 781-793.

[30] Celeux, G., Hurn, M. and Robert, C. P. (2000). Computational and inferential difficulties with mixture posterior distributions.Journal of the American Statistical Association, 95(451), 957-970.

[31] Celeux, G., Martin, O., & Lavergne, C. (2005). Mixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments.Statistical Modelling, 5(3), 243-267.

[32] Celeux, G. On the di↵erent ways to compute the Integrated Completed Likelihood Criterion, http://convegni.unica.it/cladag2015/files/2015/10/Celeux.pdf

[33] Cerny, V. (1985) Thermodynamical approach to the travelling salesman problem:

an efficient simulation algorithm. Journal of Optimization Theory and Applications 45: 41-51.

[34] Chariatte V., Berchtold A., Akr C., Michaud P.A., Suris J.C. (2008) Missed Ap-pointments in an Outpatient Clinic for Adolescents, an Approach to Predict the Risk of Missing. Journal of Adolescent Health, 43:38-45.

[35] Cherif, A., Cardot, H., & Bon, R. (2011). SOM time series clustering and prediction with recurrent neural networks.Neurocomputing, 74(11), 1936-1944.

[36] Chiou, J. M., & Li, P. L. (2007). Functional clustering and identifying substructures of longitudinal data.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 69(4), 679-699.

[37] Ciampi, A., Campbell, H., Dyachenko, A., Rich, B., McCusker, J., & Cole, M.

G. (2012). Model-based clustering of longitudinal data: Application to model-ing disease course and gene expression trajectories.Communications in Statistics-Simulation and Computation, 41(7), 992-1005.

[38] Co↵ey, N., Hinde, J., & Holian, E. (2014). Clustering longitudinal profiles us-ing P-splines and mixed e↵ects models applied to time-course gene expression data.Computational Statistics & Data Analysis, 71, 14-29.

[39] Cron, A. J., and West, M. (2011). Efficient classification-based relabeling in mix-ture models.The American Statistician, 65(1), 16-20.

[40] Davidson, I., & Satyanarayana, A. (2003, November). Speeding up k-means clus-tering by bootstrap averaging.In IEEE data mining workshop on clusclus-tering large data sets.

[41] Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm.Journal of the royal statistical society. Series B (methodological), 1-38.

[42] Der Kiureghian, A., & Ditlevsen, O. (2009). Aleatory or epistemic? Does it mat-ter?. Structural Safety, 31(2), 105-112.

[43] Dey, M., Jorm, A. F., & Mackinnon, A. J. (2015). Cross-sectional time trends in psychological and somatic health complaints among adolescents: a structural

BIBLIOGRAPHY 177 equation modelling analysis of Health Behaviour in School-aged Children: data from Switzerland.Social Psychiatry and Psychiatric Epidemiology, 50(8), 1189-1198. https://doi.org/10.1007/s00127-015-1040-3

[44] Dias, J. G. & Vermunt, J. K. (2006). Bootstrap methods for measuring classifica-tion uncertainty in latent class analysis. InCompstat 2006-Proceedings in Compu-tational Statistics (pp. 31-41). Physica-Verlag HD.

[45] Dias, J. G., & Wedel, M. (2004). An empirical comparison of EM, SEM and MCMC performance for problematic Gaussian mixture likelihoods.Statistics and Comput-ing, 14(4), 323-332.

[46] Efron, B. (1979). Bootstrap methods: another look at the jackknife.The annals of Statistics, 1-26.

[47] Efron, B. and Tibshirani, R. J. (1994). An introduction to the bootstrap.CRC [48] Elbeltagi, E., Hegazy, T. and Grierson, D. (2005) Comparison among five

evolutionary-based optimization algorithms. Advanced Engineering Informatics 19:

43-53.

[49] Fang, L., Chen, P. and Liu S. (2007) Particle swarm optimization with simulated annealing for TSP. Proceedings of the 6th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases (AIKED 07).

[50] Fang, Y. and Wang, J. (2012). Selection of the number of clusters via the bootstrap method. Computational Statistics & Data Analysis, 56(3), 468-477.

[51] Faraci P, Craparo G, Messina R, Severino S (2013) Internet Addiction Test (IAT):

Which is the Best Factorial Solution?. J Med Internet Res. 15(10): e225.

[52] Felsenstein J. and Churchill G. (1996) A Hidden Markov Model Approach to Vari-ation Among Sites in Rate of Evolution, Mol. Biol. Evol. 13(1) p.93-104

[53] Francis, B. and Liu, J. (2015) Modelling escalation in crime seriousness: a latent variable approach. Metron 73.2 : 277-297.

[54] Friedman, J., Hastie, T. and Tibshirani, R. (2001). The elements of statistical learning (Vol. 1). Springer, Berlin: Springer series in statistics.

[55] Gabadinho, A., & Ritschard, G. (2016). Analyzing state sequences with probabilis-tic suffix trees: the PST R package. Journal of Statisprobabilis-tical Software, 72(3), 1-39.

doi:doi:10.18637/jss.v072.i03

[56] Gelfand, A. E., & Smith, A. F. (1990). Sampling-based approaches to calculating marginal densities. Journal of the American statistical association, 85(410), 398-409.

[57] Genolini, C., & Falissard, B. (2010). KmL: k-means for longitudinal data.Computational Statistics, 25(2), 317-328.

[58] Grob, A., Lthi, R., Kaiser, F. G., Flammer, A., Mackinnon, A., & Wearing, A. J.

(1991). Berner Fragebogen zum Wohlbefinden Jugendlicher (BFW).Diagnostica, 37(1), 66-75.

[59] Gruet, M. A., Philippe, A., & Robert, C. P. (1999). MCMC control spreadsheets for exponential mixture estimation.Journal of Computational and graphical Statistics, 8(2), 298-317.

[60] Gr¨un, B., & Leisch, F. (2004). Bootstrapping finite mixture models.COMPSTAT 2004 Symposium

[61] Gr¨un, B., & Leisch, F. (2009). Dealing with label switching in mixture models under genuine multimodality.Journal of Multivariate Analysis, 100(5), 851-861.

[62] Hajjem, A., Bellavance, F., & Larocque, D. (2014). Mixed-e↵ects random forest for clustered data.Journal of Statistical Computation and Simulation, 84(6), 1313-1328.

[63] Helske S, Helske J, Eerola M (2018) Combining Sequence Analysis and Hidden Markov Models in the Analysis of Complex Life Sequence Data. In G Ritschard &

M Studer (eds),Sequence Analysis and Related Approaches: Innovative Methods and Applications. Berlin: Springer.

[64] Hennig, C. (2007). Cluster-wise assessment of cluster stability.Computational Statistics & Data Analysis, 52(1), 258-271.

[65] Hennig, C. (2016) Practical decision making in cluster analysis: Choice of method and evaluation of quality.Talk on the 22nd International Con-ference on Computational Statistic COMPSTAT 2016, Oviedo, Spain, http://www.compstat2016.org/docs/compstatvalidation.pdf?20160821232000

BIBLIOGRAPHY 179 [66] Hennig, C., Meila, M., Murtagh, F., & Rocci, R. (Eds.). (2016). Handbook of

cluster analysis.CRC Press.

[67] Holland, J. H. (1992) Genetic algorithms. Scientific American 267(1): 66-72.

[68] Jacques, J., & Preda, C. (2014). Functional data clustering: a survey.Advances in Data Analysis and Classification, 8(3), 231-255.

[69] Jasra A., Holmes C., & Stephens D. A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling.Statistical Science, 50-67.

[70] Jelenchick, L. A., Becker, T., & Moreno, M. A. (2012). Assessing the psychometric properties of the Internet Addiction Test (IAT) in US college students. Psychiatry research, 196(2), 296-301.

[71] Jin, Y. (2006). Multi-objective machine learning (Vol. 16).Springer Science & Busi-ness Media.

[72] Jones, B. L., Nagin, D. S., & Roeder, K. (2001). A SAS procedure based on mixture models for estimating developmental trajectories.Sociological methods & research, 29(3), 374-393.

[73] Jonsson, F., Hammarstr¨om, A., & Gustafsson, P. E. (2014). Social capital across the life course and functional somatic symptoms in mid-adulthood.Scandinavian Journal of Public Health. https://doi.org/10.1177/1403494814548749

[74] Jung, T. and Wickrama K.A.S. (2008) An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling Social and Personality Psychology Com-pass, Volume 2, Issue 1, pages 302?317

[75] Kemeny J.G. & Snell J.L. (1976) Finite Markov Chains. Springer-Verlag, New York

[76] Kemeny J.G., Snell J.L., Knapp A.W. (1976) Denumerable Markov Chains.

Springer-Verlag, New York

[77] Kennedy, J. and Eberhart, R. (1995) Particle swarm optimization. Proceedings of IEEE International Conference on Neural Networks IV: 1942-1948.

[78] Kennedy, M. C., & O’Hagan, A. (2001). Bayesian calibration of computer mod-els.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(3), 425-464.

[79] Khazaal Y, Billieux J, Thorens G, Khan R, Scarlatti E, Theintz F, Lederrey J, Van Der Linden M, Zullino D (2008) French validation of the Internet addiction test. Cyberpsychology Behavior 11(6):703-706.

[80] Kirkpatrick, S., Gelatt Jr, C. D. and Vecchi, M. P. (1983) Optimization by simu-lated annealing. Science 220: 671-680.

[81] Kohonen, T. (1998). The self-organizing map.Neurocomputing, 21(1), 1-6.

[82] Krogh, A., Larsson, B., Von Heijne, G. and Sonnhammer, E. L. (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. Journal of molecular biology, 305(3), 567-580.

[83] Lai, C. M., Mak, K. K., Cheng, C., Watanabe, H., Nomachi, S., Bahar, N., ...

& Griffiths, M. D. (2015). Measurement invariance of the internet addiction test among Hong Kong, Japanese, and Malaysian adolescents. Cyberpsychology, Behav-ior, and Social Networking, 18(10), 609-617.

[84] Leisch, F. (2016) Resampling Methods for Exploring Cluster Stability, In Hennig, C., Meila, M., Murtagh, F., & Rocci, R. (Eds.) Handbook of cluster analysis (Chapter 28).CRC Press.

[85] Leng, X., & M¨uller, H. G. (2006). Classification using functional data analysis for temporal gene expression data.Bioinformatics, 22(1), 68-76.

[86] Liaw, A., & Wiener, M. (2002). Classification and regression by randomForest.R news, 2(3), 18-22.

[87] Little, R. J., & Rubin, D. B. (2014). Statistical analysis with missing data (Vol.

333). John Wiley & Sons.

[88] MacDonald I. and Zucchini W. (1997) Hidden Markov and Other Models for Discrete-valued Time Series Chapman & Hall

[89] Marin, J. M., Mengersen, K., and Robert, C. P. (2005). Bayesian modelling and inference on mixtures of distributions.Handbook of statistics, 25, 459-507.

BIBLIOGRAPHY 181 [90] Maruotti, A. (2011). Mixed hidden Markov models for longitudinal data: an

overview.International Statistical Review, 79(3), 427-454.

[91] Matsumoto, M. and Nishimura, T. (1998) Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Transactions on Modeling and Computer Simulation 8(1): 3-30.

[92] McArdle, J.J. and Epstein, D. (1987) Latent growth curves within developmental structural equation models Child development (1987): 110-133.

[93] McLachlan G. et Krishnan T. (1997) The EM Algorithm and Extentions Wiley Series in Probability and Statistics 1997

[94] McLachlan, G., and Peel, D. (2004). Finite mixture models.John Wiley & Sons.

[95] McNicholas, P. D., & Murphy, T. B. (2010). Model-based clustering of longitudinal data. Canadian Journal of Statistics, 38(1), 153-168.

[96] Mehran F. (1989) Analysis of Discrete Longitudinal Data: Infinite-Lag Markov Models. In Statistical Data Analysis and Inference, pp. 533-541. (Edited by Y.

Dodge),Elsevier Science Publishers

[97] Meila, M. (2016) Criteria for Comparing Clusterings, In Hennig, C., Meila, M., Murtagh, F., & Rocci, R. (Eds.) Handbook of cluster analysis (Chapter 27).CRC Press.

[98] Mercer, R.E. and Sampson, J.R. (1978) Adaptive search using a reproductive meta-plan. Kybernetes 7(3): 215-228.

[99] Meyn S.P. et Tweedie R.L. (1993) Markov Chains and Stochastic Stability, Springer-Verlag

[100] Mohapatra, S., Deo, S. J. K., Satapathy, A., & Rath, N. (2014). Somatoform Disorders in Children and Adolescents.German Journal of Psychiatry, 17(1), 19-24.

[101] Mullen, K., Ardia, D., Gil, D., Windover, D. and Cline, J. (2011) DEoptim: an R package for global optimization by di↵erential evolution. Journal of Statistical Software 40(6): 1-26.

[102] Muth´en, B. ”Latent variable mixture modeling.” New developments and tech-niques in structural equation modeling (2001): 1-33.

[103] Muth´en, B. and Asparouhov T. Growth mixture modeling: Analysis with non-Gaussian random e↵ects.Longitudinal data analysis (2008): 143-165.

[104] Muth´en, B. and Shedden, K. (1999) Finite mixture modelling with mixture out-comes using the EM algorithm. Biometrics 55.2 : 463-469.

[105] Nagin, D. (1999) Analyzing developmental trajectories: a semiparametric, group-based approach. Psychological methods 4.2 139.

[106] Nelder, J.A. and Mead, R. (1965) A simplex method for function minimiza-tion.Computer Journal 7: 308-313.

[107] Ocone D. (2009) Markov Chains and Applications to Population Genet-ics,http://www.math.rutgers.edu/courses/338/coursenotes/markovchains.pdf [108] Pardo B. et Birmingham W. (2005) Modeling Form for On-line Following of

Mu-sical Performances, Proceedings of the Twentieth National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania, July 9-13, 2005

[109] Paterlini, S. and Krink, T. (2006). Di↵erential evolution and particle swarm opti-mization in partitional clustering. Computational statistics & data analysis, 50(5), 1220-1247.

[110] Aneiros-P´erez, G., Cao, R., & Vilar-Fern´andez, J. M. (2011). Functional methods for time series prediction: a nonparametric approach.Journal of Forecasting, 30(4), 377-392.

[111] Piguet C, Berchtold A, Zimmermann G and Sur´ıs JC (2016) Rapport fi-nal de l’étude longitudifi-nale [email protected]: Institut universitaire de médecine sociale et préventive. (Raisons de santé, 255).

[112] Premalatha, K. and Natarajan, A.M. (2009) Hybrid PSO and GA for global maximization. International Journal of Open Problems in Computer Science and Mathematics 2(4): 597-608.

[113] Proust-Lima, C., Philipps, V. and Liquet, B. (2015) Estimation of extended mixed models using latent classes and latent processes: the R package lcmm.

arXiv :1503.00890.

BIBLIOGRAPHY 183 [114] Pukkala, T. and Kurttila, M. (2005) Examining the performance of six heuristic optimization techniques in di↵erent forest planning problems. Silva Fennica 39(1):

67-80.

[115] Putnick, D. L., & Bornstein, M. H. (2016). Measurement invariance conventions and reporting: the state of the art and future directions for psychological research.

Developmental Review, 41, 71-90.

[116] R Core Team (2015) R: A language and environment for statistical computing.

R Foundation for Statistical Computing, Vienna, Austria. URL: https://www.R-project.org/.

[117] Rabiner, L. (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2): 257-286.

[118] Rabiner, L. R. and Juang, B. H. (1986). An introduction to hidden Markov models ASSP Magazine, IEEE, 3(1), 4-16.

[119] Raftery, A. (1985) A model for high-order Markov chains. Journal of the Royal Statistical Society, series B 47(3): 528-539.

[120] Raftery A (1995) Bayesian model selection in social research. Sociological Method-ology 25:111-163.

[121] Ram N, Grimm KJ. (2009) Growth Mixture Modeling: A Method for Identify-ing Di↵erences in Longitudinal Change Among Unobserved Groups. International journal of behavioral development 33(6):565-576.

[122] Rand, W. M. (1971). Objective criteria for the evaluation of clustering meth-ods.Journal of the American Statistical association, 66(336), 846-850.

[123] Reinecke, J. and Seddig, D. (2011) Growth mixture models in longitudinal re-search. AStA Advances in Statistical Analysis 95.4: 415-434.

[124] Richardson S., & Green P. J. (1997). On Bayesian analysis of mixtures with an unknown number of components (with discussion).Journal of the Royal Statistical Society: series B (statistical methodology), 59(4), 731-792.

[125] Rissanen J. (1983) A Universal Data Compression SystemIEEE Transactions on Information Theory 29 (5) p.656-664

[126] Rodriguez, C. E., & Walker, S. G. (2014). Label switching in Bayesian mixture models: Deterministic relabeling strategies.Journal of Computational and Graph-ical Statistics, 23(1), 25-45.

[127] Rossi, F., Conan-Guez, B., & El Golli, A. (2004, April). Clustering functional data with the SOM algorithm. In ESANN (pp. 305-312).

[128] Peter E. Rossi (2014). Bayesian Non- and Semi-parametric Methods and Appli-cations.Princeton University Press

[129] Rosychuk, R. J., Sheng, X., & Stuber, J. L. (2006). Comparison of variance estimation approaches in a two-state Markov model for longitudinal data with misclassification.Statistics in medicine, 25(11), 1906-1921.

[130] Rousseeuw, P. J. (1987). Silhouettes: a graphical aid to the interpretation and validation of cluster analysis.Journal of computational and applied mathematics, 20, 53-65.

[131] Ryd´en, T. (2008) EM versus Markov chain Monte Carlo for estimation of hidden Markov models: a computational perspective. Bayesian Analysis 3(4): 659-688.

[132] Ryden, T. (2008) EM versus Markov chain Monte Carlo for estimation of hidden Markov models: a computational perspective. Bayesian Analysis 3(4): 659-688.

[133] Sarlin, P. (2013). Self-organizing time map: An abstraction of temporal multi-variate patterns.Neurocomputing, 99, 496-508.

[134] Schwarz, G. (1978). Estimating the dimension of a model. The annals of statistics, 6(2), 461-464.

[135] Scott, S. (2002) Bayesian methods for hidden Markov models. Journal of the American Statistical Association 97: 337-351.

[136] Scrucca, L. (2013) GA: A package for genetic algorithms in R. Journal of Statis-tical Software 53(4).

[137] Sela, R. J., & Simono↵, J. S. (2012). RE-EM trees: a data mining approach for longitudinal and clustered data.Machine learning, 86(2), 169-207.

[138] Shi, Y. and Eberhart, R.C. (1998) A modified particle swarm optimizer. Proceed-ings of IEEE International Conference on Evolutionary Computation 69-73.

BIBLIOGRAPHY 185 [139] Shmilovici A. et Ben-Gal I. (2007) Using a VOM model for reconstructing potential

coding regions in EST sequences Computational Statistics 22 p.49-69

[140] Singer, S. and Nelder, J. (2009) Nelder-Mead algorithm. Scholarpedia 4(7): 2928.

[141] Skarupova K, Olafsson K, Blinka L (2015) Excessive Internet Use and its associ-ation with negative experiences: Quasi-validassoci-ation of a short scale in 25 European countries. Computers in Human Behavior 53:118-123.

[142] Song, J. J., Lee, H. J., Morris, J. S., & Kang, S. (2007). Clustering of time-course gene expression data using functional data analysis.Computational biology and chemistry, 31(4), 265-274.

[143] Sperrin, M., Jaki, T. and Wit, E. (2010). Probabilistic relabelling strategies for the label switching problem in Bayesian mixture models.Statistics and Computing, 20(3), 357-366.

[144] Srinivas, M. and Patnaik, L. (1994) Adaptive probabilities of crossover and mu-tation in genetic algorithms. IEEE Transactions on System, Man and Cybernetics 24(4): 656-667.

[145] Steinley, D. (2008). Stability analysis in K-means clustering.British Journal of Mathematical and Statistical Psychology, 61(2), 255-273.

[146] Steinley, D. (2004). Properties of the Hubert-Arable Adjusted Rand In-dex.Psychological methods, 9(3), 386.

[147] Stephens, M. (2000). Dealing with label switching in mixture models.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62(4), 795-809.

[148] Storn, R. and Price, K. (1997) Di↵erential evolution - a simple and efficient heuris-tic for global optimization over continuous spaces. Journal of Global Optimization 11: 341-359.

[149] Sur´ıs JC, Akre C, Berchtold A, Fleury-Schubert A, Michaud PA and Zimmer-mann G (2012) [email protected]: Usage d’Internet chez les adolescents vaudois.

Lausanne: Institut universitaire de médecine sociale et préventive. (Raisons de santé, 208).

[150] Sur´ıs JC, Akre C, Piguet C, Ambresin AE, Zimmermann G and Berchtold A (2014) Is Internet use unhealthy? A cross-sectional study of adolescent Internet overuse. Swiss Med Wkly 2014;144:w14061

[151] Tanner, M. A., & Wong, W. H. (1987). The calculation of posterior distributions by data augmentation. Journal of the American statistical Association, 82(398), 528-540.

[152] Taushanov, Z., & Berchtold, A. (2017). A Direct Local Search Method and its Application to a Markovian Model. Statistics, Optimization & Information Com-puting, 5(1), 19-34.

[153] Taushanov, Z., & Berchtold, A. (2017) Markovian-based Clustering of Internet Addiction Trajectories. In G Ritschard & M Studer (eds), Sequence Analysis and Related Approaches: Innovative Methods and Applications. Berlin: Springer.

[154] Teicher, H. (1961). Identifiability of mixtures. The annals of Mathematical statis-tics, 32(1), 244-248.

[155] Teicher, H. (1963). Identifiability of finite mixtures. The annals of Mathematical statistics, 1265-1269.

[156] Titterington, D. M., Smith, A. F., & Makov, U. E. (1985). Statistical analysis of finite mixture distributions. Wiley.

[157] Tomida, S., Hanai, T., Honda, H., & Kobayashi, T. (2002). Analysis of expression profile using fuzzy adaptive resonance theory.Bioinformatics, 18(8), 1073-1083.

[TREE] TREE. (2016). Documentation on the first TREE cohort (TREE1), 2000-2016. Bern: TREE. Retrieved from http : //www.tree.unibe.ch/unibe/portal/f ak wiso/c dep sowi/micro tree/content/

e206328/e305140/e305154/f iles476810/T REE 2016 P roject documentation T REE1 2000 2016 English ger.pdf

[159] Tse, E., & Anton, J. (1972). On the identifiability of parameters. IEEE Transac-tions on Automatic Control, 17(5), 637-646.

[160] Vandenberg, R. J., & Lance, C. E. (2000). A review and synthesis of the mea-surement invariance literature: Suggestions, practices, and recommendations for organizational research. Organizational research methods, 3(1), 4-70.

BIBLIOGRAPHY 187 [161] Verbeke, G. and Lesa↵re, E. (1996) A linear mixed-e↵ects model with heterogene-ity in the random-e↵ects population. Journal of the American Statistical Associa-tion 91.433 : 217-221.

[162] Verhoof, E., Maurice-Stam, H., Heymans, H., & Grootenhuis, M. (2012).

Growing into disability benefits? Psychosocial course of life of young adults with a chronic somatic disease or disability. Acta Paediatrica, 101(1), e19-e26.

https://doi.org/10.1111/j.1651-2227.2011.02418.x

[163] Vermunt, J. K., & Magidson, J. (2003) Latent class models for classification.

Computational Statistics & Data Analysis, 41(3), 531-537.

[164] Visser, I., Raijmakers, M. E., & Molenaar, P. (2000). Confidence intervals for hidden Markov model parameters.British journal of mathematical and statistical psychology, 53(2), 317-327.

[165] Viterbi A. (1967) ”Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm” IEEE Transactions on Information Theory 13(2) p.260-269

[166] Von Luxburg, U. (2007). A tutorial on spectral clustering Statistics and comput-ing, 17(4), 395-416.

[167] Wang, X., Smith, K., & Hyndman, R. (2006). Characteristic-based clustering for time series data. Data mining and knowledge Discovery, 13(3), 335-364.

[168] Wang, M. and Bodner, T. E. (2007) Growth mixture modeling identifying and predicting unobserved subpopulations with longitudinal data. Organizational Re-search Methods 10(4), 635-656.

[169] Wong C.S., Li W.K. (2001) On a mixture autoregressive conditional heteroscedas-tic model. Journal of the American Statisheteroscedas-tical Association, 96, 982-995.

[170] Wu, C. J. (1983). On the convergence properties of the EM algorithm. The Annals of statistics, 95-103.

[171] Wu L. (2000) Some Comments on ”Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect” Sociological Methods & Research Vol.29 399-411

[172] Xiang, Y., Gubian, S., Suomela, B. and Hoeng J. (2013) Generalized simulated annealing for global optimization: the GenSA package. The R Journal 5(1).

[173] Yakowitz, S. J., & Spragins, J. D. (1968). On the identifiability of finite mixtures.

The Annals of Mathematical Statistics, 209-214.

[174] Yao, W. (2012). Model based labeling for mixture models. Statistics and Com-puting, 22(2), 337-347.

[175] Yao W. (2015) Label switching and its solutions for frequentist mixture models.

Journal of Statistical Computation and Simulation, 85(5), 1000-1012.

[176] Young KS (1998) Internet Addiction: The Emergence of a New Clinical Disorder.

CyberPsychology & Behavior 1:237-244

[177] Zhu, W., & Fan, Y. (2016). Relabelling algorithms for mixture models with ap-plications for large data sets. Journal of Statistical Computation and Simulation, 86(2), 394-413.

Glossary

AIC Akaike Information Criterion. 108 BIC Bayesian Information Criterion. 108 CI Confidence Interval. 97

DE Di↵erential Evolution. 61

EM Expectation-Maximization algorithm. 41 GA Genetic Algorithm. 60

GEM Generalized Expectation-Maximization algorithm. 46 GM M Gaussian Mixture Models. 9

GM M Growth Mixture Models. 34 HM M Hidden Markov Model. 13

HM T D Hidden Mixture Transition Distributions. 15 ICL Integrated Complete Likelihood. 109

M LE Maximum Likelihood Estimate. 97 M T D Mixture Transition Distributions. 9 N M Nelder-Mead optimisation. 61

P SO Particle Swarm Optimization. 60 SA Simulated Annealing. 59

189

In document Latent Markovian Modelling and Clustering for Continuous Data Sequences (Page 181-200)