• No results found

Four Equivalent Solutions for Model M h

In the previous section, we showed the full derivation of the closed-form solution to the marginal likelihood of model Mh. It turned out that there are four equivalent solutions to Equation.3. These solutions are derived by redefining which class map to the values 1 and 2 and by redefining which regions map toθ1andθ2.

Let us use the notations introduced in the previous section: a = N21+ α2, b = N22+ β2, c = N11+ α1 and d = N12+ β1. Also, let us define C as follows:

C =1 k· Γ(α1+ β1) Γ(α1) ·Γ(β1) · Γ(α2+ β2) Γ(α2) ·Γ(β2) (.9)

The marginal likelihood of Mh (Equation .3) can be obtained by solving any of the fol- lowing four equations:

C · a+b−1 X j=a Γ(a) ·Γ(b) Γ( j + 1) ·Γ(a + b − j)· Γ(c + j) ·Γ(a + b + d − j − 1) Γ(a + b + c + d − 1) (.10)

Which is the solution we derived in the previous section. C · d+c−1 X j=d Γ(c) ·Γ(d) Γ( j + 1) ·Γ(c + d − j)· Γ(b + j) ·Γ(c + d + a − j − 1) Γ(a + b + c + d − 1) (.11) C · Ã Γ(a) ·Γ(b) Γ(a + b) · Γ(c) ·Γ(d) Γ(c + d) − a+b−1 X j=b Γ(a) ·Γ(b) Γ( j + 1) ·Γ(a + b − j)· Γ(d + j) ·Γ(a + b + c − j − 1) Γ(a + b + c + d − 1) ! (.12) C · Ã Γ(a) ·Γ(b) Γ(a + b) · Γ(c) ·Γ(d) Γ(c + d) − c+d−1 X j=c Γ(c) ·Γ(d) Γ( j + 1) ·Γ(c + d − j)· Γ(a + j) ·Γ(c + d + b − j − 1) Γ(a + b + c + d − 1) ! (.13)

A.4 DERIVATION OF THE CLOSED-FORM SOLUTION FOR MODEL ML

The marginal likelihood of model Ml (P r(G|Ml)) is defined as follows:

=1 k Z 1 θ2=0 θ2N21· (1 − θ2)N22· beta(θ2;α2,β2) | {z } f1 Z θ2 θ1=0 θ1N11· (1 − θ1)N11· beta(θ1;α1,β1)dθ1 | {z } f2 dθ2 (.14) By solving the integral given by f2, we get:

f2= Γ(α1+ β1) Γ(α1) ·Γ(β1) Z θ2 θ1=0 θ1c−1· (1 − θ1)d−1dθ2 = Γ(α1+ β1) Γ(α1) ·Γ(β1)· Γ(c) ·Γ(d) Γ(c + d) · c+d−1 X j=c Γ(c + d) Γ( j + 1) ·Γ(c + d − j)· θ j 2· (1 − θ2)c+d−1− j

where, as before, c = N11+ α1and d = N12+ β1. By solving f1, we get: f1= Γ(α2+ β2) Γ(α2) ·Γ(β2) Z 1 θ2=0 θ2a−1· (1 − θ2)b−1 Now we can solve Equation.14:

P r(G|Ml) = C · c+d−1 X j=c Γ(c) ·Γ(d) Γ( j + 1) ·Γ(c + d − j)· Γ(a + j) ·Γ(c + d + b − 1 − j) Γ(a + b + c + d − 1) (.15) Where C is the constant we defined by Equation.9in the previous section.

Notice that Equation.15 (the solution to P r(G|Ml)) can be obtained from Equation .13 (one of the four solutions to P r(G|Mh)) as follows:

P r(G|Ml) = C ·Γ(a) ·Γ

(b)Γ(c) ·Γ(d)

Γ(a + b) · −Γ(c + d) − P r(G|Mh) (.16) It turned out that no matter which formula we used to solve P r(G|Mh), we can use Equation.16to obtain P r(G|Ml).

A.5 COMPUTATIONAL COMPLEXITY

Since we require that N11, N12, N21, N22,α1,β1,α2andβ2be natural numbers, the gamma function simply becomes a factorial function: Γ(x) = (x − 1)!. Since such numbers can be- come very large, it is convenient to use the logarithm of the gamma function and express Equations .2,.10, .11, .12, .13 and .16 in logarithmic form in order to preserve numerical precision. The logarithm of the integer gamma function can be pre-computed and efficiently stored in an array as follows:

lnGamma[1] = 0 For i = 2 to n

lnGamma[i] = lnGamma[i − 1] + ln(i − 1)

We then can use lnGamma in solving the above equations. However, Equations .10, .11,.12and.13include a sum, which makes the use of the logarithmic form more involved. To deal with this issue, we can define function lnAdd, which takes two arguments x and y that are in logarithmic form and returns ln(ex+ ey). It does so in a way that preserves a good deal of numerical precision that could be lost if ln(ex+ ey) were calculated in a direct

manner. This is done by using the following formula:

lnAdd(x, y) = x + ln(1 + e( y−x))

Now that we introduced functions lnGamma and lnAdd, it is straightforward to eval- uate Equations.2,.10,.11,.12,.13and.16in logarithmic form.

Let us now analyze the overall computational complexity for computing the Bayesian score for a specific rule (solving Equation .1). Doing so requires computing P r(Me|G), P r(Mh|G) and P r(Ml|G). P r(Me|G) can be computed in O(1) using Equation .2. P r(Mh|G) can be computed by applying Equation.10, Equation.11, Equation.12or Equation.13. The computational complexity of these equations are O(N22+ β2), O(N11+ α1), O(N21+ α2) and O(N12+ β1), respectively. Therefore, P r(Mh|G) can be computed in O(min(N11+ α1, N12+

β1, N21+ α2, N22+ β2). P r(Ml|G) can be computed from P r(Mh|G) in O(1) using Equation .16. By assuming that α1, β1, α2, β2 are bounded from above, the overall complexity for computing the Bayesian score is O(min(N11,N12,N21,N22).

BIBLIOGRAPHY

[Abramowitz and Stegun, 1964] Abramowitz, M. and Stegun, I. A. (1964). Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables.

[Agrawal et al., 1998] Agrawal, R., Gehrke, J., Gunopulos, D., and Raghavan, P. (1998). Au- tomatic Subspace Clustering of High Dimensional Data for Data Mining Applications. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Agrawal et al., 1993] Agrawal, R., Imielinski, T., and Swami, A. (1993). Mining Association Rules between Sets of Items in Large Databases. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Agrawal and Shafer, 1996] Agrawal, R. and Shafer, J. C. (1996). Parallel Mining of Associ- ation Rules. IEEE Transaction on Knowledge and Data Engineering, 8:962–969.

[Agrawal and Srikant, 1994] Agrawal, R. and Srikant, R. (1994). Fast Algorithms for Min- ing Association Rules in Large Databases. In Proceedings of the international conference on Very Large Data Bases (VLDB).

[Agrawal and Srikant, 1995] Agrawal, R. and Srikant, R. (1995). Mining Sequential Pat- terns. In Proceedings of the International Conference on Data Engineering (ICDE). [Allen, 1984] Allen, F. (1984). Towards a General Theory of Action and Time. Artificial

Intelligence, 23:123-154.

[Asuncion and Newman, 2007] Asuncion, A. and Newman, D. (2007). UCI machine learning repository.

[Bailey et al., 2002] Bailey, J., Manoukian, T., and Ramamohanarao, K. (2002). Fast Al- gorithms for Mining Emerging Patterns. In Proceedings of the European conference on Principles of Data Mining and Knowledge Discovery (PKDD).

[Batal et al., 2012a] Batal, I., Cooper, G., and Hauskrecht, M. (2012a). A Bayesian Scoring Technique for Mining Predictive and Non-Spurious Rules. In Proceedings of the European conference on Principles of Data Mining and Knowledge Discovery (PKDD).

[Batal et al., 2012b] Batal, I., Fradkin, D., Harrison, J., Moerchen, F., and Hauskrecht, M. (2012b). Mining Recent Temporal Patterns for Event Detection in Multivariate Time

Series Data. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Batal and Hauskrecht, 2009] Batal, I. and Hauskrecht, M. (2009). A Supervised Time Se- ries Feature Extraction Technique Using DCT and DWT. In International Conference on Machine Learning and Applications (ICMLA).

[Batal and Hauskrecht, 2010a] Batal, I. and Hauskrecht, M. (2010a). A Concise Represen- tation of Association Rules using Minimal Predictive Rules. In Proceedings of the Euro- pean conference on Principles of Data Mining and Knowledge Discovery (PKDD).

[Batal and Hauskrecht, 2010b] Batal, I. and Hauskrecht, M. (2010b). Constructing Classi- fication Features using Minimal Predictive Patterns. In Proceedings of the international conference on Information and knowledge management (CIKM).

[Batal et al., 2009] Batal, I., Sacchi, L., Bellazzi, R., and Hauskrecht, M. (2009). Multivari- ate Time Series Classification with Temporal Abstractions. In Proceedings of the Florida Artificial Intelligence Research Society (FLAIRS).

[Batal et al., 2011] Batal, I., Valizadegan, H., Cooper, G. F., and Hauskrecht, M. (2011). A Pattern Mining Approach for Classifying Multivariate Temporal Data. In Proceedings of the IEEE international conference on bioinformatics and biomedicine (BIBM).

[Batal et al., 2012c] Batal, I., Valizadegan, H., Cooper, G. F., and Hauskrecht, M. (2012c). A Temporal Pattern Mining Approach for Classifying Electronic Health Record Data. ACM Transaction on Intelligent Systems and Technology (ACM TIST), Special Issue on Health Informatics.

[Bay and Pazzani, 2001] Bay, S. D. and Pazzani, M. J. (2001). Detecting Group Differences: Mining Contrast Sets. Data Mining and Knowledge Discovery, 5:213–246.

[Bayardo, 1998] Bayardo, R. J. (1998). Efficiently Mining Long Patterns from Databases. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Bayardo, 1999] Bayardo, R. J. (1999). Constraint-Based Rule Mining in Large, Dense Databases. In Proceedings of the International Conference on Data Engineering (ICDE). [Beil et al., 2002] Beil, F., Ester, M., and Xu, X. (2002). Frequent Term-based Text Clus-

tering. In Proceedings of the international conference on Knowledge discovery and data mining (SIGKDD).

[Benjamini and Hochberg, 1995] Benjamini, Y. and Hochberg, Y. (1995). Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society, 57(1):289–300.

[Blasiak and Rangwala, 2011] Blasiak, S. and Rangwala, H. (2011). A Hidden Markov Model Variant for Sequence Classification. In Proceedings of the International Joint Con- ferences on Artificial Intelligence (IJCAI).

[Breiman et al., 1984] Breiman, L., Friedman, J., Olshen, R., and Stone, C. (1984). Classi- fication and Regression Trees. Statistics/Probability Series. Wadsworth Publishing Com- pany.

[Brin et al., 1997a] Brin, S., Motwani, R., and Silverstein, C. (1997a). Beyond Market Bas- kets: Generalizing Association Rules to Correlations. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Brin et al., 1997b] Brin, S., Motwani, R., Ullman, J. D., and Tsur, S. (1997b). Dynamic Itemset Counting and Implication Rules for Market Basket Data. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Calders and Goethals, 2002] Calders, T. and Goethals, B. (2002). Mining All Non-derivable Frequent Itemsets. In Proceedings of the European conference on Principles of Data Min- ing and Knowledge Discovery (PKDD).

[Casas-Garriga, 2005] Casas-Garriga, G. (2005). Summarizing Sequential Data with Closed Partial Orders. In Proceedings of the SIAM international conference on Data Mining (SDM).

[Chandola et al., 2006] Chandola, V., Eilertson, E., Ertoz, L., Simon, G., and Kumar, V. (2006). Data Warehousing and Data Mining Techniques for Computer Security, chapter Data Mining for Cyber Security. Springer.

[Cheng et al., 2007] Cheng, H., Yan, X., Han, J., and wei Hsu, C. (2007). Discriminative Frequent Pattern Analysis for Effective Classification. In Proceedings of the International Conference on Data Engineering (ICDE).

[Cheng et al., 2008] Cheng, H., Yan, X., Han, J., and Yu, P. S. (2008). Direct Discriminative Pattern Mining for Effective Classification. In Proceedings of the International Conference on Data Engineering (ICDE).

[Clark and Niblett, 1989] Clark, P. and Niblett, T. (1989). The CN2 Induction Algorithm. Machine Learning, 3:261–283.

[Cohen, 1995] Cohen, W. (1995). Fast Effective Rule Induction. In Proceedings of Interna- tional Conference on Machine Learning (ICML).

[Cohen and Singer, 1999] Cohen, W. and Singer, Y. (1999). A Simple, Fast, and Effective Rule Learner. In Proceedings of the National conference on Artificial Intelligence (AAAI). [Cong et al., 2005] Cong, G., Tan, K.-L., Tung, A. K. H., and Xu, X. (2005). Mining Top- K Covering Rule Groups for Gene Expression Data. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Das et al., 1998] Das, G., Lin, K.-I., Mannila, H., Renganathan, G., and Smyth, P. (1998). Rule Discovery from Time Series. In Proceedings of the international conference on Knowl- edge Discovery and Data mining (SIGKDD).

[Deshpande et al., 2005] Deshpande, M., Kuramochi, M., Wale, N., and Karypis, G. (2005). Frequent Substructure-Based Approaches for Classifying Chemical Compounds. IEEE Transactions on Knowledge and Data Engineering, 17:1036–1050.

[Dong and Li, 1999] Dong, G. and Li, J. (1999). Efficient Mining of Emerging Patterns: Discovering Trends and Differences. In Proceedings of the international conference on Knowledge discovery and data mining (SIGKDD).

[Exarchos et al., 2008] Exarchos, T. P., Tsipouras, M. G., Papaloukas, C., and Fotiadis, D. I. (2008). A Two-stage Methodology for Sequence Classification based on Sequential Pattern Mining and Optimization. Data and Knowledge Engineering, 66:467–487.

[Fan et al., 2008] Fan, W., Zhang, K., Cheng, H., Gao, J., Yan, X., Han, J., Yu, P., and Ver- scheure, O. (2008). Direct Mining of Discriminative and Essential Frequent Patterns via Model-based Search Tree. In Proceeding of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Fawcett, 2006] Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters., 27(8):861–874.

[Fayyad and Irani, 1993] Fayyad, U. and Irani, K. (1993). Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning. In Proceedings of the Interna- tional Joint Conference on Artificial Intelligence (IJCAI).

[Fischer et al., 2008] Fischer, J., Mäkinen, V., and Välimäki, N. (2008). Space Efficient String Mining under Frequency Constraints. In Proceedings of the International Con- ference on Data Mining (ICDM).

[Friedman et al., 2000] Friedman, J., Hastie, T., and Tibshirani, R. (2000). Additive Logistic Regression: a Statistical View of Boosting. Annals of Statistics, 28.

[Geng and Hamilton, 2006] Geng, L. and Hamilton, H. J. (2006). Interestingness Measures for Data Mining: A Survey. ACM Computing Surveys, 38.

[Goethals et al., 2005] Goethals, B., Muhonen, J., and Toivonen, H. (2005). Mining Non- Derivable Association Rules. In Proceedings of the SIAM international conference on Data Mining (SDM).

[Grosskreutz et al., 2010] Grosskreutz, H., Boley, M., and Krause-Traudes, M. (2010). Sub- group Discovery for Election Analysis: a Case Study in Descriptive Data Mining. In Proceedings of the international conference on Discovery science.

[Guttormsson et al., 1999] Guttormsson, S., Marks, R.J., I., El-Sharkawi, M., and Kerszen- baum, I. (1999). Elliptical Novelty Grouping for on-line short-turn Detection of Excited Running Rotors. IEEE Transactions on Energy Conversion.

[Han et al., 2007] Han, J., Cheng, H., Xin, D., and Yan, X. (2007). Frequent Pattern Mining: Current Status and Future Directions. Data Mining and Knowledge Discovery, 14(1):55– 86.

[Han et al., 2006] Han, J., Kamber, M., and Pei, J. (2006). Data Mining: Concepts and Techniques. Morgan Kaufmann, 2 edition.

[Han et al., 2000] Han, J., Pei, J., and Yin, Y. (2000). Mining frequent patterns without candidate generation. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Hauskrecht et al., 2010] Hauskrecht, M., Valko, M., Batal, I., Clermont, G., Visweswaram, S., and Cooper, G. (2010). Conditional Outlier Detection for Clinical Alerting. In Proceed- ings of the American Medical Informatics Association (AMIA).

[Heckerman et al., 1995] Heckerman, D., Geiger, D., and Chickering, D. M. (1995). Learn- ing Bayesian Networks: The Combination of Knowledge and Statistical Data. Machine Learning.

[Ho et al., 2003] Ho, T. B., Nguyen, T. D., Kawasaki, S., Le, S. Q., Nguyen, D. D., Yokoi, H., and Takabayashi, K. (2003). Mining Hepatitis Data with Temporal Abstraction. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Höppner, 2003] Höppner, F. (2003). Knowledge Discovery from Sequential Data. PhD the- sis, Technical University Braunschweig, Germany.

[Ifrim and Wiuf, 2011] Ifrim, G. and Wiuf, C. (2011). Bounded Coordinate-descent for Bio- logical Sequence Classification in High Dimensional Predictor Space. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Jain et al., 2004] Jain, A., Chang, E. Y., and Wang, Y.-F. (2004). Adaptive stream resource management using Kalman Filters. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Jaroszewicz and Scheffer, 2005] Jaroszewicz, S. and Scheffer, T. (2005). Fast discovery of unexpected patterns in data relative to a Bayesian network. In Proceedings of the inter- national conference on Knowledge discovery in data mining (SIGKDD).

[Kadous, 1999] Kadous, M. W. (1999). Learning Comprehensible Descriptions of Multivari- ate Time Series. In Proceedings of the International Conference of Machine Learning (ICML).

[Kam and Fu, 2000] Kam, P.-s. and Fu, A. W.-C. (2000). Discovering Temporal Patterns for Interval-Based Events. In Proceedings of the international conference on Data Warehous- ing and Knowledge Discovery (DaWaK).

[Kavsek and Lavraˇc, 2006] Kavsek, B. and Lavraˇc, N. (2006). APRIORI-SD: Adapting As- sociation Rule Learning to Subgroup Discovery. Applied Artificial Intelligence, 20(7):543– 583.

[Keogh et al., 2000] Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotra, S. (2000). Di- mensionality Reduction for Fast Similarity Search in Large Time Series Databases. jour- nal of Knowledge and Information Systmes, 3:263–286.

[Keogh et al., 1993] Keogh, E., Chu, S., Hart, D., and Pazzani, M. (1993). Segmenting Time Series: A Survey and Novel Approach. Data mining in Time Series Databases.

[Keogh et al., 2010] Keogh, E., Zhu, Q., Hu, B., Y., H., Xi, X., Wei, L., and Ratanamahatana, C. A. (2010). The UCR Time Series Classification/Clustering. http:www.cs.ucr.edu/

~eamonn/time_series_data.

[Keogh et al., 2001] Keogh, E. J., Chakrabarti, K., Pazzani, M. J., and Mehrotra, S. (2001). Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases. knowledge and information systems, 3(3):263–286.

[Keogh and Pazzani, 1998] Keogh, E. J. and Pazzani, M. J. (1998). An Enhanced Repre- sentation of Time Series which Allows Fast and Accurate Classification, Clustering and Relevance Feedback. In Proceedings of the international conference on Knowledge Discov- ery and Data mining (SIGKDD).

[Kuramochi and Karypis, 2001] Kuramochi, M. and Karypis, G. (2001). Frequent Subgraph Discovery. In Proceedings of the International Conference on Data Mining (ICDM). [Lafferty et al., 2001] Lafferty, J., McCallum, A., and Pereira, F. (2001). Conditional Ran-

dom Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Pro- ceedings of the International Conference on Machine Learning (ICML).

[Lavraˇc and Gamberger, 2005] Lavraˇc, N. and Gamberger, D. (2005). Relevancy in Constraint-based Subgroup Discovery. In Constraint-Based Mining and Inductive Databases.

[Lee et al., 2000] Lee, W., Stolfo, S. J., and Mok, K. W. (2000). Adaptive Intrusion Detection: A Data Mining Approach. Artificial Intelligence Review, 14(6):533–567.

[Leslie et al., 2002] Leslie, C. S., Eskin, E., Weston, J., and Noble, W. S. (2002). Mismatch String Kernels for SVM Protein Classification. In Neural Information Processing Systems (NIPS).

[Levenshtein, 1966] Levenshtein, V. I. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8):707–710.

[Li et al., 2001a] Li, J., Shen, H., and Topor, R. W. (2001a). Mining Optimal Class Associa- tion Rule Set. In Proceedings of the Pacific-Asia conference on Knowledge Discovery and Data Mining (PAKDD).

[Li et al., 2009] Li, L., McCann, J., Pollard, N. S., and Faloutsos, C. (2009). DynaMMo: Mining and Summarization of Coevolving Sequences with Missing Values. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Li et al., 2010] Li, L., Prakash, B. A., and Faloutsos, C. (2010). Parsimonious Linear Fin- gerprinting for Time Series. PVLDB, 3:385–396.

[Li et al., 2001b] Li, W., Han, J., and Pei, J. (2001b). CMAR: Accurate and Efficient Classi- fication Based on Multiple Class-Association Rules. In Proceedings of the International Conference on Data Mining (ICDM).

[Lin et al., 2003] Lin, J., Keogh, E. J., Lonardi, S., and chi Chiu, B. Y. (2003). A Symbolic Representation of Time Series, with Implications for Streaming Algorithms. In Proceed- ings of the SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discov- ery.

[Lin and Lee, 2005] Lin, M. and Lee, S. (2005). Fast Discovery of Sequential Patterns through Memory Indexing and Database Partitioning. Journal Information Science and Engineering, 21:109–128.

[Liu et al., 1998] Liu, B., Hsu, W., and Ma, Y. (1998). Integrating Classification and Associ- ation Rule Mining. Knowledge Discovery and Data Mining, pages 80–86.

[Mampaey et al., 2011] Mampaey, M., Tatti, N., and Vreeken, J. (2011). Tell me what I need to know: Succinctly Summarizing Data with Itemsets. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Manber and Myers, 1990] Manber, U. and Myers, G. (1990). Suffix Arrays: A New Method for On-line String Searches. In Proceedings of the first annual SIAM symposium on Dis- crete algorithms.

[Mannila et al., 1997] Mannila, H., Toivonen, H., and Inkeri Verkamo, A. (1997). Discovery of Frequent Episodes in Event Sequences. Data Mining and Knowledge Discovery, 1:259– 289.

[Méger and Rigotti, 2004] Méger, N. and Rigotti, C. (2004). Constraint-based Mining of Episode Rules and Optimal Window Sizes. In Proceedings of the European conference on Principles and Practice of Knowledge Discovery in Databases (PKDD).

[Mitchell, 1982] Mitchell, T. M. (1982). Generalization as Search. Artificial Intelligence, 18(2):203 – 226.

[Mitchell, 1997] Mitchell, T. M. (1997). Machine Learning. McGraw-Hill, Inc.

[Moerchen, 2006a] Moerchen, F. (2006a). Algorithms for Time Series Knowledge Mining. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Moerchen, 2006b] Moerchen, F. (2006b). Time Series Knowledge Mining. PhD thesis, Philipps-University Marburg.

[Moerchen and Fradkin, 2010] Moerchen, F. and Fradkin, D. (2010). Robust Mining of Time Intervals with Semi-interval Partial Order Patterns. In Proceedings of the SIAM interna- tional conference on Data Mining (SDM).

[Moerchen and Ultsch, 2005] Moerchen, F. and Ultsch, A. (2005). Optimizing Time Series Discretization for Knowledge Discovery. In Proceedings of the international conference on Knowledge discovery in data mining (SIGKDD).

[Morishita and Sese, 2000] Morishita, S. and Sese, J. (2000). Tranversing Itemset Lat- tices with Statistical Metric Pruning. In Proceedings of the symposium on Principles of database systems (PODS).

[Moskovitch and Shahar, 2009] Moskovitch, R. and Shahar, Y. (2009). Medical Temporal- Knowledge Discovery via Temporal Abstraction. In Proceedings of the American Medical Informatics Association (AMIA).

[Nijssen et al., 2009] Nijssen, S., Guns, T., and De Raedt, L. (2009). Correlated Itemset Mining in ROC space: a Constraint Programming Approach. In Proceedings of the inter- national conference on Knowledge discovery and data mining (SIGKDD).

[Novak et al., 2009] Novak, P. K., Lavraˇc, N., and Webb, G. I. (2009). Supervised Descriptive Rule Discovery: A Unifying Survey of Contrast Set, Emerging Pattern and Subgroup Mining. Journal of Machine Learning Research (JMLR), 10:377–403.

[Papadimitriou et al., 2005] Papadimitriou, S., Sun, J., and Faloutsos, C. (2005). Streaming Pattern Discovery in Multiple Time-Series. In Proceedings of the international conference on Very Large Data Bases (VLDB).

[Papapetrou et al., 2005] Papapetrou, P., Kollios, G., Sclaroff, S., and Gunopulos, D. (2005). Discovering Frequent Arrangements of Temporal Intervals. In Proceedings of the Inter- national Conference on Data Mining (ICDE).

[Pasquier et al., 1999] Pasquier, N., Bastide, Y., Taouil, R., and Lakhal, L. (1999). Discover- ing Frequent Closed Itemsets for Association Rules. In Proceedings of the International Conference on Database Theory (ICDT).

[Patel et al., 2008] Patel, D., Hsu, W., and Lee, M. L. (2008). Mining Relationships among Interval-based Events for Classification. In Proceedings of the international conference on Management Of Data (SIGMOD).

[Pei and Han, 2000] Pei, J. and Han, J. (2000). Can we push more Constraints into Frequent Pattern Mining? In Proceedings of the international conference on Knowledge discovery and data mining (SIGKDD).

[Pei et al., 2001] Pei, J., Han, J., Mortazavi-asl, B., Pinto, H., Chen, Q., Dayal, U., and chun Hsu, M. (2001). PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth. In Proceedings of the International Conference on Data Engineering (ICDE).

[Pei et al., 2007] Pei, J., Han, J., and Wang, W. (2007). Constraint-based Sequential Pattern Mining: the Pattern-growth Methods. Journal of Intelligent Information Systems, 28:133– 160.

[Pendelton et al., 2006] Pendelton, R., Wheeler, M., and Rodgers, G. (2006). Argatroban Dosing of Patients with Heparin Induced Thrombocytopenia and an Alevated aPTT due to Antiphospholipid Antibody Syndrome. The Annals of Pharmacotherapy, 40:972 ˝U–976.

[Plamondon and Srihari, 2000] Plamondon, R. and Srihari, S. N. (2000). Online and Of- fline Handwriting Recognition: A Comprehensive Survey. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22(1):63–84.

[Quinlan, 1986] Quinlan, J. R. (1986). Induction of Decision Trees. In Machine Learning, pages 81–106.

[Quinlan, 1990] Quinlan, J. R. (1990). Learning Logical Definitions from Relations. Ma- chine Learning, 5:239–266.

[Quinlan, 1993] Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kauf- mann Publishers Inc.

[Rabiner, 1989] Rabiner, L. R. (1989). A tutorial on Hidden Markov Models and selected applications in Speech Recognition. In Proceedings of the IEEE, pages 257–286.

[Ratanamahatana and Keogh, 2005] Ratanamahatana, C. and Keogh, E. J. (2005). Three Myths about Dynamic Time Warping Data Mining. In Proceedings of the SIAM interna- tional conference on Data Mining (SDM).

[Rojas, 1996] Rojas, R. (1996). Neural Networks: a systematic introduction. Springer, 1st edition.

[Sacchi et al., 2007] Sacchi, L., Larizza, C., Combi, C., and Bellazzi, R. (2007). Data mining with Temporal Abstractions: learning rules from time series. Data Mining and Knowledge Discovery.

[Savasere et al., 1995] Savasere, A., Omiecinski, E., and Navathe, S. B. (1995). An Effi- cient Algorithm for Mining Association Rules in Large Databases. In Proceedings of the international conference on Very Large Data Bases (VLDB).

[Schapire and Singer, 1999] Schapire, R. E. and Singer, Y. (1999). Improved Boosting Algo- rithms Using Confidence-rated Predictions. Machine Learning, pages 80–91.

[Sebastiani, 2002] Sebastiani, F. (2002). Machine Learning in Automated Text Categoriza- tion. ACM Computing Surveys.

[Shahar, 1997] Shahar, Y. (1997). A Framework for Knowledge-Based Temporal Abstrac- tion. Artificial Intelligence, 90:79-133.

[Shumway and Stoffer, 2006] Shumway, R. H. and Stoffer, D. S. (2006). Time Series Analysis and Its Applications: With R Examples. Springer, 2nd edition.

[Siebes et al., 2006] Siebes, A., Vreeken, J., and van Leeuwen, M. (2006). Item Sets that Compress. In Proceedings of the SIAM international conference on Data Mining (SDM). [Smyth and Goodman, 1992] Smyth, P. and Goodman, R. M. (1992). An Information Theo-

retic Approach to Rule Induction from Databases. IEEE Transactions on Knowledge and Data Engineering.

[Srikant and Agrawal, 1996] Srikant, R. and Agrawal, R. (1996). Mining Sequential Pat- terns: Generalizations and Performance Improvements. In Proceedings of the interna- tional conference on Extending Database Technology (EDBT).

[Srivastava et al., 2008] Srivastava, A., Kundu, A., Sural, S., and Majumdar, A. (2008). Credit Card Fraud Detection Using Hidden Markov Model. IEEE Transactions on De- pendable and Secure Computing.

[Toivonen, 1996] Toivonen, H. (1996). Sampling Large Databases for Association Rules. In Proceedings of the international conference on Very Large Data Bases (VLDB).

[Tseng and Lee, 2005] Tseng, V. S.-M. and Lee, C.-H. (2005). CBS: A New Classification Method by Using Sequential Patterns. In Proceedings of the SIAM international confer- ence on Data Mining (SDM).

[Vail et al., 2007] Vail, D. L., Veloso, M. M., and Lafferty, J. D. (2007). Conditional Random Fields for Activity Recognition. In Proceedings of the international joint conference on Autonomous Agents and Multiagent Systems (AAMAS).

[Valko and Hauskrecht, 2010] Valko, M. and Hauskrecht, M. (2010). Feature Importance Analysis for Patient Management Decisions. In Proceedings of medical informatics (Med- Info).

[Vanetik et al., 2002] Vanetik, N., Gudes, E., and Shimony, S. E. (2002). Computing Fre- quent Graph Patterns from Semistructured Data. In Proceedings of the International Conference on Data Mining (ICDM).

[Vapnik, 1995] Vapnik, V. (1995). The Nature of Statistical Learning Theory. Springer- Verlag, NY.

[Veloso et al., 2006] Veloso, A., Meira Jr., W., and Zaki, M. J. (2006). Lazy Associative Clas- sification. In Proceedings of the International Conference on Data Mining (ICDM).

[Villafane et al., 2000] Villafane, R., Hua, K. A., Tran, D., and Maulik, B. (2000). Knowledge Discovery from Series of Interval Events. Journal of Intelligent Information Systems, 15:71–89.

[Wang and Han, 2004] Wang, J. and Han, J. (2004). BIDE: Efficient Mining of Frequent Closed Sequences. In Proceedings of the International Conference on Data Engineering (ICDE).

[Wang et al., 2003a] Wang, J., Han, J., and Pei, J. (2003a). CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Wang and Karypis, 2005] Wang, J. and Karypis, G. (2005). HARMONY: Efficiently mining the best rules for classification. In Proceedings of the SIAM international conference on Data Mining (SDM).

[Wang et al., 2003b] Wang, K., Jiang, Y., and Lakshmanan, L. V. S. (2003b). Mining Unex- pected Rules by Pushing User Dynamics. In Proceedings of the international conference on Knowledge Discovery and Data mining (SIGKDD).

[Warkentin, 2000] Warkentin, T. (2000). Heparin-induced thrombocytopenia: pathogenesis and management. British Journal of Haematology, 121:535–555.

[Webb, 2007] Webb, G. I. (2007). Discovering Significant Patterns. Machine Learning, 68(1):1–33.

[Weng and Shen, 2008] Weng, X. and Shen, J. (2008). Classification of Multivariate Time Series using two-dimensional Singular Value Decomposition. Knowledge-Based Systems, 21(7):535 – 539.

[Winarko and Roddick, 2007] Winarko, E. and Roddick, J. F. (2007). ARMADA - An Algo- rithm for Discovering Richer Relative Temporal Association Rules from Interval-based Data. Data and Knowledge Engineering, 63:76–90.

[Wong et al., 2005] Wong, W.-K., Moore, A. W., Cooper, G. F., and Wagner, M. M. (2005). What’s Strange About Recent Events (WSARE): An Algorithm for the Early Detection of Disease Outbreaks. Journal of Machine Learning Research, 6:1961–1998.

[Wu and Chen, 2007] Wu, S.-Y. and Chen, Y.-L. (2007). Mining Nonambiguous Temporal Patterns for Interval-Based Events. IEEE Transactions on Knowledge and Data Engi- neering, 19:742–758.

[Xi et al., 2006] Xi, X., Keogh, E., Shelton, C., Wei, L., and Ratanamahatana, C. A. (2006). Fast Time Series Classification using Numerosity Reduction. In Proceedings of the Inter-