SeqTrie: An index for data mining applications (original) (raw)
Large databases of sales data are not susceptible for manual analysis. In order to extract useful knowledge from them, one must use data mining algorithms (the so-called market basket analysis). Unfortunately, these algorithms, depending on data and parameters, may generate a large number of patterns. These patterns are easier to analyse then the raw data set, nevertheless they must still be analysed by the end user of the data mining application. Such analysis involves executing a lot of queries on complex data types that are not well supported by commercially available database management systems. In this paper, we present an index that may be used for improving performance of such queries and therefore, improving performance of analysis of data mining results.