A system architecture for database mining applications (original) (raw)

KNOWLEDGE RETRIEVAL AND DATA MINING

The rapid growth and adoption of the World Wide Web has further exacerbated the user need for efficient mechanisms for information and knowledge location, selection and retrieval. Much work is required to address knowledge retrieval; for instance, users' information needs could be better interpreted, leading to accurate information retrieval. This paper is to suggest knowledge retrieval as a new research field and also proposes a knowledge retrieval model combining knowledge search with data mining technologies. In this model, data mining is integrated into the whole retrieval procedure of query optimizing, searching, results analyzing, and resources constructing. It realizes knowledge retrieval by various approaches, different levels, and multi-modes, and significantly improves knowledge retrieval level and efficiency. Furthermore, we explore related knowledge retrieval methods and algorithms including association analysis-based concept retrieval and inductive learning-based classification retrieval.

Concept-Based Mining Model

Innovations and Systemic Approaches

Due to the daily rapid growth of the information, there are considerable needs in extracting and discovering valuable knowledge from the vast amount of information found in different data sources today such

KNOWLEDGE RETRIEVAL AND DATA MINING Mrs

2012

The rapid growth and adoption of the World Wide Web has further exacerbated the user need for efficient mechanisms for information and knowledge location, selection and retrieval. Much work is required to address knowledge retrieval; for instance, users' information needs could be better interpreted, leading to accurate information retrieval. This paper is to suggest knowledge retrieval as a new research field and also proposes a knowledge retrieval model combining knowledge search with data mining technologies. In this model, data mining is integrated into the whole retrieval procedure of query optimizing, searching, results analyzing, and resources constructing. It realizes knowledge retrieval by various approaches, different levels, and multi-modes, and significantly improves knowledge retrieval level and efficiency. Furthermore, we explore related knowledge retrieval methods and algorithms including association analysis-based concept retrieval and inductive learning-based cl...

Data Mining Support in Database Management Systems

Lecture Notes in Computer Science, 2000

The most popular data mining techniques consist in searching databases for frequently occurring patterns, e.g. association rules, sequential patterns. We argue that in contrast to today's loosely-coupled tools, data mining should be regarded as advanced database querying and supported by Database Management Systems (DBMSs). In this paper we describe our research prototype system, which logically extends DBMS functionality, offering extensive support for pattern discovery, storage and management. We focus on the system architecture and novel SQL-based data mining query language, which serves as the user interface to the system.

The MiningMart Approach to Knowledge Discovery in Databases

Intelligent Technologies for Information Analysis, 2004

When learning from very large databases, the reduction of complexity is extremely important. Two extremes of making knowledge discovery in databases (KDD) feasible have been put forward. One extreme is to choose a very simple hypothesis language, thereby being capable of very fast learning on real-world databases. The opposite extreme is to select a small data set, thereby being able to learn very expressive (first-order logic) hypotheses. A multistrategy approach allows one to include most of these advantages and exclude most of the disadvantages. Simpler learning algorithms detect hierarchies which are used to structure the hypothesis space for a more complex learning algorithm. The better structured the hypothesis space is, the better learning can prune away uninteresting or losing hypotheses and the faster it becomes. We have combined inductive logic programming (ILP) directly with a relational database management system. The ILP algorithm is controlled in a model-driven way by the user and in a data-driven way by structures that are induced by three simple learning algorithms.

DBMiner: A system for mining knowledge in large relational databases

Proc. Intl. Conf. on Data …, 1996

A data mining system, DBMiner, has been developed for interactive mining of multiple-level knowledge in large relational databases. The system implements a wide spectrum of data mining functions, including generalization, characterization, association, classication, and prediction. By incorporating several interesting data mining techniques, including attributeoriented induction, statistical analysis, progressive deepening for mining multiple-level knowledge, and meta-rule guided mining, the system provides a userfriendly, i n teractive data mining environment with good performance.

RUBRIC: A System for Rule-Based Information Retrieval

IEEE Transactions on Software Engineering, 2000

A research prototype software system for conceptual information retrieval has been developed. The goal of the system, ca)led RUBRIC, is to provide more automated and relevant access to unformatted textual databases. The approach is to use production rules from artificial intelligence to define a hierarchy of retrieval subtopics, with fuzzy context expressions and specific word phrases at the bottom. RUBRIC allows the definition of detailed queries starting at a conceptual level, partial matching of a query and a document, selection of only the highest ranked documents for presentation to the user, and detailed explanation of how and why a particular document was selected. Initial experiments indicate that a RUBRIC rule set better matches human retrieval judgment than a standard Boolean keyword expression, given equal amounts of effort in defining each. The techniques presented may be useful in stand-alone retrieval systems, front-ends to existing information retrieval systems, or real-time document filtering and routing Index Terms-Artificial intelligence, evidential reasoning, expert systems, information retrieval.

Integration of information retrieval and database management systems

Information Processing & Management, 1988

The problem of integrating database management systems and information retrieval systems has received increasing attention in recent years. In a database management environment, the records are formatted. The attributes with which the record characteristics and the user needs are described are precise. In contrast, an information retrieval system provides facilities for identifying references to documents, usually in textual form, from which the user information needs can be satisfied. The descriptors used to represent the content of the documents for this purpose are normally imprecise. The motivation for integrating these two types of systems is presented. Several features of such an integrated system in terms of what kinds of retrieval options such an integrated system should provide are identified. In particular, it is argued that the integrated system should allow the selection of the degree of structuredness of the search, depending on how precisely the user can specify his or her needs. A formulation of the underlying retrieval problem is presented. It is observed that the kind of integrated system considered has a broader scope of application than either one of its components.