Concept Hierarchy in Data Mining (original) (raw)

Last Updated : 26 Nov, 2025

A concept hierarchy organizes data into multiple levels of abstraction-ranging from detailed (low-level) values to more general (high-level) concepts. It allows users to drill down for detailed analysis or roll up for summaries, helping simplify large datasets and improving pattern discovery.

Note: Data mining refers to extracting useful patterns, relationships, and knowledge from large datasets using techniques from statistics, machine learning, and AI.

Example of a Concept Hierarchy

The diagram below shows a hierarchy for the Location dimension.

Such hierarchies allow a user to analyze data at the level they need—city-level detail or country-level summaries.

Concept-hierarchy-2

Concept Hierarchy Example

The hierarchical structure represents the abstraction level of the dimension location, which consists of various footprints of the dimension such as:

Concept-hierarchy-1

low-level to high-level representation

Applications of Concept Hierarchy

There are several applications of concept hierarchy in data mining, some examples are:

Types of Concept Hierarchies

Concept hierarchies can be created in different ways depending on the data and the domain:

1. Schema-Based Hierarchy

Derived from the database schema (e.g., primary key → foreign key relationships). Useful in data warehouses for structuring dimensions such as:

2. Set-Grouping Hierarchy

Groups values based on set membership or category. Useful for:

3. Operation-Derived Hierarchy

Created by applying operations like:

4. Rule-Based Hierarchy

Defined using user-created rules or conditions. Example rule:

5. Better Representation of Domain Knowledge

Hierarchies capture real-world relationships (e.g., Product → Brand → Category), making the data model easier to understand.

Types of Concept Hierarchies

1. Explicitly Defined Hierarchies

These are manually designed by domain experts or database designers. Example:

2. Implicitly Defined Hierarchies

These are formed automatically based on:

Methods to Generate Concept Hierarchies

**1. Schema-Based Generation: Hierarchy is derived from the database schema itself. Examples:

**2. Rule-Based Generation: Hierarchies designed using user-defined rules or metadata. Example rule:

**3. Data-Based Generation: Hierarchies created using data distribution. Methods include:

Need of Concept Hierarchy in Data Mining

There are several reasons why a concept hierarchy is useful in data mining: