Data Replication in DBMS (original) (raw)

Last Updated : 18 Apr, 2026

Data Replication in DBMS refers to the process of storing copies of the same data at multiple sites or nodes within a distributed database system. The main purpose of replication is to improve data availability, reliability, performance, and fault tolerance. In a distributed database environment, data replication ensures that:

**Note: It involves copying data from one database (known as the source or publisher) to another (known as the replica, subscriber or target) so that all users can access consistent and up-to-date information without conflicts.

Need of Data Replication

Types of Data Replication

types_of_data_replication_1

Types of Data Replication

1. Transactional Replication

Involves sending an initial full copy of the database to subscribers, followed by real-time updates as changes occur at the publisher.

**Example: Banking systems or e-commerce sites that demand consistent and up-to-date transaction data.

2. Snapshot Replication

Copies and distributes data exactly as it exists at a specific point in time, without tracking changes.

**Drawback: Can be slower because it replicates the entire dataset each time.

3. Merge Replication

Allows both the publisher and subscriber to make changes independently.

**Example: Field data collection systems or distributed applications where clients update data locally.

4. Master-Slave Replication

In this architecture:

**Note: The master handles all write operations and the slaves receive read-only copies from the master. This model ensures data consistency and simplifies synchronization.

5. Multi-Master Replication

All servers act as masters, meaning that updates can occur at any node and changes are replicated across all other nodes.

6. Peer-to-Peer Replication

Every server acts as both a master and a slave, with data replicated in a peer-to-peer manner.

7. Single Source Replication

In this type, a single source database replicates data to multiple target databases.

Replication Schemes

1. Full Replication

Entire database is replicated at every site. Offers maximum availability and faster local access. However, it increases storage requirements and update complexity.

frame_3167

Full Replication

The above image explains that multiple users (User 1, User 2, User 3) send requests to a central server, which processes and interacts with the original database. The database data is then fully copied to a replica system, ensuring data availability and reliability through full replication.

2. No Replication

Each data item is stored at only one location in the distributed system.

3. Partial Replication

Only selected fragments of the database are replicated based on their importance or access frequency. Balances between storage efficiency and availability.

Advantages

Disadvantages

**Applications: Online Transaction Processing (OLTP), Data Warehousing and Analytics, Distributed Database Systems, Content Delivery Networks (CDNs), Cloud-based and High-Availability Systems, etc.