Dynamo (original) (raw)

The Research Study on DynamoDB – NoSQL Database Service

Reliability at massive scale is one of the biggest challenges we face as the operations in the e-commerce world keeps on expanding; even the slightest outage has significant financial consequences and impacts customer trust. The amazon.com platform provides services for many web sites worldwide. It is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components fail continuously and the persistent state is managed in the face of these failures. This drives the reliability and scalability of the software systems. DynamoDB is a fast, fully managed NoSQL database service that makes it simple and cost-effective to store and retrieve any amount of data, and serve any level of request traffic. It has guaranteed throughput and low latency which makes it a great fit for gaming, ad technology, mobile and many other applications. This paper presents the design and implementation of DynamoDB, a highly available key-value storage system that some of amazon’s core services use to provide an ―always-on‖ experience. To achieve this level of availability, DynamoDB sacrifices consistency under certain failure scenarios. It makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use. We also compare and contrast the DynamoDB services with other leading database services available for same purposes.

Cassandra -A Decentralized Structured Storage System

Cassandra is a distributed storage system for managing very large amounts of structured data spread out across many commodity servers, while providing highly available service with no single point of failure. Cassandra aims to run on top of an infrastructure of hundreds of nodes (possibly spread across different data centers). At this scale, small and large components fail continuously. The way Cassandra manages the persistent state in the face of these failures drives the reliability and scalability of the software systems relying on this service. While in many ways Cassandra resembles a database and shares many design and implementation strategies therewith, Cassandra does not support a full rela-tional data model; instead, it provides clients with a simple data model that supports dynamic control over data layout and format. Cassandra system was designed to run on cheap commodity hardware and handle high write through-put while not sacrificing read efficiency.

Big-Data NoSQL Databases: A Comparison and Analysis of "Big-Table", "DynamoDB", and "Cassandra"

The growth and enhancement of technology in the corporate society has led to data storage and confidentiality issues. The problem arises from the management of trillions of data, generated every second in corporations, precisely known as " Big Data ". Big Data needs to be stored and managed by larger companies that do not have the right storage systems, as there is not any developed yet. The aim of this paper is to find a solution to this growing problem by analyzing gaps in the literature, and to evaluate possible solutions. This study has analyzed content from top reviewed scientific publications, to gather compared and contrasted data from articles and highlight gaps. The highlighted literature will address this problems, and find solutions by contrasting BigData management approaches of NoSQL databases; BigTable, DynamoDB, and Cassandra. The findings summarized from publications are highlighted and the main features of all three databases and their applications are displayed. The system performances are analyzed based on their consistency, availability and partition intolerance. The study concluded that Google's BigTable and Amazon's DynamoDB are also critical and efficient on their own, and also found that the combination of both systems had caused the development of Cassandra. Cassandra is now the primary focus of numerous companies to develop different applications. Furthermore, all three systems are NoSQL storage systems, and BigTable, and based on one master node approach, unlike Dynamo, and Cassandra, it follows a Peer-to-Peer system. BigTable however, with some additional features from DynamoDB has helped the development of Cassandra, which is the basis of various modern applications available both open source and socially.

Amazon’s highly available key-value store

2013

Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components fail continuously and the way persistent state is managed in the face of these failures drives the reliability and scalability of the software systems. This paper presents the design and implementation of Dynamo, a highly available key-value storage system that some of Amazon’s core services use to provide an “always-on ” experience. To achieve this level of availability, Dynamo sacrifices consistency under certain failure scenarios. It makes extensive use of object versioning and application-a...