MAG: A performance evaluation framework for database systems (original) (raw)

NNMonitor: Performance modeling for database servers

2013 8th International Conference on Computer Engineering & Systems (ICCES), 2013

Database Management Systems (DBMSs) are the cores of most information systems. Database administrators (DBAs) face increasingly more challenges due to the systems growing complexity and must be proficient in areas, such as capacity planning, physical database design, DBMS tuning and DBMS management. Furthermore, DBAs need to implement policies for effective workload scheduling, admission control, and resource provisioning. In response to these challenges we focus our attention on the development of online DBMS performance model. We aim to meet service level agreements (SLAs) and maintain peak performance for DBMS. To this end, we propose a neural network-based performance model called NNMonitor that can predict the performance metrics of DBMS online and determines if the DBMS needs to tune or not before entering into a complex tuning process. We make use of neural networks to build our proposed model taking into account the interaction among concurrently executing queries and predict throughput. The experimental evaluation demonstrates that this model is capable of predicting the performance metrics of real database servers with high accuracy.

Performance tuning of database systems using a context-aware approach

2014 9th International Conference on Computer Engineering & Systems (ICCES), 2014

Database system performance problems have a cascading effect into all aspects of an enterprise application. Database vendors and application developers provide guidelines, best practices and even initial database settings for good performance. But database performance tuning is not a one-off task. Database administrators have to keep a constant eye on the database performance as the tuning work carried out earlier could be invalidated due to multitude of reasons. Before engaging in a performance tuning endeavor, a database administrator must prioritize which tuning tasks to carry out first. This prioritization is done based on which tuning action would yield highest performance benefit. However, this prediction may not always be accurate. Experiment-based performance tuning methodologies have been introduced as an alternative to prediction-based performance tuning approaches. Experimenting on a representative system similar to the production one allows a database administrator to accurately gauge the performance gain for a particular tuning task. In this paper we propose a novel approach to experiment-based performance tuning with the use of a context-aware application model. Using a proof-of-concept implementation we show how it could be used to automate the detection of performance changes, experiment creation and evaluate the performance tuning outcomes for mixed workload types through database configuration parameter changes.

Utilizing deep learning for automated tuning of database management systems

arXiv (Cornell University), 2023

Managing the configurations of a database system poses significant challenges due to the multitude of configuration knobs that impact various system aspects. The lack of standardization, independence, and universality among these knobs further complicates the task of determining the optimal settings. To address this issue, an automated solution leveraging supervised and unsupervised machine learning techniques was developed. This solution aims to identify influential knobs, analyze previously unseen workloads, and provide recommendations for knob settings. The effectiveness of this approach is demonstrated through the evaluation of a new tool called OtterTune [1] on three different database management systems (DBMSs). The results indicate that OtterTune's recommendations are comparable to or even surpass the configurations generated by existing tools or human experts. In this study, we build upon the automated technique introduced in the original OtterTune paper, utilizing previously collected training data to optimize new DBMS deployments. By employing supervised and unsupervised machine learning methods, we focus on improving latency prediction. Our approach expands upon the methods proposed in the original paper by incorporating GMM clustering to streamline metrics selection and combining ensemble models (such as RandomForest) with non-linear models (like neural networks) for more accurate prediction modeling.

Automatic Diagnosis of Performance Problems in Database Management Systems

Database performance is directly linked to the allocation of the resources used by the Database Management System (DBMS). The complex relationships between numerous DBMS resources make problem diagnosis and performance tuning complex and timeconsuming tasks. Costly Database Administrators (DBAs) are currently needed to initially tune a DBMS for performance and then to retune the DBMS as the database grows and workloads change. Automatic diagnosis and resource management removes the need for DBAs, greatly reducing the cost of ownership for the DBMS. An automated system also allows the DBMS to respond more quickly to changes in the workload as performance can be monitored 24 hours a day. An automated diagnosis and resource management system allows the DBMS to improve performance for both static and dynamic workloads.

A Performance Prediction Model for Database Environments: A Preliminary Analysis

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems, 2015

Properly addressing the performance issues presented in database systems is and has been a significant technological challenge, this due to the uncontrolled fluctuation of user requests. Being able to predict the behaviour of such systems can greatly improve their performance. Several prediction methods, such as linear regression and autoregressive moving average, among others, have extensively been used to predict performance in shared environments where a workload is involved. However, not all them produce accurate predictions when the system is working under different workloads. In this paper, we present our preliminary results on exploring the accuracy of two different approaches (exact and approximate methods) used to predict the response time of a database system subject to different workloads in a controlled environment. Our results show that approximate methods present better prediction accuracy when compared to exact methods. Hence, we consider the main contributions of this work the following: a) the results obtained from comparing exact and approximate methods, since they can be used as a basis for further works addressing similar problems, and b) a preliminary prediction model also based on our findings.

Automatic diagnosis of performance problems in database management systems [microform]

Database performance is directly linked to the allocation of the resources used by the Database Management System (DBMS). The complex relationships between numerous DBMS resources make problem diagnosis and performance tuning complex and timeconsuming tasks. Costly Database Administrators (DBAs) are currently needed to initially tune a DBMS for performance and then to retune the DBMS as the database grows and workloads change. Automatic diagnosis and resource management removes the need for DBAs, greatly reducing the cost of ownership for the DBMS. An automated system also allows the DBMS to respond more quickly to changes in the workload as performance can be monitored 24 hours a day. An automated diagnosis and resource management system allows the DBMS to improve performance for both static and dynamic workloads.

A new approach to dynamic self-tuning of database buffers

ACM Transactions on Storage, 2008

Current businesses rely heavily on efficient access to their databases. Manual tuning of these database systems by performance experts is increasingly infeasible: For small companies, hiring an expert may be too expensive; for large enterprises, even an expert may not fully understand the interaction between a large system and its multiple changing workloads. This trend has led major vendors to offer tools that automatically and dynamically tune a database system. Many database tuning knobs concern the buffer pool for caching data and disk pages. Specifically, these knobs control the buffer allocation and thus the cache miss probability, which has direct impact on performance. Previous methods for automatic buffer tuning are based on simulation, black-box control, gradient descent, and empirical equations. This article presents a new approach, using calculations with an analytically-derived equation that relates miss probability to buffer allocation; this equation fits four buffer r...

A Session-based Approach to Autonomous Database Tuning

Acta Polytechnica Hungarica

By using autonomous tuning tools to optimize database systems, a lot of timeconsuming, manual work can be automated. However, self-tuning database systems are trying to optimize global metrics of efficiency, they may set back rare, but critical functions of applications that use the database. The priority of application functions cannot be expressed in existing solutions, therefore, another approach may be needed. In this paper, a session-based method is presented, where application functions are represented as sessions, by building and using language models based on previous observations. With this technique, a similarity measure can also be defined, to interpret minor differences between sessions caused by program logic, as similarity. If usage patterns appear on user level as well, it is reasonable to construct user groups along similar behavior, to utilize such patterns. As the most significant part of an autonomous solution is forecasting, a method is also presented to predict future workload characteristics, by identified user groups. Then, this approach has been evaluated in practice, mainly to determine the optimal corpus size and validate session recognition.

Study of Performance Tuning Techniques

Journal of emerging technologies and innovative research, 2015

In the modern era, digital data is considered as the more valuable asset of an organization, and the organizations assign more significance to it than the software and hardware assets. Database systems are computer-based record keeping systems, which have been developed to store data for efficient retrieval and processing. Since data is produced and shared every day, data volumes could be large enough for the database performance to become an issue. In order to maintain database performance, identification and diagnosis of the root causes that may cause delayed queries is done. Poor query design can be one of the major causes of delayed queries. There are various methods available to deal with the performance issues. Database administrator decides the method or combination of methods that work best. In this paper, we present some performance tuning techniques such as SQL Tuning, Indexing and Table Partitioning along with their advantages and limitations.

Using reflection to introduce self-tuning technology into DBMSs

Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04., 2004

The increasing complexity of database management systems (DBMSs) and their workloads means that manually managing their performance has become a difficult and time-consuming task. Autonomic computing systems have emerged as a promising approach to dealing with this complexity. Current DBMSs have begun to move in the direction of autonomic computing with the introduction of parameters that can be dynamically adjusted. A logical next step is the introduction of self-tuning technology to diagnose performance problems and to select the dynamic parameters that must be adjusted. We introduce a method for automatically diagnosing performance problems in DBMSs and then describe how this method can be incorporated into current DBMSs using the concept of reflection. We demonstrate the feasibility of our approach with a proof-of-concept implementation for DB2 Universal Database.