Query processing techniques in the summary-table-by-example database query language
Abstract
Summary-Table-by-Example (STBE) is a graphical language suitable for statistical database applications. STBE queries have a hierarchical subquery structure and manipulate summary tables and relations with set-valued attributes. The hierarchical arrangement of STBE queries naturally implies a tuple-by-tuple subquery evaluation strategy (similar to the nested loops join implementation technique) which may not be the best query processing strategy. In this paper we discuss the query processing techniques used in STBE. We first convert an STBE query into an “extended” relational algebra (ERA) expression. Two transformations are introduced to remove the hierarchical arrangement of subqueries so that query optimization is possible. To solve the “empty partition” problem of aggregate function evaluation, directional join (one-sided outer-join) is utilized. We give the algebraic properties of the ERA operators to obtain an “improved” ERA expression. Finally we briefly discuss the generation...
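The "empty partition" problem mentioned above can be illustrated with a minimal sketch. It assumes two hypothetical toy relations (`departments` and `employees`) and a simple count aggregate; none of the names or data come from the paper, and the code does not reproduce the paper's ERA operators or transformations. It only shows why replacing tuple-by-tuple subquery evaluation with an ordinary join can drop groups that have no matching tuples, and how a one-sided outer join keeps them.

```python
from collections import defaultdict

# Hypothetical toy relations for illustration only (not from the paper).
departments = ["sales", "research", "legal"]
employees = [("ann", "sales"), ("bob", "sales"), ("cy", "research")]


def nested_counts(depts, emps):
    """Tuple-by-tuple evaluation: re-run the counting subquery per outer tuple."""
    return {d: sum(1 for _, dept in emps if dept == d) for d in depts}


def inner_join_counts(depts, emps):
    """Unnested form: ordinary join, then group and count.

    A department with no employees produces no joined tuple,
    so its (empty) partition disappears from the result.
    """
    counts = defaultdict(int)
    for d in depts:
        for _, dept in emps:
            if dept == d:
                counts[d] += 1
    return dict(counts)


def directional_join_counts(depts, emps):
    """Unnested form: one-sided outer join, then group and count.

    Unmatched left-hand tuples are kept and padded with None,
    and the count ignores the padding.
    """
    joined = []
    for d in depts:
        matches = [name for name, dept in emps if dept == d]
        if matches:
            joined.extend((d, name) for name in matches)
        else:
            joined.append((d, None))  # preserve the empty partition
    counts = defaultdict(int)
    for d, name in joined:
        counts[d] += 0 if name is None else 1
    return dict(counts)


if __name__ == "__main__":
    print(nested_counts(departments, employees))
    print(inner_join_counts(departments, employees))
    print(directional_join_counts(departments, employees))
```

In this sketch the ordinary join omits `legal` entirely, whereas the nested evaluation and the one-sided outer join both report it with a count of zero. The loops here are written for clarity; the point is the logical result, not the join implementation.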