Scalable strong consistency for web applications
Abstract
Web application workloads are often characterized by a large number of unique read requests and a significant fraction of write requests. Hosting these applications calls for a next-generation CDN architecture that does more than cache the results of Web applications: it must replicate both the application code and its underlying data. We propose the design of a system that guarantees strong consistency for Web applications while remaining highly scalable. The system is based on partial replication, in which each data unit is replicated only to the servers that access it often. This reduces the consistency overhead, because updates are sent to fewer servers. The novelty of our system is that it performs this partial replication automatically, by periodically analyzing the system's access patterns. We explore the design space of this system, identify the key issues that must be addressed to build it, and propose solutions to them. We further show that the proposed algorithms offer significant performance gains over existing solutions for a wide range of Web access patterns.
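The idea of access-driven partial replication can be illustrated with a minimal sketch. It is not the paper's algorithm: the function names, the `min_share` threshold rule, and the sample log are all assumptions made for illustration. The sketch takes the accesses observed during one analysis period, places a replica of each data unit only on servers that issued a sufficient share of that unit's accesses, and reports how many servers an update to that unit would then have to reach.

```python
from collections import defaultdict


def place_replicas(access_log, min_share=0.2):
    """Map each data unit to the servers that accessed it often.

    access_log: iterable of (server, data_unit) pairs from one period.
    min_share:  hypothetical threshold; a server receives a replica when
                it issued at least this fraction of the unit's accesses.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for server, unit in access_log:
        counts[unit][server] += 1

    placement = {}
    for unit, per_server in counts.items():
        total = sum(per_server.values())
        placement[unit] = {
            server for server, n in per_server.items() if n / total >= min_share
        }
    return placement


def update_fanout(placement, unit):
    """Number of servers that must receive an update to `unit`."""
    return len(placement.get(unit, ()))


# Illustrative log (assumed data): "cart" is accessed mostly by s1,
# while "catalog" is accessed equally by s2 and s3.
log = (
    [("s1", "cart")] * 8
    + [("s2", "cart"), ("s3", "cart")]
    + [("s2", "catalog")] * 5
    + [("s3", "catalog")] * 5
)
placement = place_replicas(log)
```

With this sample log, an update to `cart` reaches one server instead of the three that full replication would require, which is the reduction in consistency overhead the abstract describes. Rerunning `place_replicas` on each new period's log corresponds to the periodic analysis of access patterns.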