Manar Abourezq, Abdellah Idrissi



Database-as-a-Service for Big Data: A Review




The last two decades have been marked by exponential growth in the volume of data originating from a variety of sources, from mobile phones and social media content to the multitude of devices that make up the Internet of Things. This flow of data cannot be managed with classical approaches, and it has led to the emergence of a new buzzword: Big Data. Among the research challenges related to Big Data is the issue of data storage. Traditional relational database systems have proved unable to manage Big Data datasets efficiently. In this context, Cloud Computing plays an important role, as it offers models suited to Big Data storage, especially the model known as Database as a Service (DBaaS). In this article, we review database solutions that are offered as DBaaS and discuss how well they adapt to Big Data applications.



Cloud Computing; Big Data; Database as a Service




Cite this paper

Manar Abourezq, Abdellah Idrissi. (2017) Database-as-a-Service for Big Data: A Review. International Journal of Mathematical and Computational Methods, 2, 207-230


Copyright © 2017 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0