DYNAMIC RESOURCE ALLOCATION IN THE CLOUD COMPUTING USING NEPHELE'S ARCHITECTURE

Koti Reddy S1, Ch. Subba Rao2
1 M.Tech, Dept. of C.S.E, Quba College of Engineering, Venkatachalam, Nellore, A.P, India
2 Associate Professor, Dept. of C.S.E, Quba College of Engineering, Venkatachalam, Nellore, A.P, India

Abstract
In recent years, ad-hoc parallel data processing has emerged as one of the most important applications for Infrastructure-as-a-Service (IaaS). Major cloud computing companies have started to integrate frameworks for parallel data processing into their product portfolios, making it easy for customers to access these services and to deploy their programs. However, the processing frameworks currently in use were designed for static, homogeneous cluster setups and disregard the particular nature of a cloud. Consequently, the allocated compute resources may be inadequate for large parts of the submitted job and may unnecessarily increase processing time and cost. Nephele's architecture offers efficient parallel data processing in clouds. It is the first data processing framework to exploit the dynamic resource allocation offered by today's IaaS clouds for both task scheduling and execution. Particular tasks of a processing job can be assigned to different types of virtual machines, which are automatically instantiated and terminated during the job execution.

Index Terms: Cloud Computing, Parallel Data Processing, Dynamic resource allocation

1. INTRODUCTION
Cloud computing is the delivery of computing and storage capacity as a service to a community of end-recipients. The name comes from the use of a cloud-shaped symbol in system diagrams as an abstraction for the complex infrastructure it contains. Cloud computing entrusts services with a user's data, software and computation over a network.
There are three types of cloud computing: Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS). With Software as a Service, users rent application software and databases, and the cloud providers manage the infrastructure and platforms on which the applications run. End users access cloud-based applications through a web browser or a lightweight desktop or mobile app, while the business software and the user's data are stored on servers at a remote location. Proponents claim that cloud computing allows enterprises to get their applications up and running faster, with improved manageability and less maintenance, and enables IT to adjust resources more rapidly to meet fluctuating and unpredictable business demand.

Once a user has fit a program into the required map and reduce pattern, the execution framework takes care of splitting the job into subtasks, distributing them, and executing them. A single MapReduce job always consists of a distinct map program and a distinct reduce program.

Fig-2: Cloud computing layers

1.1 Problem Definition
Efficient parallel data processing is achieved with Nephele's framework, which exploits the dynamic resource allocation offered by today's IaaS clouds for both task scheduling and execution. Particular tasks of a processing job can be assigned to different types of virtual machines, which are automatically instantiated and terminated during the job execution. Based on this, we perform extended evaluations of MapReduce-inspired processing jobs on an IaaS cloud system. The processing framework then takes care of distributing the program among the available nodes and executes each instance of the program on the appropriate fragment of data. Most notably, Nephele is the first data processing framework to include the possibility of dynamically allocating and deallocating different compute resources from a cloud in its scheduling and during job execution.
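The idea of assigning tasks to different virtual machine types that are started and stopped during the job can be illustrated with a small simulation. This is only a minimal sketch and not Nephele's actual interface: the names Task, SimulatedCloud, and run_job, the instance type labels "small" and "large", and the stage-by-stage execution order are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str
    instance_type: str  # hypothetical VM type label, e.g. "small" or "large"
    stage: int          # tasks in the same stage are started together


@dataclass
class SimulatedCloud:
    """Stand-in for an IaaS provider; it only records allocation events."""
    running: dict = field(default_factory=dict)  # instance type -> live count
    log: list = field(default_factory=list)      # ("start" | "stop", type)

    def allocate(self, instance_type):
        self.running[instance_type] = self.running.get(instance_type, 0) + 1
        self.log.append(("start", instance_type))

    def release(self, instance_type):
        self.running[instance_type] -= 1
        if self.running[instance_type] == 0:
            del self.running[instance_type]
        self.log.append(("stop", instance_type))


def run_job(tasks, cloud):
    """Run tasks stage by stage, holding a VM only while its task executes."""
    finished = []
    for stage in sorted({t.stage for t in tasks}):
        stage_tasks = [t for t in tasks if t.stage == stage]
        for task in stage_tasks:
            cloud.allocate(task.instance_type)  # instantiate a VM on demand
        for task in stage_tasks:
            finished.append(task.name)          # placeholder for real work
            cloud.release(task.instance_type)   # terminate the VM when done
    return finished


job = [
    Task("read_input", "small", stage=0),
    Task("sort_a", "large", stage=1),
    Task("sort_b", "large", stage=1),
    Task("write_output", "small", stage=2),
]
cloud = SimulatedCloud()
order = run_job(job, cloud)
print(order)
print(cloud.log)
```

In this sketch the two "large" instances exist only while the middle stage runs, so the job pays for the larger machines only during the part of the job that needs them. A static cluster would instead keep every machine allocated for the whole job.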
1.2 Disadvantages of Existing System
The disadvantages of existing systems are as follows:
Expensive
Complex
Increases database organization

1.4 Advantages of Proposed System
The advantages of the proposed system are as follows:
Dynamic resource allocation
Parallelism is implemented
Designed to run data analysis jobs on a large amount of data
Many Task Computing (MTC) has been developed
Less expensive
More effective
Faster

Language : ASP.Net, C#
Web Programming : HTML, CSS
Database : SQL Server 2008
Tools : Visual Studio 2010

3.3 Hardware Requirements
Cloud Computing : Cloud Provider
Processor : 8 x 1.6 GHz CPU
Hard Disk : 320 GB
RAM : 14 GB

Fig-3: Nephele's Architecture

4.2 Job Graph
Defining a Nephele job comprises the following mandatory steps:
The user must connect to a virtual machine and start the task
The task program must be assigned to a vertex
Finally, the vertices must be connected by edges to define the communication paths of the job

Fig-4: An example of Job Graph in Nephele (vertices: INPUT-1, TASK-1, OUTPUT-1)

An Execution Graph is Nephele's primary data structure for scheduling and monitoring the execution of a Nephele job. The Execution Graph contains all the information required to schedule and execute the received job on the cloud. It explicitly models task parallelization and the mapping of tasks to instances. Depending on the level of annotations the user has provided with the Job Graph, Nephele may have different degrees of freedom in constructing the Execution Graph. For example, Task 1 is split into two parallel subtasks, which are both connected to the task Output 1 via file channels and are all scheduled to run on the same instance. The exact structure of the Execution Graph is explained in the following:

Volume-2, Issue-4, 1144-1151

Objectives
1. Input Design is the process of converting a user-oriented description of the input into a computer-based system.
This design is important for avoiding errors in the data input process and for showing management the correct way to obtain accurate information from the computerized system.

Fig-5: Execution Graph created from the original Job Graph

4.4 Output Design
A quality output is one that meets the requirements of the end user and presents the information clearly. In any system, the results of processing are communicated to the users and to other systems through outputs. Output design determines how the information is to be displayed for immediate need, as well as the hard copy output. Output is the most important and direct source of information for the user. Efficient and intelligent output design improves the system's relationship with the user and helps user decision-making.

RESULT ANALYSIS
The conclusion is that clients can connect to the virtual machines of the cloud and perform their business tasks in parallel through the dynamic resource allocation mechanism of Nephele's architecture.

IJESAT | Jul-Aug 2012 | Available online @ http://www.ijesat.org