PERFORMANCE ANALYSIS OF
COMPUTING PROCESS EXECUTION IN
MULTICLOUD ENVIRONMENT
S.Srividhyalakshmi, R.Vinoth
Dhanalakshmi Srinivasan Engineering College
student.topper@gmail.com, softvin84@gmail.com
ABSTRACT:
Cloud computing is gaining acceptance in many IT organizations, as an elastic, flexible and variable-cost
way to deploy their service platforms using outsourced resources. Unlike traditional utilities where a
single provider scheme is a common practice, the ubiquitous access to cloud resources easily enables the
simultaneous use of different clouds. In this paper we explore this scenario to deploy a computing cluster
on top of a multi-cloud infrastructure, for solving Many-Task Computing (MTC) applications. In this
way, the cluster nodes can be provisioned with resources from different clouds to improve the costeffectiveness of the deployment, or to implement high-availability strategies. We prove the viability of
this kind of solutions by evaluating the scalability, performance, and cost of different configurations of a
Sun Grid Engine cluster, deployed on a multi-cloud infrastructure spanning a local data-center and three
different cloud sites: Amazon EC2 Europe, Amazon EC2 USA, and ElasticHosts. Although the testbed
deployed in this work is limited to a reduced number of computing resources (due to hardware and budget
limitations), we have complemented our analysis with a simulated infrastructure model, which includes a
larger number of resources, and runs larger problem sizes. Data obtained by simulation show that
performance and cost results can be extrapolated to large scale problems and multicloud infrastructures.
INDEX TERMS : Cloudcomputing, computing cluster, multi-cloud infrastructure, computing process.
I. INTRODUCTION
A cloud is a pool of virtualized computer
resources. A cloud can:
1) Host a variety of different workloads,
including batch-style back-end jobs and
interactive, user-facing applications.
2) Allow workloads to be deployed and scaledout quickly through the rapid provisioning of
Virtual machines or physical machines
3) Support redundant, self-recovering, highly
scalable programming models that allow
Workloads to recover from many unavoidable
hardware/software
failures
4) Monitor resource use in real time to enable
Page 49
Page 50
1. Public clouds
2. private Clouds
3. Inter-clouds or Hybrid Clouds,
say Mr.B.L.V. Rao- CIO and IT Leaders and
expert in cloud computing. At the foundation of
cloud computing is the broader concept of
infrastructure convergence (or Converged
Infrastructure) and shared services. This type of
data center environment allows enterprises to get
their applications up and running faster, with
easier manageability and less maintenance, and
enables IT to more rapidly adjust IT resources
(such as servers, storage and networking) to
meet fluctuating and unpredictable business
demand. Most cloud computing infrastructures
consist of services delivered through shared
data-centers and appearing as a single point of
access for consumers' computing needs.
Commercial offerings may be required to meet
service-level agreements (SLAs), but specific
terms are less often negotiated by smaller
companies.
III. CLOUD TECHNOLOGIES
Four Selected Clouds: Amazon EC2,
GoGrid,ElasticHosts, and Mosso We identify
three categories of cloud computing services
[19], [20]: Infrastructure-as-a-Service (IaaS),
that is, raw infrastructure and associated
middleware, Platform-as-a-Service (PaaS), that
is, APIs for developing applications on an
abstract platform, and Software-as-a-Service
(SaaS), that is, support for running software
services remotely. Many clouds already exist,
but not all provide virtualization, or even
computing services. The scientific community
has not yet started to adopt PaaS or SaaS
solutions, mainly to avoid porting legacy
applications and for lack of the needed scientific
computing services, respectively. Thus, in this
study we are focusing only on IaaS providers.
We also focus only on public clouds, that is,
clouds that are not restricted within an
enterprise; such clouds can be used by our target
audience, scientists. Based on our recent survey
Page 51
Page 52
It
may
be
established
where
organizations have similar requirements and
seek to share cloud infrastructure. Example of
community cloud is Google's Gov Cloud. Public
clouds provide services and infrastructure over
the Internet to the general public or a large
industry group and is owned by an organization
selling cloud. Major public cloud providers are
Google and Amazon. These clouds offer
thegreatest level of efficiency in shared
resources, however they are also more
vulnerable than private clouds. A Hybrid cloud
infrastructure, as the name suggests, is a
composition of private, public, and/or
community clouds possibly through multiple
providers. Reasoning for hybrid cloud
infrastructure is to increase security, better
management or failover purposes. For some it
may not be feasible to place assets in a public
cloud, therefore many opt for the value of
combining diffierent cloud deployment models.
The drawbacks of a hybrid cloud however is the
requirements of managing multiple diffierent
security platforms and communication protocols.
IV. CLOUD PERFORMANCE
EVALUATION
In this section we present a performance
evaluation of cloud computing services for
scientific computing. We design a performance
evaluation method, that allows an assessment of
clouds. To this end, we divide the evaluation
procedure into two parts, the first cloudspecific,
the second infrastructure-agnostic. Cloudspecific evaluation. An attractive promise of
clouds is that there are always unused resources,
so that they can be obtained at any time without
additional waiting time. However, the load of
other large-scale systems (grids) variesover time
due to submission patterns; we want to
investigate if large clouds can indeed bypass this
problem. Thus, we test the duration of resource
acquisition and release over short and long
periods of time. For the short-time periods one
or more instances of the same instance type are
repeatedly acquired and released during a few
minutes; the resource acquisition requests follow
a Poisson process with arrival rate = 1s. For
the long periods an instance is acquired then
Page 53
Page 54
REFERENCES
1. Amazon, Inc., Amazon Elastic
Compute
Cloud
(Amazon
EC2),http://aws.amazon.com/ec2/,
Dec. 2008.
2. GoGrid,
GoGrid
Cloud-Server
Hosting, http://www.gogrid. com, Dec.
2008.
3. A. Iosup, O.O. Sonmez, S. Anoep, and
D.H.J. Epema, The Performance of
Bags-of-Tasks
in
Large-Scale
Distributed Systems, Proc. ACM Intl
Symp. High Performance Distributed
Computing (HPDC), pp. 97-108, 2008.
4. I. Raicu, Z. Zhang, M. Wilde, I.T.
Foster, P.H. Beckman, K. Iskra, and B.
Clifford, Toward Loosely Coupled
Programming on Petascale Systems,
Proc. ACM Conf. Supercomputing (SC),
p. 22, 2008.
5. A. Iosup, C. Dumitrescu, D.H.J. Epema,
H. Li, and L. Wolters, How Are Real
Grids Used? The Analysis of Four Grid
Traces and Its Implications, Proc. IEEE
Seventh Intl Conf. Grid Computing, pp.
262-269, 2006.
6. U. Lublin, D.G. Feitelson, Workload
on Parallel Supercomputers: Modeling
Characteristics of Rigid Jobs, J.
Parallel and Distributed Computing, vol.
63, no. 11, pp. 1105-1122, 2003.
7. D.G. Feitelson, L. Rudolph, U.
Schwiegelshohn, K.C. Sevcik, and P.
Wong, Theory and Practice in Parallel
Job Scheduling, Proc. Job Scheduling
Strategies for Parallel Processing
(JSSPP), pp. 1-34,1997.
8. L. Youseff, R. Wolski, B.C. Gorda, and
C. Krintz, ,ISCN: towards a distributed
scientific computing environment Proc.
ISPA Workshops, pp. 474-486, 2006.
9. [9]
E. Deelman, G. Singh, M.
Livny, J.B. Berriman, and J. Good, The
Cost of Doing Science on the Cloud:
The
Montage
Example,
Proc.
IEEE/ACM Supercomputing (SC), p.
50, 2008.
10. [10]
M.R. Palankar, A. Iamnitchi, M.
Ripeanu, and S. Garfinkel, Amazon S3
11.
12.
13.
14.
15.
16.
17.
18.
Page 55