Computing
Infrastructure
Daniel J. Harvey
Acknowledgements
NASA Ames Research Center
Sunnyvale, California
Presentation Overview
What is grid computing?
Historical preliminaries
Grid potential
Technological challenges
Current developments
Future outlook
Personal research
What is Grid Computing?
The grid is a hardware/software infrastructure
that enables heterogeneous geographically
separated clusters of processors to be connected
in a virtual environment.
Heterogeneous: The computer configurations are
highly varied.
Clusters: Generally high performance computer
configurations consisting of tens or hundreds of
processors that interact through interconnections.
Virtual: The users operate as if the entire grid was
a single integrated system.
Motivation for Grids
The need
Solutions to many large-scale computational problems are not
feasible on existing supercomputer systems.
Expensive equipment are not always available locally.
There is a growing demand for applications such as virtual
reality, data mining, and remote collaboration
Possible solutions
Many very powerful systems are not fully utilized.
Pooling resources is cost effective.
Communication technology is progressing rapidly.
The internet has shown global interconnections to be effective
Historical Preliminaries
Existing infrastructures in society
Power grid
Transportation system
Railroad system
Requirements
Universal access
Standardized use
Dependable
Cost Effective
Early Testbed Project
Corporation for National Research Initiatives
Purpose
Connect the individual testbeds
Exploit available resources
NSF, DOE, NASA networking infrastructure
Supercomputers at dozens of laboratories
Broadband national networks
High-end graphics workstations
Virtual Reality (VR) and collaborative environments
Demonstrate innovative approaches to scientific
applications
Examples of I-Way
Demonstrations
N-body galaxy simulation
Coupled Cray, SGI, Thinking Machines, IBM
Virtual reality results displayed in the CAVE at
Supercomputing 95.
Cave is a room-sized virtual environment.
Industrial emission control system
Coupled Chicago supercomputer with Cave in San
Diego and Washington, D.C.
Teleimmersion, colaborative environment
Grid Potential
Applications identified by I-Way
DataGrid Testbed
Fundamental Particle Research
Remote instrumentation
DataGrid Testbed
Genome Research
Data mining, code management, remote GUI interfaces
DataGrid Testbed
Teleimmersive Applications
Object Oriented
Each object defines the rules for access
Well defined object interfaces
A core set of objects provide basic services
Users can define and create their own objects
Users and executing objects manipulate remote objects
High Performance
Users select hosts based on load and job affinity
Object wrapping for support for parallel programming.
User Autonomy
Users choose scheduling policies and security arrangements.
Globus
An open-architecture integrated “bag” of basic grid services
http://www. globus.org
Downloadable software
Detailed description of Globus
Personal Web Site (http://www. sou.edu/cs/
harvey)