09 Simd

Classification of Parallel Computer Architectures
SIMD

Criteria
- Organization of the control and data flow
- Address space organization
- Use of physical memory

Why Parallel Computing?

Although the speed of computers is steadily increasing, new applications require even higher speed. The performance of serial computers is beginning to saturate. A natural way to solve the problem is to use an ensemble of processors: parallel computers.

Main Purposes for Parallel Computing
- Faster solutions
- Solution of large problems

A parallel computer is simply a collection of processors, interconnected in a certain fashion to allow the coordination of their activities and the exchange of data. Parallel computers need parallel algorithms, i.e. algorithms suitable for implementation on parallel computers.

Flynn's Classes of Computer Architectures
- SI: Single Instruction Stream
- MI: Multiple Instruction Stream
- SD: Single Data Stream
- MD: Multiple Data Stream

SISD: Single Instruction Stream, Single Data Stream
- Monoprocessor
- Princeton (von Neumann) architecture: common instruction and data stream
- Harvard architecture: separate instruction and data streams

SIMD: Single Instruction Stream, Multiple Data Stream
- Array processors, vector computers

MIMD: Multiple Instruction Stream, Multiple Data Stream

[Figure: block diagrams of CPUs and memories for the classes]
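
The naming scheme above combines one instruction-stream prefix with one data-stream suffix. A minimal sketch of that combination follows; the function name `flynn_class` and its boolean parameters are illustrative assumptions, not part of the slides. The fourth combination (MISD) falls out of the scheme even though the slides do not list it.

```python
def flynn_class(multiple_instruction_streams: bool, multiple_data_streams: bool) -> str:
    """Build the Flynn class name from the two stream properties."""
    instruction_part = "MI" if multiple_instruction_streams else "SI"
    data_part = "MD" if multiple_data_streams else "SD"
    return instruction_part + data_part


if __name__ == "__main__":
    print(flynn_class(False, False))  # monoprocessor
    print(flynn_class(False, True))   # array processor or vector computer
    print(flynn_class(True, True))    # multiple instruction and data streams
```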
Degree of Parallelism
- Moderately parallel computers: SMP (Symmetric Multiprocessor)
- Massively parallel processor: MPP
  - Huge number of processing nodes
  - Huge switching network
  - Single computing resource for a single job
Symmetric Multiprocessor
In a (symmetric) multiprocessor system, all processors can access a common main memory. This allows them to exchange data. However, the processors have only one shared data path to main memory, e.g. a common bus.

Non-symmetric Multiprocessor
- Main processors and input/output processors

Global Address Space, Distributed Physical Memory
[Figure: processor/memory pairs (P, M) attached to an interconnection network (also called interconnect); the main memory forms one address space]

Local Address Spaces
[Figure: separate address spaces, with memories (M) attached to an interconnection network]
- Distributed and private physical memory

Parallel Architectures
- Control / data flow: centralized or distributed (Flynn)
- Address space: local or global
- Physical memory: private or shared
- Type of interconnection network

Classification of Parallel Architectures

Five Dimensions
Control: centralized or distributed
Data Flow: sequential or parallel
Address Space: local or global
Physical Memory: central or distributed
Interconnect: static or dynamic

SISD: von Neumann

Five Dimensions
Control: centralized
Data Flow: sequential
Address Space: global
Physical Memory: central
Interconnect: static
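
The five dimensions can be written down as a small record type, which makes it easy to check that a description only uses the allowed values. This is a sketch under stated assumptions: the class name `Architecture`, the table `ALLOWED_VALUES`, and the lowercase field names are hypothetical; only the dimension values and the SISD entry come from the slides.

```python
from dataclasses import dataclass

# Allowed values per dimension, as listed on the slide.
ALLOWED_VALUES = {
    "control": {"centralized", "distributed"},
    "data_flow": {"sequential", "parallel"},
    "address_space": {"local", "global"},
    "physical_memory": {"central", "distributed"},
    "interconnect": {"static", "dynamic"},
}


@dataclass(frozen=True)
class Architecture:
    name: str
    control: str
    data_flow: str
    address_space: str
    physical_memory: str
    interconnect: str

    def __post_init__(self):
        # Reject any value that is not part of the classification scheme.
        for dimension, allowed in ALLOWED_VALUES.items():
            value = getattr(self, dimension)
            if value not in allowed:
                raise ValueError(f"{dimension}={value!r} is not one of {sorted(allowed)}")


# The SISD (von Neumann) entry from the slide.
SISD_VON_NEUMANN = Architecture(
    name="SISD: von Neumann",
    control="centralized",
    data_flow="sequential",
    address_space="global",
    physical_memory="central",
    interconnect="static",
)
```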

Parallel Architectures
SIMD: Vector Computers, Array Processors

Control: centralized
Data Flow: parallel
Address Space: global
Physical Memory: central or distributed
Interconnect: static

Vector Computers
- Cray-1
- Cray-2
- Cray C-90
- NEC
- Fujitsu
They all have different architectures.

Vector Computer (vector processor)
- Powerful CPUs
- Large memory
- Several vector pipelines

Example: Vector Operation
[Figure: vector data streams D1 ... Dn+1 flowing through a VADD operation, steps 1 to m]
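
A VADD applies one instruction to whole vectors rather than to single operands. The sketch below illustrates this in plain Python, assuming a fixed vector register length and processing longer vectors register by register. The register length of 64 and the names `vadd` and `vector_add` are assumptions for illustration, not details from the slides.

```python
VECTOR_REGISTER_LENGTH = 64  # assumed register length, for illustration only


def vadd(register_a, register_b):
    """One vector instruction: element-wise add of two vector registers."""
    if len(register_a) != len(register_b):
        raise ValueError("vector registers must have the same length")
    return [a + b for a, b in zip(register_a, register_b)]


def vector_add(x, y, register_length=VECTOR_REGISTER_LENGTH):
    """Add two long vectors by processing one register-sized chunk at a time."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    result = []
    for start in range(0, len(x), register_length):
        register_a = x[start:start + register_length]  # vector load
        register_b = y[start:start + register_length]  # vector load
        result.extend(vadd(register_a, register_b))    # VADD, then vector store
    return result
```

With a register length of 64, a 130-element addition issues three VADD instructions instead of 130 scalar additions.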

Vector Pipelining
[Figure: vector unit with main memory, load pipeline, vector registers, two operation pipelines, and store pipeline]

Vector Processor
- MSU: memory
- MCU: memory control unit
- DTU: data transfer unit
- Arithmetic pipelines
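
Pipelining pays off because successive vector elements occupy different stages at the same time. The following is a rough cycle-count model, assuming one element enters the chain per cycle and every stage takes exactly one cycle; the stage count and the function names are illustrative assumptions, not figures from the slides.

```python
def pipeline_cycles(num_elements, num_stages):
    """Simulate the pipeline cycle by cycle and return the total cycle count."""
    if num_elements < 0 or num_stages <= 0:
        raise ValueError("need num_elements >= 0 and num_stages > 0")
    stages = [None] * num_stages
    issued = 0
    completed = 0
    cycles = 0
    while completed < num_elements:
        cycles += 1
        # Every element moves one stage forward; a new one enters stage 0.
        incoming = issued if issued < num_elements else None
        if incoming is not None:
            issued += 1
        stages = [incoming] + stages[:-1]
        # The element now in the last stage finishes at the end of this cycle.
        if stages[-1] is not None:
            completed += 1
    return cycles


def unpipelined_cycles(num_elements, num_stages):
    """Cycle count if each element had to pass all stages before the next starts."""
    return num_elements * num_stages
```

Under these assumptions, 64 elements through a 3-stage chain (load, operation, store) take 66 cycles pipelined versus 192 cycles unpipelined, which matches the closed form `num_stages + num_elements - 1`.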

Hardware Example

Example: MSIMD Vector Computer
- VC: Vector Computer
- SH: Synchronisation Hardware

Performance of Vector Computers (MSIMD)

Global Address Space
Global or shared address space: processors interact by modifying data in a shared address space. Hardware support exists for coordinating read and write accesses by the processors.
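
The interaction pattern described for a shared address space can be imitated with threads. In this sketch, threads stand in for processors, a dictionary stands in for the shared memory, and a `threading.Lock` stands in for the hardware support that coordinates accesses. All of these are software analogies chosen for illustration, not how the hardware on the slides works.

```python
import threading


def shared_counter_demo(num_processors=4, increments_per_processor=1000):
    """Let several 'processors' update one location in a shared address space."""
    shared_memory = {"counter": 0}   # visible to every thread
    access_lock = threading.Lock()   # coordinates read-modify-write accesses

    def processor():
        for _ in range(increments_per_processor):
            with access_lock:
                shared_memory["counter"] += 1

    threads = [threading.Thread(target=processor) for _ in range(num_processors)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return shared_memory["counter"]
```

Without the lock, the read-modify-write sequences of different threads could interleave and updates could be lost, which is why some form of coordination is needed.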

Array Processors
- Central program control
- Massively parallel: very many processing units (PU), large memory, powerful I/O
[Figure: control unit (CU) driving the processing units (PU)]

Simulating Ocean Currents
[Figure: (a) cross sections; (b) spatial discretization of a cross section]
- Model as two-dimensional grids
- Discretize in space and time: finer spatial and temporal resolution => greater accuracy
- Many different computations per time step: set up and solve equations
- Concurrency across and within grid computations
- Static and regular
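
To make "concurrency within grid computations" concrete, the sketch below performs one sweep over a two-dimensional grid in which every interior point is replaced by the average of its four neighbours. The averaging rule is an assumed stand-in for the actual ocean equations, which the slides do not give; the function name `relaxation_sweep` is likewise hypothetical.

```python
def relaxation_sweep(grid):
    """Return a new grid where each interior point is the mean of its 4 neighbours.

    Boundary points are copied unchanged. Every interior update reads only
    the old grid, so all updates of one sweep are independent of each other.
    """
    rows, cols = len(grid), len(grid[0])
    new_grid = [row[:] for row in grid]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            new_grid[i][j] = 0.25 * (
                grid[i - 1][j] + grid[i + 1][j] + grid[i][j - 1] + grid[i][j + 1]
            )
    return new_grid
```

Because the access pattern is the same for every point and every time step, the work is static and regular, and separate cross sections can be swept independently as well.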
Array Processors
- Centralized control: single control unit
- Single Instruction Stream, Multiple Data Stream (SIMD)
- The control unit dispatches instructions to each processing unit.
- Processing units can be selectively switched out.
- Primarily for data-parallel programs

Overall Architecture of a Processor Array
- Array control: execution of serial instructions
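
The two key behaviours of an array processor, broadcasting one instruction to all processing units and switching some units out, can be sketched as follows. The class `ProcessorArray` and its methods are hypothetical names for illustration; a real machine executes the broadcast in lockstep hardware rather than in a Python loop.

```python
class ProcessorArray:
    """Toy model: one control unit, many processing units with local data."""

    def __init__(self, local_values):
        self.local_values = list(local_values)          # one value per processing unit
        self.enabled = [True] * len(self.local_values)  # activity flag per unit

    def set_mask(self, predicate):
        """Switch out every unit whose local value fails the predicate."""
        self.enabled = [predicate(value) for value in self.local_values]

    def enable_all(self):
        """Switch all units back in."""
        self.enabled = [True] * len(self.local_values)

    def broadcast(self, instruction):
        """Apply the same instruction on every enabled unit; others keep their value."""
        self.local_values = [
            instruction(value) if active else value
            for value, active in zip(self.local_values, self.enabled)
        ]
```

For example, an absolute-value computation becomes "mask on negative values, then broadcast a negation": units holding non-negative values are switched out and sit idle for that instruction.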

SIMD: Array Processors
Example: MasPar Control Unit

MasPar: Interconnection Networks
[Figure: control unit (CU) and a grid of processing elements (PE)]

Some SIMD-Array Processors

Connection Machine CM-5
- Repackaged SparcStation, 4 per board
- Fat-tree network
- Control network for global synchronization

CM-5 Machine Organization
[Figure: diagnostics network, control network, and data network connecting the processing partitions, control processors, and I/O partition. Each node: SPARC with FPU, cache controller ($ ctrl) and cache SRAM ($ SRAM), MBUS, network interface (NI) to the data networks and control network, and vector units with DRAM controllers and DRAM]

Evolution
- SIMD was popular when the cost savings of a centralized sequencer were high
  - 1960s, when a CPU was a cabinet
  - Replaced by vectors in the mid-70s: more flexible w.r.t. memory layout and easier to manage
  - Revived in the mid-80s when 32-bit datapath slices just fit on a chip
- Simple, regular applications have good locality
- The programming model needs fast global synchronization and a structured global address space
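
Global synchronization separates program phases: no node may read a neighbour's result before every node has finished writing. A minimal sketch using `threading.Barrier` as a software stand-in for a hardware control network follows; the function name `neighbor_exchange` and the doubling step are illustrative assumptions.

```python
import threading


def neighbor_exchange(values):
    """Each node publishes a result, waits at a barrier, then reads its left neighbour."""
    count = len(values)
    if count == 0:
        return []
    published = [None] * count   # globally addressable result slots
    received = [None] * count
    barrier = threading.Barrier(count)

    def node(rank):
        published[rank] = values[rank] * 2             # phase 1: compute and publish
        barrier.wait()                                 # global synchronization point
        received[rank] = published[(rank - 1) % count]  # phase 2: read neighbour

    threads = [threading.Thread(target=node, args=(rank,)) for rank in range(count)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return received
```

The barrier guarantees that phase 2 never observes an unwritten slot, regardless of how the threads are scheduled.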
