Anda di halaman 1dari 12

BEGINNERS GUIDE - ARCHITECTURE

Before beginning.. Node A hardware assembly containing several tightly coupled central processing units (CPUs).

Before beginning contd.. SMP Symmetric Multi Processing An SMP Teradata Database has a single node that contains multiple CPUs sharing a memory pool. MPP Massively Parallel Processing Multiple SMP nodes working together comprise a larger, MPP implementation of a Teradata Database. The nodes are connected using the BYNET, which allows multiple virtual processors on multiple nodes to communicate with each other.

Before beginning contd..

LOGICAL ARCHITECTURE

PARSING ENGINE

BYNET

AMP

AMP

AMP

AMP

AMP

AMP

DISK

DISK

DISK

DISK

DISK

DISK

Contd..

COMPONENTS IN DETAIL
PARSING ENGINE A Parsing Engine (PE) is a virtual processor (vproc). It is made up of the following software components: Session Control, the Parser, the Optimizer, and the Dispatcher.

PE contd..

PARSING ENGINE
Session Control Logon and Logoff Parser Interprets SQL statements and check syntax. Consults data dictionary to ensure that all objects exist. Also checks the access rights for users. Optimizer

Develops least expensive plan which are converted to executable steps. To maximize throughput and reduce resource contention, optimizer should know system configuration, available units of parallelism & data demographics. Teradata optimizer is robust and intelligent. Parallel aware and cost-based using full look-ahead capability. PE contd..

PARSING ENGINE
Dispatcher Controls the sequence in which steps are executed and passes the steps to BYNET. Composed of two tasks- execution control and response control. Makes sure that all AMPs have finished a step before the next step is dispatched. Depending on nature of SQL requests, a step will be sent to one AMP or all AMPs. Execution control Receives the step definitions from Parser. Transmits them to appropriate AMPs for processing. Receives status report from AMPs as they process the requests. Passes the results to response control one AMPs have completed the processing. Response Control Returns the result to the user.

BYNET
Dual-redundant, fault tolerant, bidirectional interconnect network that enables: Automatic load balancing of message traffic. Automatic reconfiguration after fault detection. Scalable bandwidth as nodes are added. Depending on the nature of dispatch request, the communication between nodes may be a: Broadcast message is routed to all nodes in the system. Point to point message is routed to specific nodes. Features of BYNET Fault-tolerant Load balanced Scalable High Performance

ACCESS MODULE PROCESSOR


The Access Module Processor is the virtual processor that is responsible for managing a portion of the database. Each AMP holds portion of a table. A database manager subsystem resides on each AMP. This subsystem will: Lock database s and tables. Create, modify or delete definitions of tables. Insert, delete or modify rows within table. Retrieve information from definitions and tables. Returns the result set to Dispatcher.

DISK ARRAYS
Disk Array is a configuration of disk drives that utilizes specialized controllers to distribute data and parity across disks while providing fast access and data integrity. The disk array controllers are referred to as dual redundant active array controllers, which means that both controllers are actively used, in addition to serving as backup for each other. Each AMP vproc must have access to an array controller, which in turn accesses the physical disks. AMP vprocs are associated with one or more ranks (or mirrored pairs) of data. The total disk space associated with an AMP is called a vdisk. A vdisk may have up to three ranks.

TYPES OF PARALLELISM IN TERADATA


Multi-AMP

A suitable primary index spreads the rows uniformly across all AMPS and hence a query that uses more than one AMP is called Multi-AMP.
Multi-Step Am AMP can do 81 tasks in parallel. For example in a multiple join statement, Teradata prepares all tables simultaneously for join. Multi-Statement Placing two or more SQL statements inside same transaction. In BTEQ, users can do this by placing a semi-colon at the beginning of the second SQL statement.. This improves system performance as they reduce processing overhead and they are atomic.

Anda mungkin juga menyukai