Anda di halaman 1dari 18

HP Vertica

Sreenivas Kalahasti

19-July-2016

Big Data Analytics

Agenda

What is Vertica
Release and Versions
Vertica VS the World
Vertica Overview
How does it work
How To Use Vertica( The right
way )
Where It Falls Short?
www.vertexcs.com

What is Vertica
True Columnar, MPP,
Analytic database
DBMS like other
DBMS( Oracle, Teradata,
MySQL)

Most Latest Version is HP


Vertica called 7.2 launched in
June 2016
Supports Standard SQL ( ANSI
SQL 99 ) called VSQL like other
Databases

Specifically made for


storing Structured and semistructured data
www.vertexcs.com

HP Vertica VS the World

www.vertexcs.com

Features of Vertica
Column Oriented Storage
Organization
Shared Nothing
Architecture
( Distributed )
High Performance
High Availability
Parallel Data Transfer

Auto Recover Process


Automatic DB Design
Standard SQL Interface
Data Compression
Easy Maintenance

www.vertexcs.com

Use cases

www.vertexcs.com

Vertica Database Architecture

www.vertexcs.com

How Does Vertica work

www.vertexcs.com

WOS Vs ROS

WOS (Write Optimized Store) :

ROS (Read Optimized Store) :

Mainly used to store INSERT,


UPDATE, DELETE and COPY
Actions.
WOS is arranged by Projection
,stores data without
compression or indexing.
Rows are stored in the same
sorting order which they are
inserted.

Highly Optimized and Read Oriented.


Arranged by Projection and stores
data with compression and indexing.
Use COPY...DIRECT and INSERT
(with /*direct*/ hint) to Load data
directly into ROS.

www.vertexcs.com

Projections in Vertica

www.vertexcs.com

10

Types Of Projections
Projection Types :
- Super Projection => Default
- Query Specific Projection => Contain only the specific columns
- Pre Join Projection => subset of query specific projections
- Buddy Projection => Copies of projections stored on an adjacent
node.

www.vertexcs.com

11

Delete / Update in Vertica

Deleted rows are only marked as deleted in Vertica


and will be stored in the Delete Vector on the disk.
To purge the Deleted rows we need to merge the ROS
,the Delete vector.

www.vertexcs.com

12

Best Practices in Vertica


Trickle Load => Load data incrementally with new and changed data.

www.vertexcs.com

13

Best Practices in Vertica


Moveout => Move data from memory (WOS) into a new ROS
container( Disk).

www.vertexcs.com

14

Best Practices in Vertica


Mergeout=> Use mergeout process to consolidate ROS containers and
purging deleted records.

www.vertexcs.com

15

Where Vertica Falls Short

www.vertexcs.com

16

HP Vertica
References
HP Vertica Essentials by Rishabh
Agrawal
HP Vertica Documentation

17

THANK
TIME FOR DISCUSSION & QUESTIONS
YOU

18