@TamaraDull
Big data
is not
new.
@TamaraDull
CRM
FINANCIAL
DATA
LOYALTY
CARD DATA
TROUBLE
TICKETS
PDF FILES
SPREADSHEETS
WORD
PROCESSING
DOCUMENTS
RFID TAGS
GPS
WEB LOG
DATA
PHOTOS
SATELLITE
IMAGES
SOCIAL
MEDIA DATA
BLOGS
FORUMS
CLICKSTREAM
DATA
VIDEOS
XML DATA
MOBILE
DATA
WEBSITE
CONTENT
RSS FEEDS
AUDIO
FILES
CALL CENTER
TRANSCRIPTS
POS DATA
On Todays Agenda
Whats Trending?
The 5 Questions
A Comparison and
Contrast Exercise
@TamaraDull
Part 1 of 3
Whats Trending?
@TamaraDull
SOURCE: http://wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017
@TamaraDull
@TamaraDull
@TamaraDull
@TamaraDull
Part 2 of 3
The 5 Questions
@TamaraDull
The 5 Questions
1) What can Hadoop do that my data warehouse cant?
2) Were not doing big data, so why do we need
Hadoop?
3) Is Hadoop enterprise-ready?
@TamaraDull
@TamaraDull
(via Hadoop)
@TamaraDull
Are we
there yet?
schema-on-write
hierarchically archived
less agile, fixed
configuration
mature
business professionals
@TamaraDull
vs.
DATA
DATA LAKE
structured / semistructured / unstructured,
raw
PROCESSING
schema-on-read
STORAGE
object-based, no
hierarchy
AGILITY
SECURITY
USERS
@TamaraDull
strengths
weaknesses
lower costs
one-stop data shopping
data management
security
opportunities
threats
discovery
advanced analytics
status quo
skills
Part 3 of 3
@TamaraDull
A Functional Comparison
Business Requirements
Discovery of unexplored business
questions
Clean, transformed, high-quality
aggregated data
Low latency, interactive reports, OLAP
High volumes of raw, highly granular,
unstructured data
Exploratory analysis of preliminary data
@TamaraDull
Traditional
Big Data
@TamaraDull
Free downloads:
Requirements:
Large number of data sources,
users, complex queries, analyses
and analytic applications
Data integration and integrity
Reusability and agility to
accommodate rapidly changing
business requirements and long
data life
@TamaraDull
Source: Special Report Big Data: What Does It Really Cost?, Wintercorp, 2013
Requirements:
Rapid, intensive processing of a
small number of closely-related
data sets
Analysis reads the entire dataset
Life of the raw data is relatively short
Small group of experts collaborate
on analysis
@TamaraDull
Source: Special Report Big Data: What Does It Really Cost?, Wintercorp, 2013
Data Warehouse
Platform
Hadoop
Data Warehouse
Appliance
Hadoop
$44.6
$1.4
$22.7
$1.4
Initial acquisition
$10.8
$0.2
$5.5
$0.2
Upgrades
$16.4
$0.3
$8.4
$0.3
Maintenance/support
$15.9
$0.2
$8.2
$0.2
Power/space/cooling
$1.5
$0.6
$0.6
$0.7
Administration
$7.7
$8.5
$0.8
$0.8
Application development
$16.5
$36.0
$6.6
$7.2
ETL
$18.4
--
--
--
Complex queries
$88.7
$475.0
--
--
Analysis
$88.7
$219.0
--
--
$265.0 million
$740.0 million
$30.0 million
$9.3 million
Cost
System Cost
@TamaraDull
Source: Special Report Big Data: What Does It Really Cost?, Wintercorp, 2013
Wrap-Up
@TamaraDull
@TamaraDull
presents
@TamaraDull