@ impetuscalling
Outline
Big Data Current Scenario Cost components in a Big Data Warehouse Best Practices - Reducing the cost of Big Data solutions
Bytes produced every day Big data cost IDC/EMC Cost of wasted productivity because of information overload Estimated Internet Traffic by 2015 Size of the digital universe in 2011 90% of the data in the world today has been created in the last two years alone Estimated time for the digital universe to double
Age of Data
Age of Software
Age of Data
Build your own The promise of innovation Building reliable storage $1 per GB Add the cost of managing / monitoring / hosting
Cons
Software is free !! Glory to the Elephant Cost of Training thinking parallel is not intuitive Cost of Support support is not free
Cons
Rent what you need $14,000 a month for 100 TB data storage only
Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=56
Cons
Initial entry costs- Cost of experimentation Cost of integration and moving data - Cost of ETL Query and analytics capability Manageability On-going maintenance - Monitoring and tuning Changing capacity - Additional hardware Cost of compliance
Hardware
Lower cost of storage Lower cost of computation Make things faster Do more with less
Software
Just make sure your Read Throughput is high Setup data pipelines or use ILM Principles
10
Impetus Proprietary
11
Impetus Proprietary
12
Impetus Proprietary
13
14
15
16
Impetus Proprietary
17
Choosing MPP
$ per TB Driven
EMC Greenplum Teradata, Aster HP Vertica Oracle Exadata Netezza ParAccel Others
Impetus Proprietary
18
MapR HPCC Hadapt Pervasive DataRush, HStreaming Cloud Map Reduce DataStax Platform Computing MARS, GPMR ParStream
Impetus Proprietary
19
Column stores
HBase, Cassandra MongoDB, CouchDB Redis, Riak etc.; Kyoto Cabinet/Tokyo Tyrant, Berkley Neo4j SimpleDB
Documents stores
Key stores
GraphDB
Cloud stores
Impetus Proprietary
20
Postgres, InfiniDB, Infobright MySQL Cluster GridSQL, EnterpriseDB MS SQL Sybase IQ Specialized stores
Impetus Proprietary
21
Initial Entry Costs - Cost of Experimentation We recommend Follow Best Practices , Learn or Hire Cost of Integration and Moving Data- Cost of ETL We recommend - Remove costly licensed tools, switch to Map Reduce for ETL or ELT Manageability - Provisioning, management tools We recommend Opt for multi-vendor management toolsets, e.g. Impetus Ankush On-Going Maintenance- Monitoring and Tuning We recommend Automate! Automate! Automate! Changing Capacity - Additional Hardware Do you know the GPU?
2222
2323
About Us
Strategic partners for software product engineering and R&D Thought leaders in cutting-edge technologies Mature processes and practices that are methodical, yet flexible Diverse domain expertise
Impetus Proprietary
24
Questions
Impetus Proprietary
2626
Thank you
For more information, write to us at inquiry@impetus.com Click to edit Master subtitle style
@ impetuscalling