By
info@kellytechno.com
Graph mining
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
Moores Law
www.kellytechnmo.com
Allows
developers to
use multiple
machines for a
single task
www.kellytechnmo.com
www.kellytechnmo.com
Yahoo
Over 170 PB
eBay
Over 6 PB
Must support
partial failure
Must be scalable
www.kellytechnmo.com
Failure should
not result in the
loss of any data
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
HDFS
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
Map
Reduce
www.kellytechnmo.com
Fault-Tolerance
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
www.kellytechnmo.com
NameNode
Secondary NameNode
JobTracker
DataNode
TaskTracker
JobTracker
TaskTracker
www.kellytechnmo.com
Hive
Pig
HBase
Cascading
Flume
Presented By
info@kellytechno.com