Lesson Agenda
Introduction
Course Objectives
Course Agenda
Lab Environment
Additional resources
Introduction
Questions invoking interest and participatio
n
How much do you know about big data?
What is the job market for big data analys
t?
What kind of professional skills does it re
quire to be big data analyst?
What do you hope to learn from this cour
se?
Course Objectives
After taking this course, you should know or be able to:
Understand big data and significance to enterprise
Acquire raw data using Hadoop HDFS and Oracle NoSQL Databa
se
Organize the collected data using MapReduce, Hive and Oracle B
ig Data Connectors
Analyze the data using Oracle connector and R Engine
Understand the big data governance and big data best practices
Learn writing Hadoop jobs in different languages
Programming Languages: Java, Python
High-Level Languages: Apache Pig, Hive
Chapter Structur
Ch 1 Ch 2
e Ch 3
Overview of Technical Oracle Big Data
Big Data Foundation Solution
Introduction
Raw Data Ch 4 Ch 5
Acquisition Using HDFS Using Oracle
NoSQL DB
Collected Data
Organization Ch 6 Ch 7
Using Hadoop Using Hive and Enterprise Big
MapReduce Pig Data Strategy
Data Analysis
Ch 8 Ch 9 Ch 10 Ch 11 Ch 12
Fundamental R Using ORCH & Integration Governance Best
ORE Practices
Course Materials
Each student computer includes:
Oracle BigDataLite 4.0 VM
Class Notes (slides)
Lab Activity Guide
Practice files
Overview of Big Data
Hottest technical topic since 2010
Data grows extremely rapidly
Examples of big data sources:
stream data, social networks such as face
book, twitter, smartphone location-based
serivces, web server logs, blogs, data fro
m sensors
What Is Big Data?
Big Data is defined as voluminous, unstructured data from
many different sources, such as:
Social networks
Banking and financial services
E-commerce services
Web-centric services
Internet search indexes
Scientific searches
Document searches
Medical records
Web logs
And so on
Big Data Definition
No single standard definition