Syarif Hidayatullah
Shiftacademy Tutor
AWS & Google Cloud Certified
Metadata Concept
Metadata is structured information that
describes, explains, locates,
or otherwise makes a piece of information
easy to retrieve, use,
or manage
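A minimal sketch of the idea, assuming a tiny in-memory catalog: each dataset gets a structured record that describes it, and a lookup by tag makes it easy to find again. The field names, dataset names, and paths are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DatasetMetadata:
    """Structured information describing one dataset (hypothetical fields)."""
    name: str
    description: str
    owner: str
    location: str
    tags: List[str] = field(default_factory=list)


def find_by_tag(catalog, tag):
    """Return the names of every dataset that carries the given tag."""
    return [entry.name for entry in catalog if tag in entry.tags]


# Hypothetical catalog entries; names and paths are made up for this sketch.
catalog = [
    DatasetMetadata(
        name="customer_survey",
        description="Customer satisfaction survey responses",
        owner="research-team",
        location="/data/raw/survey/customer_survey.csv",
        tags=["survey", "customer"],
    ),
    DatasetMetadata(
        name="factory_sensors",
        description="Temperature readings from factory IoT sensors",
        owner="ops-team",
        location="/data/raw/iot/factory_sensors.json",
        tags=["iot", "sensor"],
    ),
]

print(find_by_tag(catalog, "survey"))
```

The metadata never touches the data itself: it only records what the data is, who owns it, and where it lives, which is what makes the data findable and manageable later.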
Types of Data
local data
survey data
IoT data (Industry 4.0)
visualization
machine learning
Big Data Architecture Overview
Data Environment
local data
visualization
survey data
Data Mart
Integration
Standardization
Staging / Raw
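One way to read the layers is as a flow from Staging / Raw up to the Data Mart. The sketch below assumes that reading and uses plain Python lists in place of real storage. The sample records, column names, and function names are hypothetical.

```python
# Staging / Raw: records land exactly as the sources produced them.
raw_local = [
    {"City": " Jakarta ", "Sales": "100"},
    {"City": "bandung", "Sales": "50"},
]
raw_survey = [
    {"city": "JAKARTA", "score": "4"},
    {"city": "Bandung", "score": "5"},
]


def standardize(record):
    """Standardization: lower-case the keys, trim values, normalize city names."""
    clean = {key.lower(): value.strip() for key, value in record.items()}
    clean["city"] = clean["city"].title()
    return clean


def integrate(sales_rows, survey_rows):
    """Integration: combine both sources on the shared city key."""
    scores = {row["city"]: int(row["score"]) for row in survey_rows}
    return [
        {
            "city": row["city"],
            "sales": int(row["sales"]),
            "score": scores.get(row["city"]),
        }
        for row in sales_rows
    ]


def build_data_mart(rows):
    """Data mart: a small, subject-specific table ready for visualization."""
    return sorted(rows, key=lambda row: row["sales"], reverse=True)


sales = [standardize(record) for record in raw_local]
survey = [standardize(record) for record in raw_survey]
mart = build_data_mart(integrate(sales, survey))

for row in mart:
    print(row)
```

Keeping the raw records untouched and cleaning them in a separate step means a mistake in standardization can be fixed and re-run without going back to the sources.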
Big Data
Big data is larger, more complex data sets, especially
from new data sources. These data sets are so
voluminous that traditional data processing software
just can’t manage them. But these massive volumes of
data can be used to address business problems you
wouldn’t have been able to tackle before.
Source: Oracle
Big Data
Source: Databricks
Hive
The Apache Hive ™ data warehouse software facilitates
reading, writing, and managing large datasets residing
in distributed storage using SQL. Structure can be
projected onto data already in storage.
Source: Databricks
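The phrase "structure can be projected onto data already in storage" can be illustrated with a small sketch. This is plain Python, not Hive: it assumes a stored text file with no types attached and a schema that is declared separately and applied only at read time. In Hive, the rough equivalent is declaring a table over files that already exist. The column names and sample rows are hypothetical.

```python
import csv
import io

# Data already "in storage": plain text with no column names or types.
stored_file = io.StringIO("1,Jakarta,100.5\n2,Bandung,50.0\n")

# The structure is declared separately, as (column name, type) pairs.
schema = [("id", int), ("city", str), ("sales", float)]


def read_with_schema(file_obj, schema):
    """Project the schema onto each stored line while reading it."""
    rows = []
    for raw_values in csv.reader(file_obj):
        row = {
            name: cast(value)
            for (name, cast), value in zip(schema, raw_values)
        }
        rows.append(row)
    return rows


rows = read_with_schema(stored_file, schema)

# Once the structure is projected, the data can be queried like a table.
total_sales = sum(row["sales"] for row in rows)
print(rows)
print(total_sales)
```

The stored file is never rewritten. Changing the schema only changes how the same bytes are interpreted on the next read.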
Next: Cloud Big Data
• Easy to deploy
• Pay as you go
• Free tier!
I’m a newbie, how do I start?
Get yourself ready!
• Linux basics
• SQL basics
• Python for Analytics
• Cloud 101