
BIG DATA

Anne Mudya Yolanda, S. Stat., M. Si.


Statistics Study Program – Universitas Riau
INTRODUCTION
Anne Mudya Yolanda, S. Stat., M. Si.
Contact

annemudyayolanda@lecturer.unri.ac.id
085274983345

Course
Big Data
The Big Data Analysis course covers the concepts of Big Data analysis,
including Volume, Velocity, and Variety (3V), as well as predictive analysis
that is not constrained by the size of the data being processed.
Technological advances in storing, processing, and analyzing Big Data include
(a) the rapid decline in the cost of data storage in recent years;
(b) the flexibility and cost-effectiveness of data centers and cloud
computing for elastic computation and storage; and (c) the development of
new frameworks such as the Hadoop Ecosystem, which let users take advantage
of distributed computing systems that store large amounts of data through
parallel processing. These advances have created several differences
between traditional analysis and advanced analysis of Big Data.

Computer Programming
PREREQUISITE: STATISTICAL COMPUTING
CREDITS (SKS): 3
SOFTWARE: R

COURSE LEARNING OUTCOMES
• Learn and understand the basic concepts of Big Data analysis, including Volume,
Velocity, and Variety (3V).
• Be able to perform predictive analysis, or apply other specific methods, to
extract value from data without being constrained by the size of the data
required.
• Be able to address the challenges of "Big Data" (analysis, capture, curation,
search, sharing, storage, transfer, visualization, and information privacy) and
the opportunities it creates across domains, and understand how innovative
statistical techniques and algorithms can help collect important data and
speed up the discovery of information in large datasets.
• Be able to tap the potential of large datasets to improve operations or to
identify actions that should be taken more quickly, leading to smarter
decision making from Big Data.
• Be able to analyze data with the help of statistical software.

COURSE TOPICS

1. Introduction to Big Data and the Analytics Lifecycle
2. Fundamentals of Analytic Methods
3. Advanced Data Analytics Theory and Methods (Clustering)
4. Advanced Data Analytics Theory and Methods (Regression)
5. Big Data Technology and Tools 1 of 3

WEEKLY MEETING PLAN
WEEK    TOPIC
1       Be able to explain and understand the concept of Big Data
2-3     Be able to understand the phenomena, frameworks, opportunities, and
        challenges of all activities related to Big Data
4-5     Be able to understand the Fundamentals of Data Analytic Methods
6-7     Be able to explain and understand Advanced Data Analytics Theory and
        Methods (Clustering) and work through a case study
8       Midterm Exam (UTS)
9-10    Be able to explain and understand Advanced Data Analytics Theory and
        Methods (Classification) and work through a case study
11-12   Be able to explain and understand Advanced Data Analytics Theory and
        Methods (Regression) and work through a case study
13      Know and understand Big Data technology and tools
        (Big Data Technology and Tools 1 of 3)
14      Be able to understand Machine Learning methods for Big Data
15      Understand the challenges and opportunities of Big Data

ASSESSMENT COMPONENTS

Attendance & Participation
Assignments
Quizzes
Midterm Exam (UTS)
Final Exam (UAS)

Assessment weights:
Daily Score (NH), from structured assignments = 40%
Midterm Exam (UTS) = 20%
Final Exam (UAS) = 40%
Final Score (NA) = 40% NH + 20% UTS + 40% UAS
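
As a worked illustration of the weighting formula, the Python sketch below computes a final score from the three components. The function name `final_score`, the 0-100 score scale, and the sample scores are assumptions made for illustration only; the weights are the ones stated in the syllabus.

```python
# Weights as stated in the syllabus: NH 40%, UTS 20%, UAS 40%.
WEIGHTS = {"NH": 0.40, "UTS": 0.20, "UAS": 0.40}


def final_score(nh, uts, uas):
    """Return the weighted final score (NA).

    Assumes every component is scored on a 0-100 scale (not stated
    explicitly in the syllabus).
    """
    for name, value in (("NH", nh), ("UTS", uts), ("UAS", uas)):
        if not 0 <= value <= 100:
            raise ValueError(f"{name} must be between 0 and 100, got {value}")
    return WEIGHTS["NH"] * nh + WEIGHTS["UTS"] * uts + WEIGHTS["UAS"] * uas


# Hypothetical student: NH = 80, UTS = 70, UAS = 90.
# 0.4 * 80 + 0.2 * 70 + 0.4 * 90 = 32 + 14 + 36 = 82
print(round(final_score(80, 70, 90), 2))
```

Because NH and UAS each carry 40%, a weak midterm costs less than an equally weak final exam or assignment record.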

The final grade is determined according to the FMIPA UNRI regulations that
are in force, or that will come into force, at the time final grades are
assigned.
No  Letter Grade  Final Score (NA) Range  Grade Points
1   A             NA ≥ 85                 4.00
2   A-            80 ≤ NA < 85            3.75
3   B+            75 ≤ NA < 80            3.50
4   B             70 ≤ NA < 75            3.00
5   B-            65 ≤ NA < 70            2.75
6   C+            60 ≤ NA < 65            2.50
7   C             55 ≤ NA < 60            2.00
8   D             40 ≤ NA < 55            1.00
9   E             NA < 40                 0.00
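
The grade table can be read as a simple threshold lookup. The Python sketch below is one way to express it, reading the A band as NA ≥ 85; the name `letter_grade` and the sample score are hypothetical, and the binding rules remain the FMIPA UNRI regulations.

```python
# (lower bound, letter grade, grade points), highest band first,
# transcribed from the grade table. The A band is read as NA >= 85.
GRADE_BANDS = [
    (85, "A", 4.00),
    (80, "A-", 3.75),
    (75, "B+", 3.50),
    (70, "B", 3.00),
    (65, "B-", 2.75),
    (60, "C+", 2.50),
    (55, "C", 2.00),
    (40, "D", 1.00),
]


def letter_grade(na):
    """Map a final score (NA) to a (letter grade, grade points) pair."""
    for lower_bound, letter, points in GRADE_BANDS:
        if na >= lower_bound:
            return letter, points
    return "E", 0.00  # anything below 40


# Hypothetical final score of 82 falls in the 80 <= NA < 85 band.
print(letter_grade(82))  # ('A-', 3.75)
```

Checking the bands from highest to lowest means each score matches the first lower bound it reaches, so the upper bounds never need to be tested explicitly.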
Main References
1. Big Data Analytics, 1st Edition. Editor(s): Govindaraju, Raghavan, and Rao. Release
date: 07 Jul 2015. Imprint: Elsevier.
2. Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and
Presenting Data. Editor: EMC Education Services. January 2015.
3. Judith S. Hurwitz, et al. 2013. Big Data for Dummies. John Wiley & Sons, Inc.,
Hoboken, New Jersey.
Supporting References
1. Walkowiak, S. 2016. Big Data Analytics with R: Utilize R to uncover hidden patterns
in your Big Data. Packt Publishing.
2. Ledolter, J. 2013. Data Mining and Business Analytics with R. John Wiley & Sons.
3. Sudeep Tanwar, Sudhanshu Tyagi, and Neeraj Kumar. 2020. Multimedia Big Data
Computing for IoT Applications. Springer.
CLASS RULES

1. Observe the campus dress code.
2. Students may arrive at most 15 minutes late to class.
3. Students with less than 80% attendance are not permitted to take the
final exam (UAS). All consequences of this follow the academic
regulations of FMIPA UNRI.
4. Make-up midterm (UTS) and final (UAS) exams are given only to students
who can present a letter of recommendation from Vice Dean I.
5. Late assignments are penalized with a deduction from the assignment
score. Assignments more than three (3) days late are treated as not
submitted.
6. Any form of plagiarism is penalized with a final grade of E.

LABORATORY RULES
1. Every student must have a SILK account, using their student ID number (NIM).
2. Eating and drinking are prohibited in the laboratory because they can
damage the computers' electrical components.
3. Bring a USB flash drive for storing files.
4. Shut down the computer after use.
5. For classes that use Google Classroom, every student must have a
Google email account.
6. Replacing or swapping computer components in the laboratory without
the permission and knowledge of the Department of Mathematics, FMIPA
UNRI, is prohibited.

Basic Concepts

Before we begin

◉ Things to be known
◉ 4,670 campuses, 27,779 study programs
◉ 8,043,480 enrolled students
◉ 1,247,116 graduates
◉ Educated unemployment?

Good news or bad news?

Statistik Pendidikan Tinggi Indonesia 2018 (Indonesian Higher Education Statistics 2018)


Before we begin

◉ Things to be done?
◉ Intention and mindset
■ Using software
■ Programming
■ Analysis
■ Problem solving
◉ Hard = if you don't like it
◉ Easy = if you like it

Who is smarter?

VS

Computers
◉ Receive, interpret, and execute instructions
◉ Can they think independently?

Humans
◉ Have desires, objectives, and goals
◉ Can think for themselves, independently
◉ Can direct computers and make them work toward a goal

Humans and computers

Needs, wants, goals, objectives

Translation into a language the computer understands

algorithm
programming language
interpretation
PROCESSING LARGER AMOUNTS OF DATA
output

BIG DATA
AN OVERVIEW

The Evolution of Data Management

We are dealing with a lot of complexity when it comes to data.
Some data is structured and stored in a traditional relational
database, while other data, including documents, customer
service records, and even pictures and videos, is unstructured.
The availability and adoption of newer, more powerful mobile
devices, coupled with ubiquitous access to global networks, will
drive the creation of new sources for data.

The opportunity and challenge of Big Data

The challenge is how companies can make sense of the intersection
of all these different types of data.
More data exists, and it varies in type and timeliness.
People are also finding more ways to make use of this
information than ever before.
We need to think about managing data differently.

Getting Started with Big Data

◉ Extremely large Volumes of data
◉ Extremely high Velocity of data
◉ Extremely wide Variety of data

Three attributes stand out as defining Big Data characteristics:

◉ Huge volume of data: Rather than thousands or millions of
rows, Big Data can be billions of rows and millions of
columns.
◉ Complexity of data types and structures: Big Data reflects
the variety of new data sources, formats, and structures,
including digital traces being left on the web and other
digital repositories for subsequent analysis.
◉ Speed of new data creation and growth: Big Data can
describe high-velocity data, with rapid data ingestion and
near-real-time analysis.

Why is Big Data important?

Because it enables organizations to gather, store, manage, and
manipulate vast amounts of data at the right speed, at the right
time, to gain the right insights.

Several industries have led the way in developing their ability to
gather and exploit data:

Credit card companies monitor every purchase their
customers make and can identify fraudulent
purchases with a high degree of accuracy using rules
derived by processing billions of transactions.

Mobile phone companies analyze subscribers’
calling patterns to determine, for example, whether
a caller’s frequent contacts are on a rival network. If
that rival network is offering an attractive promotion
that might cause the subscriber to defect, the mobile
phone company can proactively offer the subscriber
an incentive to remain in her contract.

For companies such as LinkedIn and Facebook, data
itself is their primary product. The valuations of
these companies are heavily derived from the data
they gather and host, which contains more and more
intrinsic value as the data grows.

What’s driving the data deluge

Although genotyping analyzes only a fraction of a genome and
does not provide as much granularity as genetic sequencing, it
does point to the fact that data and complex analysis are
becoming more prevalent and less expensive to deploy.

Data evolution and rise of Big Data sources

Emerging Big Data ecosystems
Key roles of the new Big Data ecosystem

1. Deep Analytical Talent: technically savvy, with strong analytical skills.
Members can handle raw, unstructured data and apply complex analytical
techniques at massive scales. They need access to a robust analytic sandbox
or large-scale analytical workspace. Examples include statisticians,
economists, mathematicians, and data scientists.

2. Data Savvy Professionals: have less technical depth but a basic knowledge
of statistics or machine learning, and can define key questions that can be
answered using advanced analytics. They tend to have a base knowledge of
working with data, or an appreciation for some of the work being performed
by data scientists and others with deep analytical talent. Examples include
financial analysts, market research analysts, life scientists, operations
managers, and business and functional managers.

3. Technology and Data Enablers: provide technical expertise to support
analytical projects, such as provisioning and administrating analytical
sandboxes, and managing large-scale data architectures that enable
widespread analytics within companies and other organizations. This role
requires skills related to computer engineering, programming, and database
administration.
Examples of Big Data Analytics

Big Data presents many opportunities to improve sales and
marketing analytics. An example of this is the U.S. retailer
Target. Charles Duhigg’s book The Power of Habit discusses
how Target used Big Data and advanced analytical methods to
drive new revenue.

Statisticians determined that the retailer made a great deal of
money from three main life-event situations:

💑 Marriage: when people tend to buy many new products
💑 Divorce: when people buy new products and change their spending habits
👪 Pregnancy: when people have many new things to buy and have an urgency to buy them

Target determined that the most lucrative of these life-events is
the third situation: pregnancy.

Target was able to identify this fact and predict which of its
shoppers were pregnant. Target knew a female shopper was
pregnant even before her family knew. This kind of knowledge
allowed Target to offer specific coupons and incentives to its
pregnant shoppers. In fact, Target could not only determine if a
shopper was pregnant, but in which month of pregnancy a
shopper may be. This enabled Target to manage its inventory,
knowing that there would be demand for specific products and
it would likely vary by month over the coming nine- to
ten-month cycles.

TODAY’S HIGHLIGHTS

LIFESTYLE
SOCIAL MEDIA
COFFEE
FOOD

THANKS!
Any questions?

See you next meeting

