SN-07 - Data Science (Basic Statistic)
SN-07 - Data Science (Basic Statistic)
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Menu Hari ini
Statistik Deskriptif
Distribusi Statistik
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Apa itu Statistika?
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Statistik
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Statistik vs Statistika
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Data
Cara
memperoleh Waktu
Sumber pengambilan
Sifat
Time Cross
Primer Sekunder Skala Internal Eksternal Series Section
pengukuran
Kualitatif Kuantitatif
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Aktifitas Statistika
Deskriptif Inferensial
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Populasi, Sampel, Teknik Sampling
Sumber gambar :
https://datatab.net/tutorial/descriptive-inferent
ial-statistics
Teknik Sampling
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Seberapa banyak sampel yang dibutuhkan?
Bergantung pada :
● Apakah populasinya berhingga atau tidak? Homogen/heterogen? (jika tak hingga dan
heterogen tentu butuh lebih banyak sampel)
● Teknik sampling yang digunakan
● Kebiasaan kasus di tiap domain ilmu
misalnya dalam kasus kesehatan, menemukan data pasien penyakit tertentu sangat sulit
sehingga 5-10 data sudah cukup
● Metode analisis/model yang akan digunakan.
Model yang nonlinier seperti neural network cenderung membutuhkan lebih banyak data
dibandingkan regresi linier.
Jika menggunakan machine learning, belum ada teori pasti minimal sampelnya (baca :
https://sites.uab.edu/periop-datascience/2021/06/28/sample-size-in-machine-learning-an
d-artificial-intelligence/
) namun jika menggunakan metode statistik, ada beberapa rumus yang bisa digunakan:
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh rumus ukuran sampel
yang terkenal
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Ada yang masih
ingat apa itu Statistik
Deskriptif?
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Measures of Central
Tendency
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Bagaimana mendapatkan beberapa
informasi, tanpa membaca poin data?
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh 1
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh 2
Data : {1, 1, 2, 3, 5, 8}
Mode = 1 (unimodal)
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Measures of Dispersion
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Mengambil “insight” suatu data
dari sebaran data
Standard
Range Variance IQR
Deviation
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh 1
Range
Alternatif : IQR = Q3 - Q1
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh 2
Variance
Standard
Deviation
Note: pembagi yang “n-1” itu untuk sampel, “n” itu buat populasi, aslinya standar deviasi ya akar dari
varians jadi rumusnya sama tinggal dipakein akar yang buat stdv)
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh 3
Jarak
Interkuartil
Proprietary document of Orbit Future Academy, 2021
Q3 - Q1 AI for Gen Y and AI for Start-Up
Outlier
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Kemiringan (Skewness)
Modus < Median < Mean Modus = Median = Mean Modus > Median > Mean
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Keruncingan (Kurtosis)
Leptokurtic curve menunjukkan data yang rentan terhadap nilai yang ekstrem, contoh
dalam kasus keuangan kurtosis yang tinggi pada grafik return saham menunjukkan resiko
yang tinggi terhadap return yang sangat besar atau sangat kecil.
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Yang cocok yang mana?
Best Measure of
Tipe Data
Central Tendency
Nominal = Mode
Ordinal = Median
Interval / Ratio = Median
(Skewed)
Interval / Ratio = Mean
(Non-Skewed)
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
LATIHAN
Okay guys,
Saat ini coba kalian pikirkan sebuah narasi informatif yang menggambarkan
data statistika deskriptif dan juga statistika inferensial.
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi
Distribusi yaitu fungsi yang menunjukkan semua nilai dari sebuah data
dan seberapa sering nilai tersebut terjadi. Untuk mengeceknya bisa
menggunakan grafik, misalnya histogram atau kurva garis.
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Statistik
Disini yang akan kita
bahas hanya 3 distribusi:
1) Data Kontinu →
Distribusi Normal
2) Data Diskrit →
Bernoulli dan
Binomial
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Bernoulli & Binomial
Jika kejadian ini terjadi sebanyak “n” kali dan saling bebas,
maka distribusi tsb disebut binomial.
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Bernoulli & Binomial
Misalkan p = peluang sukses, q = 1-p peluang gagal, maka:
q, jika x = 0, gagal
mean p np
varians pq npq
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Binomial
Contoh:
Sebuah mata uang dilempar sebanyak 5 kali. Berapa probabilitas munculnya sisi gambar
sebanyak 2 kali?
Jawab:
Diketahui:
n=5
x=2
P (x,n) = nCx . px . q(n-x)
P (2,5) = 5C2 (1/2)2 x (1/2)(5-2)
= 10 x 1/8 x ⅛
= 10/32
= 5/16
Distribusi Normal
Pdf: f(x) =
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Normal
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Distribusi Normal
Dari histogram
disamping, kira-
kira warna apa
yang
berdistribusi
normal?
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Teorema Limit Pusat
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Teorema Limit Pusat
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Statistik Inferensial
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi
Akan diperkirakan
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi
Sampling
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi Menggunakan
Regresi Linier Sederhana
Dengan:
Atau Y = ax+b a = slope
b = intercept
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh
Waktu Jumlah
penjualan product yang Seorang manager ingin mengetahui hubungan antara
terjual (Y)
lamanya tenaga penjualan melakukan penjualan dalam
1 2 satuan jam (x) dengan banyaknya produk yang berhasil
5 4 terjual (y). Dari sampel sebanyak 5 orang tenaga
4 6
penjualan, diperoleh data lamanya dan banyaknya
penjualan sebagai berikut,
2 4
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh
Xi Yi X2 XiYi
1 2 1 2
5 4 25 20
4 6 16 24
2 4 4 8
3 2 9 6
15 18 55 60
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Estimasi Interval
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Contoh
n = 36 Students
𝑋2 = 100 minutes
σ = 20 minutes
Confidence Interval = 95 %
Perkirakan waktu
belajar rata-rata
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Hipotesis
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
𝐻0 : Hipotesis Nol 𝐻a : Hipotesis Alternatif
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Z Score
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Uji Statistik
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Types of Error
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Activity
Total
Tahun
(Ton/Tahun)
Pemerintah Indonesia ingin memprediksi angka
2012 1.826,302 kebutuhan bahan kimia “XXX” di tahun 2021 dan
kebutuhannya di tahun 2045 nanti
2013 4.121,514
2014 4.606,627 Show data hasil kalian dan sampaikan dalam bentuk
yang informatif
2015 5.319,637
2016 6.946,482
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Referensi :
Sumber Website:
5 konsep dasar nya statistik untuk DS
https://towardsdatascience.com/the-5-basic-statistics-concepts-data-scientists-need-to-
know-2c96740377ae
Basic stastics
https://towardsdatascience.com/basic-statistics-you-need-to-know-for-data-science-1fd
d290f59b5
Stastik untuk data analisis
http://makemeanalyst.com/basic-statistics-for-data-analysis/
Konsep stastik untuk data sceince
https://www.mastersindatascience.org/learning/statistics-data-science/
Perhitungan Regresi Linier Sederhana
https://www.rumusstatistik.com/2020/05/regresi-linier-sederhana.html
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Referensi :
Sumber Buku Daftar Pustaka:
Furqon. 1999. Statistika Terapan Untuk Penelitian. Bandung : Alfabeta.
Kadir. 2016. Statistika Terapan. Jakarta: PT. Raja Grafindo Persada.
Landau, S & Everitt, B. S. 2004. A Handbook of Statistical Analyses Using SPSS.
New York: A CRC Press Company.
Rasyad, Rasdihan. 1998.Metode Statistik Deskriptif. Jakarta : Grasindo.
Somantri, Ating dan Sambas Ali Muhidin. 2006. Aplikasi Statistika dalam
Penelitian. Bandung : Pustaka Ceria.
Spiegel. M. R. & Stephens, L. J. 2004. Statistik. Jakarta: Erlangga.
Subana, dkk. 2000. Statistik Pendidikan. Bandung : Pustaka Setia.
Sudijono, Anas. 2008. Pengantar Statistik Pendidikan. Jakarta : Raja Grafindo
Persada.
Sudjana, M.A., M.SC.2005. Metode Statistika. Bandung : Tarsito.
Sugiyono. 2015. Statistika Untuk Penelitian. Bandung: Alfabeta.
Walpole, Ronald E, 1995. Pengantar Statistik Edisi Ke-4. Jakarta : PT Gramedia.
Walpole, Ronald E., et al. 2007. Probability & Statistics for Engineers &Scientists. New York: Prentice Hall
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Rangkuman
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
Quiz :
1. Apa perbedaan statistik dan statistika?
2. Perbedaan Median dan IQR?
3. Bagaimana mengambil “insight” suatu data dari sebaran
data?
4. Sebutkan 2 jenis Hipotesa!
5. Ceritakan tentang Regresi Linier Sederhana dan
penggunaanya.
Proprietary document of Orbit Future Academy, 2021 AI for Gen Y and AI for Start-Up
THANK YOU
Proprietary document of Orbit Future Academy, 2021 AI For Gen Y and AI For Startup