Welcome to
IPB University
Department of Statistics
Faculty of Mathematics and Natural Sciences
IPB University
Departemen Statistika
twitter: @kh_notodiputro Fakultas Matematika dan Ilmu Pengetahuan Alam
E-mail: khairil@apps.ipb.ac.id
Seri web-minar 3 Juni 2020
Pengantar
Sumber: https://www.educba.com/data-science-vs-statistics/
A data scientist makes hundreds of decisions every day. They range from small ones like how to tune a
model all the way up big ones like the team's R&D strategy.
Many of these decisions require a strong foundation in statistics and probability theory.
Inspiring Innovation with Integrity
7
Belajar Statistika untuk Sains Data
Core Statistics Concepts
Statistika Deskriptif, sebaran peluang, 1
pengujian hipotesis, regresi dan model
linear.
Statistical Bayesian Thinking
Machine Learning Peluang bersyarat, sebaran prior, sebaran 2
posterior, and kemungkinan maximum.
Bayesian Thinking
Statistical Machine Learning
Konsep pembelajaran mesin, model
Core Statistics Concepts klasifikasi, Metode Resampling, 3
Regularisasi dan Seleksi Model, Model
non-linear, Tree-based methods,
Support vector machine, unsupervised
learning.
Thomas Bayes secara genius berhasil merumuskan cara berpikir dan dan
cara orang mengambil keputusan ke dalam formula matematika. Dalil Bayes
sangat menakjubkan.
Inspiring Innovation with Integrity
12
Statistical Machine Learning
Machine learning allows computers to learn and discern patterns without actually
being programmed. When Statistical techniques and machine learning are combined
together they are a powerful tool for analysing various kinds of data in many computer
science/engineering areas including, image processing, speech processing, natural
language processing, robot control, as well as in fundamental sciences such as
biology, medicine, astronomy, physics, and materials. (Sugiyama, 2016)
+ =
Machine learning Statistics Statistical machine learning
Inspiring Innovation with Integrity
13
Statistical Machine Learning
Klasifikasi :
Regresi logistik
Tree-based Methods :
Fungsi diskriminan
Pohon regresi
Metode Resampling : Pohon klasifikasi
Validasi-silang Bagging, random forest, boosting
Bootstrap
Support Vector Machine :
Regularisasi dan Seleksi Model : Maximal margin classifier
Seleksi himpunan bagian terbaik Support vektor classifier
Metode penyusutan (shrinkage) SVM untuk kasus > 2 klasifikasi
Metode Reduksi dimensi
Unsupervised learning :
Model non-linear : Analisis komponen utama
Regresi splines Metode penggerombolan
Regresi lokal