Selamat datang di Scribd!

Lewati carousel

Comparing We Ka and R

Diunggah oleh

npnbkck

0% menganggap dokumen ini bermanfaat (0 suara)

16 tayangan5 halaman

compare machine learning software

Judul Asli

Comparing We Ka and r

Hak Cipta

Format Tersedia

PDF, TXT atau baca online dari Scribd

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Laporkan Dokumen Ini

compare machine learning software

Hak Cipta:

Format Tersedia

Unduh sebagai PDF, TXT atau baca online dari Scribd

Tandai sebagai konten tidak pantas

0% menganggap dokumen ini bermanfaat (0 suara)

16 tayangan5 halaman

Comparing We Ka and R

Diunggah oleh

npnbkck

compare machine learning software

Hak Cipta:

Format Tersedia

Unduh sebagai PDF, TXT atau baca online dari Scribd

Tandai sebagai konten tidak pantas

Lompat ke Halaman

Anda di halaman 1dari 5

Cari di dalam dokumen

Comparing Weka and R

Overview
Purpose and Audience
This document is intended for users in the process of evaluating and comparing data mining offerings from
Pentaho with the R statistical software suite. Wherever possible, this document attempts to use publicly
verifiable information and respected third-party sources.

Functional Comparison
Introduction
Weka and R are two prominent open-source software systems for analytics. Both originate from academia,
but have different goals and focus. While R comes from the statistics community and is a general-purpose
environment for statistical analysis, Wekas origin is in computer science, and, as such, was designed
specifically for machine learning and data mining. When choosing analytical software, you need to carefully
consider the goals for data mining within your organization, including potential deployment of predictive
models. Pentaho Data Mining, based on Weka, is 100% Java, facilitating simple integration and deployment
within a Pentaho BI solution.

Solution Breadth
Weka provides a broad selection of data mining and machine learning techniques, more so than R does. R is
a general purpose statistical environment and has facilities that you would not necessarily expect to see in a
data mining tool. Weka is arguably more user friendly, with familiar point-and-click graphical user interfaces,
while R is driven by what is essentially a functional programming language.

Data Import
Feature

Weka

Text file delimited

ARFF

C4.5 format

Database

SAS file

SPSS file

Minitab

Last Modified on December 17, 2007

Data Exploration/Visualization
Feature

Weka

Descriptive statistics

Frequency table

Scatter plot

Scatter plot matrices

Histograms

Tree/Graph visualization

Boxplots

ROC curve

Precision/recall curve

Lift chart

Cost curve

Weka

Oversampling/balancing

Random

Stratified

Equal width

Equal frequency

Data Preparation
Feature
Sampling

Discretization (binning)

Supervised

Reorder fields

Identifier fields

Normalization/standardization

Binarization

Derived fields

Outlier detection

Principal components

Random projections

Attribute selection

Arbitrary kernels

Pentaho TM

Comparing Weka and R

Modelling
Feature

Weka

Bayesian
Nave Bayes

Nave Bayes multinomial

Complement nave Bayes

Averaged one-dependence estimators

Weigted averaged one-dependence

estimators
Bayes nets
Nave Bayes trees

Bayesian additive regression trees

Lazy Bayesian rules

Functions
Linear regression

Logistic regression

Isotonic regression

Least median squares regression

Pace regression

Support vector machines

(via interface to third party app)

Multilayer perceptron (neural net)

(single hidden layer NN)

Radial basis function network

Gaussian processes

Voted perceptron

Lazy
K-nearest neighbors

Locally weighted learning

Trees
ID3

C4.5

CART

Decision stumps

Random forests

Best first tree

Logistic model trees

M5 model tree

Alternating decision trees

Interactive tree construction

KNN trees

Rules
Decision table

RIPPER

Conjunctive rule

Pentaho TM

Comparing Weka and R

Feature

Weka

M5 Rules

PART

Ripple down rules (Ridor)

NNge

OneR

Ensmeble learning
AdaBoost

LogitBoost

Additive regression

Bagging

Stacking

Dagging

Grading

MultiBoost

Voted classifier

MetaCost

Ensembles of nested dichotomies

Multi instance learning methods

Clustering
EM

KMeans

XMeans

COBWEB (hierarchical)

OPTICS

Farthest first clustering

Hierarchical clustering

Agglomerative nesting

Fuzzy C-means clustering

Bagged clustering

Cluster ensembles

Convex clustering

Association rules
Apriori

Predictive Apriori

Tertius

Generalized sequential patterns

Eclat

Pentaho TM

(via interface to third pary app)

(via interface to third party app)

Comparing Weka and R

Evaluation
Feature

Weka

Prediction accuracy

Confusion matrix

AUC

Information-retrieval stats

Information-theoretic stats

ROC / lift charts

Experiment facility

Deployment
Feature

Weka

Serialized java object

Java source code (limited)

PMML (limited)

Pentaho TM

Comparing Weka and R

Anda mungkin juga menyukai

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Dari Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Penilaian: 4 dari 5 bintang
4/5 (5794)
The Little Book of Hygge: Danish Secrets to Happy Living
Dari Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Penilaian: 3.5 dari 5 bintang
3.5/5 (400)
Understanding Cpu Pipelining Through Simulation Programming
Dokumen9 halaman
Understanding Cpu Pipelining Through Simulation Programming
npnbkck
Belum ada peringkat
Java Coding Standards
Dokumen76 halaman
Java Coding Standards
S R Krishnan
100% (2)
HW 9 Sol
Dokumen2 halaman
HW 9 Sol
npnbkck
Belum ada peringkat
Bystander Intervention PPT TAI
Dokumen12 halaman
Bystander Intervention PPT TAI
npnbkck
Belum ada peringkat
Giao Trinh Ky Thuat Lap Trinh Nang Cao
Dokumen108 halaman
Giao Trinh Ky Thuat Lap Trinh Nang Cao
Nguyễn Tuấn
Belum ada peringkat
Java Code Conventions
Dokumen24 halaman
Java Code Conventions
LeeLi
Belum ada peringkat
Manual Buiding 1
Dokumen1 halaman
Manual Buiding 1
npnbkck
Belum ada peringkat
PAT-Quick Start Guide
Dokumen20 halaman
PAT-Quick Start Guide
npnbkck
Belum ada peringkat
Restricted Boltzmann Machines On Multi-Core Processors: by Sai Prasad Nooka Stavan Karia
Dokumen31 halaman
Restricted Boltzmann Machines On Multi-Core Processors: by Sai Prasad Nooka Stavan Karia
npnbkck
Belum ada peringkat
DesignBuilder Simulation Training - HSD PDF
Dokumen20 halaman
DesignBuilder Simulation Training - HSD PDF
npnbkck
100% (2)
DesignBuilder Guide
Dokumen1 halaman
DesignBuilder Guide
npnbkck
Belum ada peringkat
Automatedtradingstrategies 2188856
Dokumen50 halaman
Automatedtradingstrategies 2188856
npnbkck
Belum ada peringkat
42 45 SeqClustering - Ps
Dokumen5 halaman
42 45 SeqClustering - Ps
npnbkck
Belum ada peringkat
Computer Manual in Pattern Classification
Dokumen12 halaman
Computer Manual in Pattern Classification
Xu Zhiming
0% (4)
Integrating Hardware With The Point of Sale: FR Ed Eric Van Der Essen
Dokumen19 halaman
Integrating Hardware With The Point of Sale: FR Ed Eric Van Der Essen
npnbkck
Belum ada peringkat
Tell Me More Common Problems
Dokumen5 halaman
Tell Me More Common Problems
npnbkck
Belum ada peringkat
Introduction To Machine Learning
Dokumen1 halaman
Introduction To Machine Learning
npnbkck
Belum ada peringkat
Writing The Literature Review Part 3: Your Turn Now!
Dokumen4 halaman
Writing The Literature Review Part 3: Your Turn Now!
npnbkck
Belum ada peringkat
Simulation of HVAC Plants in 2 Brazilian Cities Us
Dokumen9 halaman
Simulation of HVAC Plants in 2 Brazilian Cities Us
npnbkck
Belum ada peringkat
Scalable WhitePaper BigData Telecom
Dokumen7 halaman
Scalable WhitePaper BigData Telecom
npnbkck
Belum ada peringkat
ch09 04 14 14
Dokumen18 halaman
ch09 04 14 14
npnbkck
Belum ada peringkat
Visio-Drawing1 VSDX
Dokumen3 halaman
Visio-Drawing1 VSDX
npnbkck
Belum ada peringkat
MSDF - Study Paper
Dokumen20 halaman
MSDF - Study Paper
jrkumar
Belum ada peringkat
Lineq
Dokumen20 halaman
Lineq
npnbkck
Belum ada peringkat
Adecco Vietnam Salary Guide 2016
Dokumen50 halaman
Adecco Vietnam Salary Guide 2016
Trí Võ
Belum ada peringkat
Using Maple To Perform Least Squares Fit (Or Regression)
Dokumen16 halaman
Using Maple To Perform Least Squares Fit (Or Regression)
npnbkck
Belum ada peringkat
Eliminate Gerrymandering
Dokumen17 halaman
Eliminate Gerrymandering
npnbkck
Belum ada peringkat
Lect 30
Dokumen6 halaman
Lect 30
npnbkck
Belum ada peringkat
Shoe Dog: A Memoir by the Creator of Nike
Dari Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Penilaian: 4.5 dari 5 bintang
4.5/5 (537)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Dari Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Penilaian: 4 dari 5 bintang
4/5 (895)
The Yellow House: A Memoir (2019 National Book Award Winner)
Dari Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Penilaian: 4 dari 5 bintang
4/5 (98)
The Emperor of All Maladies: A Biography of Cancer
Dari Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Penilaian: 4.5 dari 5 bintang
4.5/5 (271)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dari Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Penilaian: 3.5 dari 5 bintang
3.5/5 (231)
Never Split the Difference: Negotiating As If Your Life Depended On It
Dari Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Penilaian: 4.5 dari 5 bintang
4.5/5 (838)
Grit: The Power of Passion and Perseverance
Dari Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Penilaian: 4 dari 5 bintang
4/5 (588)
On Fire: The (Burning) Case for a Green New Deal
Dari Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Penilaian: 4 dari 5 bintang
4/5 (74)
Yes Please
Dari Everand
Yes Please
Amy Poehler
Penilaian: 4 dari 5 bintang
4/5 (1891)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Dari Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Penilaian: 4.5 dari 5 bintang
4.5/5 (474)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Dari Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Penilaian: 4.5 dari 5 bintang
4.5/5 (266)
The Unwinding: An Inner History of the New America
Dari Everand
The Unwinding: An Inner History of the New America
George Packer
Penilaian: 4 dari 5 bintang
4/5 (45)
Fear: Trump in the White House
Dari Everand
Fear: Trump in the White House
Bob Woodward
Penilaian: 3.5 dari 5 bintang
3.5/5 (738)
Team of Rivals: The Political Genius of Abraham Lincoln
Dari Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Penilaian: 4.5 dari 5 bintang
4.5/5 (234)
Principles: Life and Work
Dari Everand
Principles: Life and Work
Ray Dalio
Penilaian: 4 dari 5 bintang
4/5 (599)
Angela's Ashes: A Memoir
Dari Everand
Angela's Ashes: A Memoir
Frank McCourt
Penilaian: 4.5 dari 5 bintang
4.5/5 (440)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Dari Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Penilaian: 3.5 dari 5 bintang
3.5/5 (2259)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Dari Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
Penilaian: 4 dari 5 bintang
4/5 (1090)
Rise of ISIS: A Threat We Can't Ignore
Dari Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Penilaian: 3.5 dari 5 bintang
3.5/5 (137)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Dari Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Penilaian: 4.5 dari 5 bintang
4.5/5 (344)
Bad Feminist: Essays
Dari Everand
Bad Feminist: Essays
Roxane Gay
Penilaian: 4 dari 5 bintang
4/5 (1015)
Steve Jobs
Dari Everand
Steve Jobs
Walter Isaacson
Penilaian: 4.5 dari 5 bintang
4.5/5 (806)
John Adams
Dari Everand
John Adams
David McCullough
Penilaian: 4.5 dari 5 bintang
4.5/5 (2409)
The Glass Castle: A Memoir
Dari Everand
The Glass Castle: A Memoir
Jeannette Walls
Penilaian: 4.5 dari 5 bintang
4.5/5 (1712)
The Outsider: A Novel
Dari Everand
The Outsider: A Novel
Stephen King
Penilaian: 4 dari 5 bintang
4/5 (1839)
The Light Between Oceans: A Novel
Dari Everand
The Light Between Oceans: A Novel
M.L. Stedman
Penilaian: 4.5 dari 5 bintang
4.5/5 (789)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Dari Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Penilaian: 4.5 dari 5 bintang
4.5/5 (121)
Brooklyn: A Novel
Dari Everand
Brooklyn: A Novel
Colm Toibin
Penilaian: 3.5 dari 5 bintang
3.5/5 (1937)
The Woman in Cabin 10
Dari Everand
The Woman in Cabin 10
Ruth Ware
Penilaian: 3.5 dari 5 bintang
3.5/5 (2322)
A Man Called Ove: A Novel
Dari Everand
A Man Called Ove: A Novel
Fredrik Backman
Penilaian: 4.5 dari 5 bintang
4.5/5 (4609)
Little Women
Dari Everand
Little Women
Louisa May Alcott
Penilaian: 4 dari 5 bintang
4/5 (104)
Manhattan Beach: A Novel
Dari Everand
Manhattan Beach: A Novel
Jennifer Egan
Penilaian: 3.5 dari 5 bintang
3.5/5 (792)
Wolf Hall: A Novel
Dari Everand
Wolf Hall: A Novel
Hilary Mantel
Penilaian: 4 dari 5 bintang
4/5 (3811)
The Perks of Being a Wallflower
Dari Everand
The Perks of Being a Wallflower
Stephen Chbosky
Penilaian: 4.5 dari 5 bintang
4.5/5 (2103)
The Art of Racing in the Rain: A Novel
Dari Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Penilaian: 4 dari 5 bintang
4/5 (4200)
A Tree Grows in Brooklyn
Dari Everand
A Tree Grows in Brooklyn
Betty Smith
Penilaian: 4.5 dari 5 bintang
4.5/5 (1929)
Sing, Unburied, Sing: A Novel
Dari Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Penilaian: 4 dari 5 bintang
4/5 (1103)
Her Body and Other Parties: Stories
Dari Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Penilaian: 4 dari 5 bintang
4/5 (821)
The Constant Gardener: A Novel
Dari Everand
The Constant Gardener: A Novel
John le Carré
Penilaian: 3.5 dari 5 bintang
3.5/5 (104)
Social Networking Site For A College
Dokumen8 halaman
Social Networking Site For A College
Gaus Patel
Belum ada peringkat
SYS600 - DNP 3.0 Slave Protocol
Dokumen104 halaman
SYS600 - DNP 3.0 Slave Protocol
Bhageerathi Sahu
Belum ada peringkat
Project Report 2021
Dokumen23 halaman
Project Report 2021
Ananmay Dixit
40% (5)
SYCS Sem III SQ Oct 2021 Physical Computing and IOT Programming
Dokumen13 halaman
SYCS Sem III SQ Oct 2021 Physical Computing and IOT Programming
siddhi khamkar
Belum ada peringkat
Sappress Sap Governance Risk and Compliance PDF
Dokumen33 halaman
Sappress Sap Governance Risk and Compliance PDF
Prince McGershon
Belum ada peringkat
QuantEconlectures Python3
Dokumen1.362 halaman
QuantEconlectures Python3
Cristian F. Sanabria
Belum ada peringkat
HWC YWCi 9
Dokumen257 halaman
HWC YWCi 9
Stefan Mihai Dodan
Belum ada peringkat
Structured Web Documents in XML: Grigoris Antoniou Frank Van Harmelen
Dokumen109 halaman
Structured Web Documents in XML: Grigoris Antoniou Frank Van Harmelen
Kainat Ansar
Belum ada peringkat
SAP Tips & Tricks Questions:: /N /o Search - Sap - Menu Context Menu
Dokumen6 halaman
SAP Tips & Tricks Questions:: /N /o Search - Sap - Menu Context Menu
Subodh Kant
Belum ada peringkat
Konica 7060 Solutions
Dokumen103 halaman
Konica 7060 Solutions
jhall4363
Belum ada peringkat
Sales and Inventory System Thesis Docume
Dokumen29 halaman
Sales and Inventory System Thesis Docume
Nite Guerrero
Belum ada peringkat
Epm Automate
Dokumen130 halaman
Epm Automate
mickey
100% (1)
Serious Games in Humanitarian User Reasearch 2020 SCUK LLST Imaginetic PDF
Dokumen40 halaman
Serious Games in Humanitarian User Reasearch 2020 SCUK LLST Imaginetic PDF
Fressy Nugroho
Belum ada peringkat
DigitalFluencyCCIG V 1.0
Dokumen125 halaman
DigitalFluencyCCIG V 1.0
Lavesh
100% (1)
Firmware - Wikipedia
Dokumen3 halaman
Firmware - Wikipedia
selbal
Belum ada peringkat
Log 2012-10-27 17-01-52
Dokumen78 halaman
Log 2012-10-27 17-01-52
farahnami
Belum ada peringkat
BCS International Diploma in Business Analysis Syllabus: December 2016
Dokumen20 halaman
BCS International Diploma in Business Analysis Syllabus: December 2016
chee pin wong
Belum ada peringkat
DGX 230 - YPG 235 Owner's Manual
Dokumen128 halaman
DGX 230 - YPG 235 Owner's Manual
Ramon Paz
Belum ada peringkat
Allegro Platform System Requirements
Dokumen21 halaman
Allegro Platform System Requirements
Karla Ruiz
Belum ada peringkat
8 Bit Kogge Stone Adder - Final - Report
Dokumen8 halaman
8 Bit Kogge Stone Adder - Final - Report
bigfatdash
Belum ada peringkat
LEA Digital Control Systems e
Dokumen12 halaman
LEA Digital Control Systems e
proesant
Belum ada peringkat
VantageSerialProtocolDocs v261
Dokumen60 halaman
VantageSerialProtocolDocs v261
filipi_vc
Belum ada peringkat
Computers and Humans Apart) Was Coined in 2000 by Luis Von Ahn
Dokumen9 halaman
Computers and Humans Apart) Was Coined in 2000 by Luis Von Ahn
SahilaKhan
Belum ada peringkat
Romney Accounting Information Systems Chapter 7
Dokumen21 halaman
Romney Accounting Information Systems Chapter 7
Paul Elliott
67% (3)
Toaz - Info Tutor Hack Via Termux PR
Dokumen15 halaman
Toaz - Info Tutor Hack Via Termux PR
Perwira Daing Guntara
Belum ada peringkat
SOMF 2.1 Conceptualization Model Language Specifications
Dokumen27 halaman
SOMF 2.1 Conceptualization Model Language Specifications
Melina Nancy
Belum ada peringkat
System Software Conversion From Cisco IOS To
Dokumen39 halaman
System Software Conversion From Cisco IOS To
Yeffrey Garban Márquez
Belum ada peringkat
Relief Valve From Hysys
Dokumen8 halaman
Relief Valve From Hysys
armin
100% (1)
Principles and Applications of The ICL7660 CMOS Voltage Converter PDF
Dokumen10 halaman
Principles and Applications of The ICL7660 CMOS Voltage Converter PDF
Peter Jordan
Belum ada peringkat
Global Mirror Whitepaper
Dokumen30 halaman
Global Mirror Whitepaper
Amar Shringare
Belum ada peringkat