Factor Analysis Using SAS PROC FACTOR
This page was developed by the Consulting group of the Division of Statistics and Scientific Computing at the University of Texas at Austin. We thank them for permission to distribute it via our web site.
Usage Note: Stat-53
Copyright 1995-1997, ACITS, The University of Texas at Austin
Statistical Services, 478-9372
Originally available online at: http://ssc.utexas.edu/docs/stat53.html
Factor Analysis Using SAS PROC FACTOR
This usage note describes how to run a factor analysis, specifically an exploratory common factor analysis, using the SAS FACTOR procedure. The document is composed of three sections: Introduction, Outline of Use, and An Illustrative Example. The Introduction explains what factor analysis is and when one should use it. The next section is a practical guide for running a factor analysis. Finally, the last section illustrates the use of common factor analysis using actual data.
What Is Factor Analysis?
Factor analysis is a generic term for a family of statistical techniques concerned with the reduction of a set of observable variables in terms of a small number of latent factors. It has been developed primarily for analyzing relationships among a number of measurable entities (such as survey items or test scores). The underlying assumption of factor analysis is that there exist a number of unobserved latent variables (or "factors") that account for the correlations among observed variables, such that if the latent variables are partialled out or held constant, the partial correlations among observed variables all become zero. In other words, the latent factors determine the values of the observed variables.
Each observed variable (y) can be expressed as a weighted composite of a set of latent variables (f's) such that
$y_i = a_{i1} f_1 + a_{i2} f_2 + \cdots + a_{ik} f_k + e_i$
where $y_i$ is the ith observed variable on the factors, the a's are the weights, and $e_i$ is the residual of $y_i$ on the factors. Given the assumption that the residuals are uncorrelated across the observed variables, the correlations among the observed variables are accounted for by the factors.
The following is an example of a simple path diagram for a factor analysis model. This diagram is a schematic representation of the above formula. F1 and F2 are two common factors. Y1, Y2, Y3, Y4, and Y5 are observed variables, possibly 5 subtests or measures of other observations such as responses to items on a survey. e1, e2, e3, e4, and e5 represent residuals or unique factors, which are assumed to be uncorrelated with each other. Any correlation between a pair of the observed variables can be explained in terms of their relationships with the latent variables.
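The last statement can be made explicit. As a sketch, assume the observed variables are standardized and the common factors are mutually uncorrelated with unit variance, and write $a_{ij}$ for the weight of factor $j$ in observed variable $i$ (notation assumed here for illustration). Because the residuals are uncorrelated with each other and with the factors, the correlation between two different observed variables depends only on their weights:

```latex
\operatorname{corr}(y_i, y_{i'}) \;=\; \sum_{j=1}^{k} a_{ij}\, a_{i'j}, \qquad i \neq i'
```

For the two-factor diagram this gives, for example, corr(Y1, Y2) = a11*a21 + a12*a22. If the factors are allowed to correlate, cross-product terms involving the factor correlations are added.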
Purposes of Factor Analysis
The primary purpose of factor analysis is data reduction and summarization. Factor analysis has been widely used, especially in the behavioral sciences, to assess the construct validity of a test or a scale. For example, a psychologist developed a new battery of 15 subtests to measure three distinct psychological constructs and wanted to validate that battery. A sample of 300 subjects was drawn from the population and measured on the battery of 15 subtests. The 300 by 15 data matrix was submitted to a factor analysis procedure. The output from that procedure was a 15 by 3 factor loading matrix, which represented the relationships among the observed variables (the 15 subtests) and the 3 latent factors. The number of factors extracted and the pattern of relationships among the observed variables and the factors provided the researcher with information on the construct validity of the test battery.
Common Factor Analysis vs. Component Analysis
Factor analysis as a generic term includes principal component analysis. While the two techniques are functionally very similar and are used for the same purpose (data reduction), they are quite different in terms of underlying assumptions.
The term "common" in common factor analysis describes the variance that is analyzed. It is assumed that the variance of a single variable can be decomposed into common variance, which is shared by other variables included in the model, and unique variance, which is unique to a particular variable and includes the error component. Common factor analysis (CFA) analyzes only the common variance of the observed variables; principal component analysis considers the total variance and makes no distinction between common and unique variance.
The selection of one technique over the other is based upon several criteria. First of all, what is the objective of the analysis? Common factor analysis and principal component analysis are similar in the sense that the purpose of both is to reduce the original variables into fewer composite variables, called factors or principal components. However, they are distinct in the sense that the obtained composite variables serve different purposes. In common factor analysis, a small number of factors are extracted to account for the intercorrelations among the observed variables, that is, to identify the latent dimensions that explain why the variables are correlated with each other. In principal component analysis, the objective is to account for the maximum portion of the variance present in the original set of variables with a minimum number of composite variables called principal components.
Secondly, what are the assumptions about the variance in the original variables? If the observed variables are measured relatively error free (for example, age, years of education, or number of family members), or if it is assumed that the error and specific variance represent a small portion of the total variance in the original set of variables, then principal component analysis is appropriate. But if the observed variables are only indicators of the latent constructs to be measured (such as test scores or responses to attitude scales), or if the error (unique) variance represents a significant portion of the total variance, then the appropriate technique to select is common factor analysis. Since the two methods often yield similar results, only CFA will be illustrated here.
Outline of Use
1. Preparing Data
It is not uncommon in social science studies for an investigator to conduct a factor analysis just because some multivariate data happen to be available. The investigator simply hunts for relationships among the variables without any a priori hypothesis about those relationships. With the availability of powerful computers and statistical packages, many advanced multivariate techniques including factor analysis, which were once confined to a special population for limited use, are now readily accessible to many individuals and are therefore susceptible to potential misuse. One of the vital issues factor analysts tend to overlook is that the quality of factor analytic research depends primarily on the quality of the input data submitted to the analysis. The expression "Garbage In, Garbage Out" fits factor analysis well.
Several important questions should be considered by a researcher preparing input data for factor analysis. First, what variables should be included in the analysis? Factor analysis is designed to explain why certain variables are correlated. Moreover, common factor analysis is concerned only with that portion of total variance shared by the variables included in the model. Therefore, you should not include variables that are not believed to be related to each other in any way.
Second, how many variables should be included? Factors are unobserved latent variables that can be inferred from a set of observed variables. Therefore, factors cannot emerge unless there is a sufficient number of observed variables that vary along the latent continuum. You cannot define a factor with a single observed variable. You should have a minimum of three observed variables for each factor expected to emerge. In Thurstone's terminology, the factors defined by only one or two observed variables are called "singlet" or "doublet" factors, which are not desirable. Guttman has shown that if a correlation matrix is suitable for common factor analysis, then R^-1 (the inverse of the correlation matrix) should approach a diagonal matrix as the number of variables increases while the number of factors remains constant. Kaiser and Rice[2] proposed a measure of sampling adequacy, which indicates how near R^-1 is to a diagonal matrix.
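The measure of sampling adequacy mentioned above can be requested from PROC FACTOR itself. The following is a minimal sketch, assuming a hypothetical raw data set SURVEY with twelve items Q1-Q12 (both names are placeholders and do not come from this note):

```sas
/* SURVEY and Q1-Q12 are hypothetical names used only for illustration */
proc factor data=survey method=principal priors=smc
            msa     /* Kaiser's measure of sampling adequacy, overall and per variable */
            corr;   /* print the correlation matrix that is being factored */
   var q1-q12;
run;
```

Values of the measure near 1 suggest that the inverse correlation matrix is close to diagonal; variables with low individual values are candidates for a closer look before the analysis proceeds.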
Third, is the number of observations sufficient to provide reliable estimates of the correlations between the variables? Correlation coefficients tend to be unstable and greatly influenced by the presence of outliers if the sample size is not large. It is generally unwise to conduct a factor analysis on a sample of fewer than 50 observations. Moreover, the sample size should also be considered in relation to the number of variables included in the analysis. Various rules of thumb have been proposed, with the minimum number of observations per variable ranging from 5 to 10. While there seems to be no definitive answer to this problem, everyone agrees that the more observations you have, the more valid the results you obtain.
Fourth, is correlation a valid measure of association among the variables to be analyzed? The correlation coefficient is being used as a measure of conceptual similarity of the variables. If strong curvilinear relationships are present among variables, for example, the correlation coefficient is not an appropriate measure. In such cases, the results of a factor analysis based on correlation coefficients will be invalid. The variables should meet the assumptions required for the correlation coefficient as well. However, in the social and behavioral sciences, we seldom have variables that exactly meet these assumptions. Ordinal and dichotomous variables have been submitted to factor analysis in the social and behavioral sciences. Unless the distributions of the variables are strongly nonnormal, factor analysis seems to be robust to minor violations of these assumptions.
2. Selecting a Factor Model
Once the input data are prepared for the analysis, it is necessary to decide on a factoring technique, that is, a method of extracting factors. In particular, you need to decide whether you want to perform factor analysis or principal components analysis. There is a procedure in SAS specifically designed for principal components analysis (PROC PRINCOMP), which is distinguished by its unique extraction method. On the other hand, if you decide on factor analysis, then you must choose an extraction technique. A variety of methods of factor extraction are available in the PROC FACTOR procedure in SAS: principal component, principal factor, iterative principal factor, unweighted least-squares factor, maximum-likelihood factor, alpha factor, image analysis, and Harris component analysis. The two most commonly employed factor analytic techniques are principal component and principal factor analysis. As discussed above, PCA is quite different from FA. The different FA techniques employ different criteria for extracting factors. Discussions on choosing among the different methods of factor extraction can be found in Loehlin.
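The choice described above maps onto a small number of statements. The sketch below assumes a hypothetical raw data set SURVEY with items Q1-Q12 (placeholder names); it is meant to show where the decision is expressed, not to recommend a particular method:

```sas
/* SURVEY and Q1-Q12 are hypothetical names used only for illustration */

/* Principal components analysis: the total variance is analyzed */
proc princomp data=survey;
   var q1-q12;
run;

/* Common factor analysis, principal factor method:
   METHOD=PRINCIPAL combined with prior communalities below 1 */
proc factor data=survey method=principal priors=smc;
   var q1-q12;
run;

/* Common factor analysis, maximum-likelihood extraction */
proc factor data=survey method=ml priors=smc nfactors=2;
   var q1-q12;
run;
```

The number of factors in the last step (NFACTORS=2) is an arbitrary value chosen for the sketch.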
3. Estimating Communalities
As mentioned earlier, in principal components analysis we do not make a distinction between the common and unique parts of the variation present in a variable. The correlation (covariance) matrix, with 1's (variances) down the main diagonal, is submitted to the analysis. On the other hand, a common factor analysis begins by substituting the diagonal of the correlation matrix with what are called prior communality estimates (h2). The communality estimate for a variable is the estimate of the proportion of the variance of the variable that is both error-free and shared with other variables in the matrix. Since the concept of common variance is hypothetical, we never know exactly in advance what proportion of the variance is common and what proportion is unique among variables. Therefore, estimates of communalities need to be supplied for a factor analysis. These estimates can be specified with the PRIORS= option of the PROC FACTOR statement. The simplest approach is to use the largest absolute correlation for a variable with any other variable as the communality estimate for the variable (PRIORS=MAX). A more sophisticated approach is to use the squared multiple correlation (R2) between the variable and all other variables (PRIORS=SMC). As the number of variables increases, the importance of accurate prior estimates decreases.
There are still other methods of estimating communalities available in SAS. Interested readers should refer to the SAS manual. Some method should be chosen because SAS by default sets all prior communalities to 1.0, which is the same as requesting a principal components analysis. This default setting has caused misunderstanding among novice users who are not aware of the consequences of overlooking it. Many researchers claim to have conducted a common factor analysis when actually a principal components analysis was performed.
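The effect of the PRIORS= option is easiest to see side by side. A minimal sketch, again assuming a hypothetical data set SURVEY with items Q1-Q12 (placeholder names):

```sas
/* SURVEY and Q1-Q12 are hypothetical names used only for illustration */

/* Prior = largest absolute correlation with any other variable */
proc factor data=survey method=principal priors=max;
   var q1-q12;
run;

/* Prior = squared multiple correlation with all other variables */
proc factor data=survey method=principal priors=smc;
   var q1-q12;
run;

/* No PRIORS= option: priors are set to 1.0, so this run is a
   principal components analysis, not a common factor analysis */
proc factor data=survey method=principal;
   var q1-q12;
run;
```

Checking the "Prior Communality Estimates" line of the output is a quick way to confirm which analysis was actually run.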
4. Determining the Number of Factors
Determining the optimal number of factors to extract is not a straightforward task, since the decision is ultimately subjective. There are several criteria for the number of factors to be extracted, but these are just empirical guidelines rather than an exact quantitative solution. In practice, most factor analysts seldom use a single criterion to decide on the number of factors to extract. Some of the most commonly used guidelines are the Kaiser-Guttman rule, percentage of variance, the scree test, the size of the residuals, and interpretability.
Kaiser-Guttman rule
The "eigenvalue greater than one" rule states that the number of factors to be extracted should be equal to the number of factors having an eigenvalue (variance) greater than 1.0. The rationale for choosing this particular value is that a factor must have variance at least as large as that of a single standardized original variable. Recall that in principal components analysis 1's are retained in the main diagonal of the correlation matrix; therefore, for p standardized variables there is a total variance of p to be decomposed into factors. This rule, however, is more appropriate for PCA than FA, and it should be adjusted downward when the common factor model is chosen. In a common factor analysis, communality estimates are inserted in the main diagonal of the correlation matrix. Therefore, for p variables the variance to be decomposed into factors is less than p. It has been suggested that the latent root (eigenvalue) criterion should be lower, around the average of the initial communality estimates. The PROC FACTOR statement has the option MINEIGEN=, allowing you to specify the latent root cutoff value. For example, MINEIGEN=1 retains the factors with eigenvalues greater than 1.
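A lowered cutoff of the kind suggested above can be passed directly through MINEIGEN=. The sketch below assumes a hypothetical data set SURVEY with items Q1-Q12, and the cutoff 0.42 is an arbitrary stand-in for the average prior communality, which you would read from an earlier run:

```sas
/* SURVEY, Q1-Q12 and the cutoff 0.42 are hypothetical values for illustration */
proc factor data=survey method=principal priors=smc
            mineigen=0.42;   /* retain factors whose eigenvalue exceeds 0.42 */
   var q1-q12;
run;
```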
Percentage of Variance
Another criterion, related to the latent root criterion, is the percentage or proportion of the common variance (defined by the sum of communality estimates) that is explained by successive factors. For example, if you set the cutting line at 75 percent of the common variance (PROPORTION=.75 or PERCENT=75), then factors will be extracted until the sum of eigenvalues for the retained factors exceeds 75 percent of the common variance, defined as the sum of initial communality estimates.
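The 75 percent cutting line from the example could be requested as follows. This is a sketch under the assumption of a hypothetical data set SURVEY with items Q1-Q12:

```sas
/* SURVEY and Q1-Q12 are hypothetical names used only for illustration */
proc factor data=survey method=principal priors=smc
            proportion=0.75;   /* same request as PERCENT=75 */
   var q1-q12;
run;
```

As the SAS documentation describes it, when more than one of NFACTORS=, MINEIGEN=, and PROPORTION= is in effect, the procedure keeps the smallest number of factors that satisfies any one of them, so it is worth checking in the output which criterion actually determined the count.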
Scree Test
Sometimes plotting the eigenvalues against the corresponding factor numbers gives insight into the maximum number of factors to extract. The SCREE option in the PROC FACTOR statement produces a scree plot that illustrates the rate of change in the magnitude of the eigenvalues for the factors. The rate of decline tends to be fast for the first few factors but then levels off. The "elbow", or the point at which the curve bends, is considered to indicate the maximum number of factors to extract. The figure below illustrates an example of a rather idealistic scree plot, where a clear elbow occurs at the fourth factor, after which the decline in the eigenvalues gradually levels off. The scree plot suggests a maximum of four factors in this example. One less factor than the number at the elbow might be appropriate if you are concerned about getting an overly defined solution. However, many scree plots do not give such a clear indication of the number of factors.
[Figure: idealized scree plot of eigenvalues against factor number]
Analysis of Residuals
If the factors are doing a good job in explaining the correlations among the original variables, we expect the predicted correlation matrix R* to closely approximate the input correlation matrix. In other words, we expect the residual matrix R - R* to approximate a null matrix. The RESIDUAL (or RES) option in the PROC FACTOR statement prints the residual correlation matrix and the partial correlation matrix (correlations between variables after the factors are partialled out or statistically controlled). If the residual correlations or partial correlations are relatively large (> 0.1), then either the factors are not doing a good job explaining the data or we may need to extract more factors to explain the correlations more closely. If maximum likelihood factors (METHOD=ML) are extracted, then the output includes the Chi-square test for the significance of residuals after the extraction of the given factors. This test comprises two separate hypothesis tests. The first test, labeled "Test of H0: No common factors", tests the null hypothesis that no common factors can sufficiently explain the intercorrelations among the variables included in the analysis. You want this test to be statistically significant (p < .05). A nonsignificant value for this test statistic suggests that your intercorrelations may not be strong enough to warrant performing a factor analysis, since the results from such an analysis could probably not be replicated.
The second Chi-square test statistic, labeled "Test of H0: N factors are sufficient", is the test of the null hypothesis that N common factors are sufficient to explain the intercorrelations among the variables, where N is the number of factors you specify with an NFACTORS=N option in the PROC FACTOR statement. This test is useful for testing the hypothesis that a given number of factors is sufficient to account for your data; in this instance your goal is a small chi-square value relative to its degrees of freedom. This outcome results in a large p-value (p > .05). One downside of this test is that the Chi-square test is very sensitive to sample size: given large degrees of freedom, this test will normally reject the null hypothesis of the residual matrix being a null matrix, even when the factor analysis solution is very good. Therefore, be careful in interpreting this test's significance value. Some data sets do not lend themselves to good factor solutions, regardless of the number of factors you extract.
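Both chi-square tests and the residual matrices come from a single maximum-likelihood run. A minimal sketch, assuming a hypothetical raw data set SURVEY with items Q1-Q12 and an arbitrary choice of three factors:

```sas
/* SURVEY, Q1-Q12 and NFACTORS=3 are hypothetical choices for illustration */
proc factor data=survey method=ml priors=smc
            nfactors=3    /* the N in "Test of H0: N factors are sufficient" */
            residuals;    /* print residual and partial correlation matrices */
   var q1-q12;
run;
```

Rerunning the step with NFACTORS=2 and NFACTORS=4 and comparing the second test and the residuals across runs is one common way to use this output, keeping in mind the sample-size caveat above.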
Interpretability
Another very important but often overlooked criterion for determining the number of factors is the interpretability of the factors extracted. Factor solutions should be evaluated not only according to empirical criteria but also according to the criterion of "theoretical meaningfulness." Extracting more factors will guarantee that the residual correlations get smaller and thus that the chi-square values get smaller relative to the number of degrees of freedom. However, noninterpretable factors may have little utility. That is, an interpretable three-factor solution may be more useful (not to mention more parsimonious) than a less interpretable four-factor solution with a better goodness-of-fit statistic.
A Priori Hypotheses
The problem of determining the number of factors is not a concern if the researcher has an a priori hypothesis about the number of factors to extract. That is, an a priori hypothesis can provide a criterion for the number of factors to be extracted. If a theory or previous research suggests a certain number of factors and the analyst wants to confirm the hypothesis or replicate the previous study, then a factor analysis with the prespecified number of factors can be run. The NFACTORS=n (or N=n) option in PROC FACTOR extracts the user-supplied number of factors. Ultimately, the criterion for determining the number of factors should be the replicability of the solution. It is important to extract only factors that can be expected to replicate themselves when a new sample of subjects is employed.
5. The Rotation of Factors
Once you decide on the number of factors to extract, the next logical step is to determine the method of rotation. The fundamental theorem of factor analysis is invariant within rotations. That is, the initial factor pattern matrix is not unique. We can get an infinite number of solutions, which produce the same correlation matrix, by rotating the reference axes of the factor solution to simplify the factor structure and to achieve a more meaningful and interpretable solution. The idea of simple structure has provided the most common basis for rotation, the goal being to rotate the factors simultaneously so as to have as many zero loadings on each factor as possible. The following figure is a simplified example of rotation, showing only one variable from a set of several variables.
The variable V1 initially has factor loadings (correlations) of .7 and .6 on factor 1 and factor 2 respectively. However, after rotation the factor loadings have changed to .9 and .2 on the rotated factor 1 and factor 2 respectively, which is closer to a simple structure and easier to interpret.
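The numbers in this example can be checked by hand. As a sketch, assume the rotation is orthogonal and turns the reference axes through an angle θ (the angle is inferred here from the two pairs of loadings; it is not stated in the original example):

```latex
\begin{aligned}
\cos\theta &= \tfrac{15}{17}, \qquad \sin\theta = \tfrac{8}{17} \qquad (\theta \approx 28.07^\circ) \\
a_1' &= \phantom{-}0.7\cos\theta + 0.6\sin\theta = \frac{10.5 + 4.8}{17} = 0.9 \\
a_2' &= -0.7\sin\theta + 0.6\cos\theta = \frac{-5.6 + 9.0}{17} = 0.2 \\
0.7^2 + 0.6^2 &= 0.85 = 0.9^2 + 0.2^2
\end{aligned}
```

The last line shows that the communality of V1 (the sum of its squared loadings) is unchanged by the rotation; only its distribution across the two factors changes.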
The simplest case of rotation is an orthogonal rotation, in which the angle between the reference axes of the factors is maintained at 90 degrees. More complicated forms of rotation allow the angle between the reference axes to be other than a right angle, that is, factors are allowed to be correlated with each other. These types of rotational procedures are referred to as oblique rotations. Orthogonal rotation procedures are more commonly used than oblique rotation procedures. In some situations, theory may mandate that the underlying latent constructs be uncorrelated with each other, and therefore oblique rotation procedures will not be appropriate. In other situations, where the correlations between the underlying constructs are not assumed to be zero, oblique rotation procedures may yield simpler and more interpretable factor patterns.
A number of orthogonal and oblique rotation procedures have been proposed. Each procedure has a slightly different simplicity function to be maximized. The ROTATE= option in the PROC FACTOR statement supports five orthogonal rotation methods (EQUAMAX, ORTHOMAX, QUARTIMAX, PARSIMAX, and VARIMAX) and two oblique rotation methods (PROCRUSTES and PROMAX). The VARIMAX method has been the most commonly used orthogonal rotation procedure.
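Switching between an orthogonal and an oblique solution is a one-word change. The sketch below assumes a hypothetical data set SURVEY with items Q1-Q12 and an arbitrary three-factor solution:

```sas
/* SURVEY, Q1-Q12 and NFACTORS=3 are hypothetical choices for illustration */

/* Orthogonal rotation: factors are kept uncorrelated */
proc factor data=survey method=principal priors=smc nfactors=3
            rotate=varimax;
   var q1-q12;
run;

/* Oblique rotation: factors are allowed to correlate */
proc factor data=survey method=principal priors=smc nfactors=3
            rotate=promax;
   var q1-q12;
run;
```

Comparing the inter-factor correlations printed by the second run against zero is a practical check on whether the orthogonality assumption of the first run is tenable.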
6. Interpretation of Factors
One part of the output from a factor analysis is a matrix of factor loadings. A factor loading or factor structure matrix is an n by m matrix of correlations between the original variables and their factors, where n is the number of variables and m is the number of retained factors. When an oblique rotation method is performed, the output also includes a factor pattern matrix, which is a matrix of standardized regression coefficients for each of the original variables on the rotated factors. The meaning of the rotated factors is inferred from the variables significantly loaded on their factors. A decision needs to be made regarding what constitutes a significant loading. A rule of thumb frequently used is that factor loadings greater than .30 in absolute value are considered to be significant. This criterion is just a guideline and may need to be adjusted. As the sample size and the number of variables increase, the criterion may need to be adjusted slightly downward; it may need to be adjusted upward as the number of factors increases. The procedure described next outlines the steps of interpreting a factor matrix.
1. Identifying significant loadings: The analyst starts with the first variable (row) and examines the factor loadings horizontally from left to right, underlining them if they are significant. This process is repeated for all the other variables. You can instruct SAS to perform this step by using the FUZZ= option in the PROC FACTOR statement. For instance, FUZZ=.3 prints only the factor loadings greater than or equal to .3 in absolute value.
Ideally, we expect a single significant loading for each variable on only one factor: across each row there is only one underlined factor loading. It is not uncommon, however, to observe split loadings, that is, a variable which has multiple significant loadings. On the other hand, if there are variables that fail to load significantly on any factor, then the analyst should critically evaluate these variables and consider deriving a new factor solution after eliminating them.
2. Naming of Factors: Once all significant loadings are identified, the analyst attempts to assign some meaning to the factors based on the patterns of the factor loadings. To do this, the analyst examines the significant loadings for each factor (column). In general, the larger the absolute size of the factor loading for a variable, the more important the variable is in interpreting the factor. The sign of the loadings also needs to be considered in labeling the factors. It may be important to reverse the scoring of the negatively worded items in Likert-type instruments to prevent ambiguity. That is, in Likert-type instruments some items are often negatively worded so that high scores on these items actually reflect low degrees of the attitude or construct being measured. Remember that the factor loadings represent the correlation or linear association between a variable and the latent factor(s). Considering all the variables loading on a factor, including the size and sign of the loadings, the investigator makes a determination as to what the underlying factor may represent.
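The two steps above are easier to carry out when the printed matrix is already thinned and sorted. A minimal sketch, assuming a hypothetical data set SURVEY with items Q1-Q12 and an arbitrary three-factor VARIMAX solution:

```sas
/* SURVEY, Q1-Q12 and NFACTORS=3 are hypothetical choices for illustration */
proc factor data=survey method=principal priors=smc nfactors=3
            rotate=varimax
            fuzz=0.3    /* loadings below .3 in absolute value print as missing */
            reorder;    /* group variables by the factor they load on most highly */
   var q1-q12;
run;
```

With REORDER in effect, the variables that define each factor appear in adjacent rows, which makes split loadings and variables that load on nothing easy to spot.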
7. Estimating Factor Scores
A factor is a latent continuum along which we can locate data points according to the varying amount of the construct that they possess. Factor scores can quantify individual cases on a latent continuum using a z-score scale, which ranges from approximately -3.0 to +3.0. The FACTOR procedure can provide the estimated scoring coefficients, which are then used in PROC SCORE to produce a matrix of estimated factor scores. You can then output these scores into a SAS dataset for further analysis.
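The two-step route through PROC SCORE can be sketched as follows. The data set names SURVEY, FACT_STATS, and FACT_SCORES and the items Q1-Q12 are hypothetical, and raw data (not a correlation matrix) are assumed, since scores can only be computed for individual cases:

```sas
/* All data set and variable names here are hypothetical */

/* Step 1: estimate scoring coefficients and save them */
proc factor data=survey method=principal priors=smc nfactors=3
            rotate=varimax
            score                  /* print the scoring coefficients */
            outstat=fact_stats;    /* save them, with other statistics, to a data set */
   var q1-q12;
run;

/* Step 2: apply the coefficients to the raw data */
proc score data=survey score=fact_stats out=fact_scores;
   var q1-q12;
run;
```

The output data set FACT_SCORES contains the original variables plus one estimated score per retained factor for each case.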
8. Factor Analysis Decision Diagram
The following diagram illustrates a general decision process for factor analysis. This decision process is described here as a linear flow of events for the sake of simplicity. However, it would be more realistic to have a number of feedback loops included in the diagram. That is, depending on the result at a given stage, any previously made decision may need to be modified.
9. Confirmatory Factor Analysis
Confirmatory factor analysis allows you to test very specific hypotheses regarding the number of factors, factor loadings, and factor intercorrelations. However, it is more complex to run than ordinary exploratory factor analysis, and a full discussion of it is beyond the scope of this document.
Factor Analysis Decision Diagram
III. An Illustrative Example
Below is an illustrative example of the application of common factor analysis to clarify the topics described in the previous sections. The example illustrates the application of common factor analysis to provide evidence of the construct validity of the Wechsler Intelligence Scale for Children (WISC-III).
The Wechsler Intelligence Scale for Children (WISC-III) was designed as a test of general intelligence to provide estimates of the intellectual abilities of children aged between 6 and 16. The WISC-III consists of 13 subtests, each measuring a different facet of intelligence. The matrix of intercorrelations among the 13 subtests, which served as the input data, was obtained from the manual and is shown in Table 1. Inspection of the correlation matrix shows that the correlations are substantial, indicating the presence of a substantial general factor.
Table 1. Correlation matrix for 13 subscales
PROC FACTOR can handle input data consisting of either a correlation matrix or the raw data matrix used to produce the correlation matrix. The correlation matrix can be a SAS dataset generated by the PROC CORR procedure or can be a text file containing the lower triangle (including the main diagonal) of a correlation matrix. For our example, a text file of correlations is created and called WISC.DAT. The following SAS DATA step code defines the type of the input data file WISC.DAT as a correlation matrix, and labels its variables. The _TYPE_='CORR' statement must be typed exactly as shown.
DATA d1 (TYPE=CORR);
   /* Mark every observation as a row of correlations */
   _TYPE_='CORR';
   /* MISSOVER keeps short lower-triangle rows from reading into the next line */
   INFILE 'wisc.dat' MISSOVER;
   INPUT inf sim ari voc com dig pic cod pia blo obj sym maz;
RUN;
The following SAS code calls the FACTOR procedure with some options. METHOD=P or METHOD=PRINCIPAL specifies the method for extracting factors to be the principal-axis factoring method. This option in conjunction with PRIORS=SMC performs a principal factor analysis. The option ROTATE=PROMAX performs an oblique rotation after an orthogonal VARIMAX rotation. It is specified here because the types of abilities that constitute human intelligence, which the WISC-III attempts to measure, are believed to be interrelated with each other. The CORR option requests that the correlation matrix be printed, and the RES or RESIDUALS option requests that a residual correlation matrix be printed. The residual correlation matrix shows the difference between the observed correlation matrix and the predicted correlation matrix. If the retained factors are sufficient to explain the correlations among the observed variables, the residual correlation matrix is expected to approximate a null matrix (most values < .10).
PROC FACTOR DATA=D1 METHOD=P PRIORS=SMC ROTATE=PROMAX SCREE CORR RES;
RUN;
Table 2 shows the prior communality estimates for the 13 subtests used in this analysis. The squared multiple correlations (SMC), which are printed below, represent the proportion of variance of each of the 13 subtests shared by all remaining subtests. The subtest MAZES has a prior communality estimate of 0.13222, which means that only 13% of the variance of the subtest MAZES is shared by all other subtests, indicating that this subtest measures something somewhat different from the other subtests. A small communality estimate might indicate that the variable or item may need to be modified or even dropped.
Table 2. Initial Communality Estimates
Initial Factor Method: Principal Factors
Prior Communality Estimates: SMC
    INFO       SIM        ARITH      VOC        COMP
    0.594574   0.587543   0.481994   0.636296   0.473358
    DIGIT      PICTCOM    CODING     PICTARG
    0.224104   0.385580   0.306120   0.287693
    BLOCK      OBJECT     SYMBOL     MAZES
    0.533202   0.439176   0.422932   0.132220
Eigenvalues of the Reduced Correlation Matrix:
Total = 5.50479208   Average = 0.42344554
The sum of all prior communality estimates, 5.505 in this example, is the estimate of the common variance among all subtests. This initial estimate of the common variance constitutes about 42% of the total variance present among all 13 subtests.
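The 42% figure follows from the fact that each of the 13 standardized subtests contributes one unit of variance, so the total variance is 13. Writing $h_i^2$ for the prior communality of subtest $i$ (notation assumed here for illustration):

```latex
\frac{\sum_{i=1}^{13} h_i^2}{13} \;=\; \frac{5.505}{13} \;\approx\; 0.423
```

This ratio is also the average prior communality, which is the lowered eigenvalue cutoff suggested earlier for common factor analysis.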
Table 3 shows the factor numbers and corresponding eigenvalues. According to the Kaiser-Guttman rule, only one factor can be retained because only the first factor has an eigenvalue greater than one. However, as suggested in the previous section, this criterion may be applicable only to principal component analysis, not common factor analysis. Two factors can be retained if the average eigenvalue (0.423) instead of 1 is used as the criterion. The authors of the WISC-III retained all factors with positive eigenvalues and thus retained the first four factors. The fifth and following factors have negative eigenvalues, which may not be intuitively appealing, just as a negative variance is not. This counterintuitive result occurs only in common factor analysis, due to the restriction that the sum of eigenvalues be set equal to the estimated common variance, not the total variance.
Table 3. Eigenvalues of the Reduced Correlation Matrix
                    1         2         3         4         5
    Eigenvalue    5.1046    0.6838    0.4021    0.1479   -0.0130
    Difference    4.4208    0.2817    0.2542    0.1609    0.0094
    Proportion    0.9273    0.1242    0.0731    0.0269   -0.0024
    Cumulative    0.9273    1.0515    1.1246    1.1514    1.1491
                    6         7         8         9        10
    Eigenvalue   -0.0224   -0.0569   -0.0782   -0.0848   -0.0897
    Difference    0.0345    0.0213    0.0065    0.0049    0.0412
    Proportion   -0.0041   -0.0103   -0.0142   -0.0154   -0.0163
    Cumulative    1.1450    1.1347    1.1205    1.1051    1.0888
                   11        12        13
    Eigenvalue   -0.1310   -0.1547   -0.2031
    Difference    0.0237    0.0485
    Proportion   -0.0238   -0.0281   -0.0369
    Cumulative    1.0650    1.0369    1.0000
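The Proportion and Cumulative rows are the eigenvalues divided by the estimated common variance (the sum of the prior communalities, 5.5048), not by the total variance of 13. As a worked check, writing $\lambda_j$ for the jth eigenvalue (notation assumed here):

```latex
\begin{aligned}
\frac{\lambda_1}{\sum_j \lambda_j} &= \frac{5.1046}{5.5048} \approx 0.9273 \\
\frac{\lambda_1 + \lambda_2}{\sum_j \lambda_j} &= \frac{5.1046 + 0.6838}{5.5048} \approx 1.0515
\end{aligned}
```

The cumulative proportion can exceed 1 because the later eigenvalues are negative; they bring the running total back down to exactly 1 at the thirteenth factor.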
The scree plot shown below seems to suggest the presence of a general factor, as expected from the inspection of the correlation matrix. A large first eigenvalue (5.1) and a much smaller second eigenvalue (0.68) suggest the presence of a dominant global factor. Stretching it to the limit, one might argue that a secondary elbow occurred at the fifth factor, implying a four-factor solution. That is equivalent to retaining all factors with positive eigenvalues. Research has suggested that the structure of the Wechsler intelligence scales is hierarchical. That is, at the top of the hierarchy all subtests converge to a single general factor, below which are several less general factors defined by clusters of subtests. A four-factor solution is more interesting and meaningful than a single-factor solution for investigating the hierarchical structure of the WISC-III. The results presented in the following section are based on the four-factor solution, which was obtained by repeating the analysis with the NFACTOR option specifying that the first four factors be retained.
[Figure: Scree Plot of Eigenvalues]
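The repeated run is not listed in this note. A plausible sketch, assuming it differs from the earlier call only by the added NFACTORS=4 option and reuses the data set D1 defined above:

```sas
/* Assumed reconstruction: the earlier call with NFACTORS=4 added */
PROC FACTOR DATA=D1 METHOD=P PRIORS=SMC ROTATE=PROMAX NFACTORS=4 CORR RES;
RUN;
```

With ROTATE=PROMAX the output contains both the preliminary VARIMAX solution and the final oblique solution, which matches the sequence of tables that follows.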
Table 4. Initial Factor Pattern
               FACTOR1    FACTOR2    FACTOR3    FACTOR4
    INFO       0.76124   -0.26507    0.00573    0.00419   INFORMATION
    SIM        0.75825   -0.26807    0.00088   -0.01732   SIMILARITY
    ARITH      0.70320   -0.04219    0.07006    0.21817   ARITHMETIC
    VOC        0.77712   -0.29967    0.08268   -0.07819   VOCABULARY
    COMP       0.67220   -0.21792    0.11383    0.09479   COMPREHENSION
    DIGIT      0.45938    0.01293    0.10982    0.23284   DIGIT SPAN
    PICTCOM    0.61799    0.06079   -0.23502    0.05384   PICTURE COMPLETION
    CODING     0.40429    0.33855    0.34093    0.06015   CODING
    PICTARG    0.54687    0.11799   -0.0165     0.13620   PICTURE ARRANGEMENT
    BLOCK      0.71609    0.21503   -0.2255     0.06332   BLOCK DESIGN
    OBJECT     0.62675    0.21928   -0.2652     0.01736   OBJECT ASSEMBLY
    SYMBOL     0.57731    0.36078    0.23968   -0.03620   SYMBOL SEARCH
    MAZES      0.32498    0.21379   -0.12221   -0.00324   MAZES
Variance explained by each factor
    FACTOR1     FACTOR2     FACTOR3     FACTOR4
    5.104620    0.683788    0.402128    0.147927
Final Communality Estimates: Total = 6.338464
Table 4 above shows the initial unrotated factor structure matrix, which consists of the correlations between the 13 subtests and the four retained factors. The current estimate of the common variance is now 6.338, which is somewhat larger than the initial estimate of 5.505.
The off-diagonal elements of the residual correlation matrix are all close to 0.01, indicating that the correlations among the 13 subtests can be reproduced fairly accurately from the retained factors. The root mean squared off-diagonal residual is 0.0178. The inspection of the partial correlation matrix yields similar results: the correlations among the 13 subtests after the retained factors are accounted for are all close to zero. The root mean squared partial correlation is 0.03, indicating that four latent factors can accurately account for the observed correlations among the 13 subtests.
The table shown below is the factor structure matrix after the VARIMAX rotation. The correlations greater than 0.30 are underlined. There are some split loadings, where a variable is significantly (> 0.3) loaded on more than one factor. This matrix, however, is not interpreted because an oblique solution has been requested.
Table 5. Rotated Factor Pattern (VARIMAX)
          FACTOR1   FACTOR2   FACTOR3   FACTOR4
INFO      0.71862   0.29392   0.12626   0.17632   INFORMATION
SIM       0.72023   0.29506   0.12237   0.16230   SIMILARITY
ARITH     0.48726   0.30656   0.23918   0.38771   ARITHMETIC
VOC       0.77718   0.23819   0.17933   0.11727   VOCABULARY
COMP      0.65565   0.29763   0.21399   0.08092   COMPREHENSION
DIGIT     0.29824   0.16987   0.20796   0.34843   DIGIT SPAN
PICTCOM   0.37579   0.53504   0.10572   0.07124   PICTURE COMPLETION
CODING    0.12040   0.14820   0.59510   0.08546   CODING
PICTARG   0.33269   0.37653   0.28170   0.00121   PICTURE ARRANGEMENT
BLOCK     0.32270   0.64662   0.21651   0.21154   BLOCK DESIGN
OBJECT    0.26569   0.63181   0.17377   0.10766   OBJECT ASSEMBLY
SYMBOL    0.21005   0.32244   0.59566   0.13894   SYMBOL SEARCH
MAZES     0.07226   0.36298   0.15838   0.06487   MAZES

Variance explained by each factor
          FACTOR1    FACTOR2    FACTOR3    FACTOR4
          2.891010   1.894832   1.110948   0.441675
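
As a quick arithmetic check (added here as an illustration, not part of the SAS output), the variances explained by the four VARIMAX-rotated factors still add up to the total of the final communality estimates reported before rotation, 6.338464, up to rounding in the last digit. An orthogonal rotation redistributes the common variance among the factors without changing its total:

```latex
\[
2.891010 + 1.894832 + 1.110948 + 0.441675 = 6.338465 \approx 6.338464
\]
```
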
Table 6 shown below is the factor structure matrix after the oblique PROMAX rotation, which allows the latent factors to be correlated with each other. The matrix of inter-factor correlations (Table 7) shows that the factors are substantially correlated with each other. The inter-factor correlations range between 0.4 and 0.65. If we submit those intercorrelated factors to a new factor analysis, we might be able to obtain a single second-order factor, which would correspond to the general intelligence or g factor in previous research.

One downside of an oblique rotation method is that if the correlations among the factors are substantial, it is sometimes difficult to distinguish among factors by examining the factor loadings. In such situations, you should investigate the factor pattern matrix, which is a matrix of the standardized coefficients for the regression of the observed variables on the factors.
Table 6. Factor Structure (Correlations)
          FACTOR1   FACTOR2   FACTOR3   FACTOR4
INFO      0.80153   0.56964   0.33700   0.52105   INFORMATION
SIM       0.80059   0.55913   0.33257   0.50906   SIMILARITY
ARITH     0.65384   0.55813   0.42927   0.65702   ARITHMETIC
VOC       0.84027   0.53362   0.37803   0.48942   VOCABULARY
COMP      0.71732   0.45943   0.37569   0.41350   COMPREHENSION
DIGIT     0.40958   0.35214   0.32524   0.50255   DIGIT SPAN
PICTCOM   0.53937   0.64229   0.30602   0.37733   PICTURE COMPLETION
CODING    0.28294   0.32896   0.63030   0.31811   CODING
PICTARG   0.47527   0.51677   0.41891   0.30366   PICTURE ARRANGEMENT
BLOCK     0.56601   0.77315   0.44326   0.54029   BLOCK DESIGN
OBJECT    0.48561   0.71459   0.37858   0.41641   OBJECT ASSEMBLY
SYMBOL    0.42630   0.52381   0.69512   0.44612   SYMBOL SEARCH
MAZES     0.21660   0.39830   0.25905   0.22942   MAZES
Table 7. Inter-factor Correlations
          FACTOR1   FACTOR2   FACTOR3   FACTOR4
FACTOR1   1.00000   0.64770   0.43503   0.58664
FACTOR2   0.64770   1.00000   0.52336   0.57564
FACTOR3   0.43503   0.52336   1.00000   0.47436
FACTOR4   0.58664   0.57564   0.47436   1.00000
Table 8 is the factor pattern matrix, which will be used to interpret the meaning of the factors. The values in this matrix are the standardized regression coefficients, which are functionally related to the partial or semipartial correlation between a variable and the factor when the other factors are held constant. Therefore, a value in the matrix represents the individual and nonredundant contribution that each factor is making to predict a subtest.
Table 8. Rotated Factor Pattern (Standardized Regression Coefficients)
          FACTOR1   FACTOR2   FACTOR3   FACTOR4
INFO      0.73663   0.06911  -0.0553    0.07540   INFORMATION
SIM       0.74378   0.07445  -0.05694   0.05688   SIMILARITY
ARITH     0.35704   0.08393   0.05243   0.37438   ARITHMETIC
VOC       0.85010  -0.02674   0.02492   0.00572   VOCABULARY
COMP      0.71870  -0.0391    0.09895  -0.0325    COMPREHENSION
DIGIT     0.16057  -0.01259   0.08321   0.37555   DIGIT SPAN
PICTCOM   0.24101   0.54702  -0.06151  -0.04977   PICTURE COMPLETION
CODING    0.00651  -0.01816   0.62315   0.02916   CODING
PICTARG   0.25467   0.31837   0.20034  -0.12403   PICTURE ARRANGEMENT
BLOCK     0.06661   0.65412   0.01652   0.11685   BLOCK DESIGN
OBJECT    0.04111   0.69028   0.00237  -0.00618   OBJECT ASSEMBLY
SYMBOL    0.03508   0.17311   0.56088   0.05983   SYMBOL SEARCH
MAZES    -0.08719   0.40886   0.07943   0.00754   MAZES
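
The three oblique-rotation tables are linked by a standard identity: the factor structure matrix equals the factor pattern matrix post-multiplied by the inter-factor correlation matrix. The worked lines below are an illustrative check added here, not SAS output, and the symbols S, P, and \Phi are notation assumed for this sketch. They reproduce the Table 6 correlation between ARITH and the first factor from the ARITH row of Table 8 and the first column of Table 7:

```latex
\[
S = P\,\Phi, \qquad s_{i1} = \sum_{k=1}^{4} p_{ik}\,\phi_{k1}
\]
\[
s_{\mathrm{ARITH},1} = 0.35704(1.00000) + 0.08393(0.64770) + 0.05243(0.43503) + 0.37438(0.58664) \approx 0.65384
\]
```

The unique contribution of the first factor to ARITH is only about 0.36, yet the correlation is about 0.65 because the first factor is itself correlated with the other three. This is why the pattern matrix, rather than the structure matrix, is used to interpret correlated factors.
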
The subtests significantly loaded on the first factor are the Information, Similarity, Arithmetic, Vocabulary, and Comprehension subtests. These are the subtests that are orally presented and require verbal responses. Therefore, this factor may be named "Verbal Comprehension." The second factor is dominated by the following subtests: Picture Completion, Picture Arrangement, Block Design, and Object Assembly. All of these subtests have a geometric or configural component in them: they measure the skills that require the manual manipulation or organization of pictures, objects, blocks, and the like. Therefore, this factor may be named "Perceptual Organization." The two subtests loaded on the third factor are the Coding and Symbol Search subtests. Both subtests measure the speed of simple coding or searching processes. Therefore, this factor can be named "Processing Speed." Finally, the Arithmetic and Digit Span subtests dominate the fourth factor. Both subtests deal with numerical problems or numbers, so this factor can be named "Numerical Ability."

The last two factors are doublets, since they are loaded by only two subtests each. Therefore, they are conceptually weak compared to the first two factors, and more subtests may need to be added to these factors to make them conceptually sound.
It is possible to estimate the factor scores, that is, a subject's relative standing on each of the factors, if the original subject-by-variable raw data matrix is available. To compute the factor scores for all subjects on all factors, use the following SAS code:

   /* SCORE makes PROC FACTOR write the scoring coefficients to OUTSTAT= */
   PROC FACTOR DATA=raw (other options here) SCORE OUTSTAT=fact;
   RUN;
   /* Apply the scoring coefficients in fact to the raw data */
   PROC SCORE DATA=raw SCORE=fact OUT=scores;
   RUN;

where raw is the original data matrix, fact is the matrix of factor scoring coefficients, and scores is the matrix of factor scores for subjects.
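
The two steps can be written out more fully as the sketch below. It is only an illustration under stated assumptions: it presumes a raw data set named raw that holds the 13 subtests under the variable names used in the tables, and the METHOD=, PRIORS=, NFACTORS=, and ROTATE= values are example choices rather than settings confirmed by this note. The closing PROC CORR step is an optional check that the estimated scores are correlated, as expected after an oblique rotation.

```sas
/* Sketch only: RAW is an assumed subject-by-variable data set, and the
   extraction and rotation options are illustrative choices. */
proc factor data=raw method=prinit priors=smc nfactors=4
            rotate=promax score outstat=fact;
   var info sim arith voc comp digit pictcom
       coding pictarg block object symbol mazes;
run;

/* Multiply the standardized subtest scores by the scoring coefficients
   stored in FACT to obtain one score per subject per factor. */
proc score data=raw score=fact out=scores;
   var info sim arith voc comp digit pictcom
       coding pictarg block object symbol mazes;
run;

/* PROC FACTOR names the factors Factor1-Factor4 by default, and
   PROC SCORE uses those names for the new variables in SCORES. */
proc corr data=scores;
   var factor1-factor4;
run;
```
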
Footnotes

1. Guttman, L. (1953) "Image Theory for the Structure of Quantitative Variables", Psychometrika, 18, 277-296.
2. Kaiser, H.F., and Rice, J. (1974) "Little Jiffy, Mark IV", Educational and Psychological Measurement, 34, 111-117.
3. Loehlin, J.C. (1992) Latent Variable Models. Erlbaum Associates, Hillsdale NJ.
4. SAS/STAT User's Guide, 1980, SAS Institute Inc., p. 785.
5. Manual for the Wechsler Intelligence Scale for Children (WISC-III), New York 1994.