0 Suka0 Tidak suka

0 tayangan5 halamanok

Feb 03, 2019

© © All Rights Reserved

- Frequency Analysis of Extreme Events
- Process Capability
- sppsurvival
- jurnal reading 2 rikojumat11 - Copy.ppt
- OLS
- Review of A Large Clinical Series: Predicting Death for Patients With Abdominal Septic Shock
- SSCE2193 Ch3 Exercises
- Ratio and Product Type Estimators Using Stratified Ranked Set Sampling
- Manual Cost Cultivation Surveys 23july08
- Back Matter
- Meng 381 Assignment 1
- Bankruptcy Prediction With Industry Effects
- mit6 041f10 l23
- How Durable is the Effect of Low Intensity
- Van Der Meer_Code for Dike Height Design & Examination Based on Wave Run-up and Wave Over Topping
- Probsta Chapter 8
- Comparison of Fatigue Data Using the Maximum Likelihood Method
- St 205_lecturer Notes
- Quadratic mean differentiability Example
- STAT2303 UARK Week 03 Lesson 01 Confidence Intervals

Anda di halaman 1dari 5

# dj

The Nelson–Aalen estimator is a nonparametric esti- " =

A(t) , (1)

mator which may be used to estimate the cumu- r

t ≤t j

j

data (see Survival Distributions and Their Charac- where rj is the number of individuals at risk (i.e.

teristics). Since no distributional assumptions are alive and not censored) just prior to time tj . Thus

needed, one important use of the estimator is to check the Nelson–Aalen estimator is an increasing right-

graphically the fit of parametric models, and this is continuous step function with increments dj /rj at

the reason why it was originally introduced by Nel- the observed failure times. The variance of the Nel-

son [10, 11]. Independently of Nelson, Altshuler [2] son–Aalen estimator may be estimated by

derived the same estimator in the context of com- # (rj − dj )dj

peting risks animal experiments. Later, by adopting σ 2 (t) =

" . (2)

a counting process formulation, Aalen [1] extended t ≤t

(rj − 1)rj2

j

setups, and studied its small and large sample proper- It may be shown (see below) that the Nelson–Aalen

ties using martingale methods. The estimator is nowa- estimator (1) as well as the variance estimator (2) are

days denoted the Nelson–Aalen estimator, although almost unbiased. In large samples the Nelson–Aalen

other names (the Nelson estimator, the Altshuler estimator, evaluated at a given time t, is approxim-

estimator, the Aalen–Nelson estimator, the empiri- ately normally distributed, so a standard 100(1 −

cal cumulative hazard estimator) are sometimes used α)% confidence interval for A(t) takes the form

as well. Below we present a number of situations " ± z1−α/2"

where the Nelson–Aalen estimator may be applied A(t) σ (t), (3)

and exemplify its use in one particular case. Further-

with z1−α/2 the 1 − α/2 fractile of the standard nor-

more, we indicate how counting processes provide a

mal distribution. The approximation to the normal

framework which allows for a unified treatment of all

distribution is improved by using a log transform giv-

these diverse situations, and we summarize the most ing the confidence interval

important properties of the Nelson–Aalen estimator.

$ %

A detailed account is given in [3, Section IV.1].

" exp ±z1−α/2 "

A(t)

σ (t)

. (4)

"

A(t)

sizes [5].

Consider first the survival data situation, where we Right censoring is not the only kind of data incom-

want to study the time to death (or some other pleteness in survival analysis. Often, e.g. in epidemi-

event) for a homogeneous population with hazard ological applications, individuals are not followed

rate function

! t α(t) and cumulative hazard rate function from time zero (in the relevant time scale, typically

A(t) = 0 α(s) ds. Assume that we have a sample of age), but only from a later entry time (conditional on

n individuals from this population. Our observation of survival until this entry time). Thus, in addition to

the survival times for these individuals will typically right censoring, the survival data are subject to left

be subject to right censoring, meaning that for some truncation. For such data we may still use the Nel-

individuals we only know that their true survival son–Aalen estimator (1) and estimate its variance by

times exceed certain censoring times. The censoring (2). The number at risk, rj , now is the number of

is assumed to be independent in the sense that the individuals who have entered the study before time

additional knowledge of censorings before any time tj and are still in the study just prior to tj . For left-

t does not alter the risk of failure at t (see Censored truncated data the numbers at risk, rj , may be low

Data). We denote by t1 < t2 < · · · the times when for small values of tj . This will result in estimates

deaths are observed and let dj be the number of " which have large sampling errors. But because

A(t)

individuals who die at tj . the increments of the Nelson–Aalen estimator are

2 Nelson–Aalen Estimator

small time values has no influence on the increment

" − A(s)

A(t) " of the Nelson–Aalen estimator over a 2.0

later time interval (s, t]. An estimator for the variance

of this increment is " σ 2 (t) − "

σ 2 (s). 1.5

Quite often we want to estimate the survival distri-

bution function S(t) = exp[−A(t)], representing the 1.0

probability that an individual will be alive at time

t. This may be done from right-censored and/or 0.5

left-truncated survival data by the Kaplan–Meier

estimator. The relation A(t) = − ln S(t) suggests 0.0

that the cumulative hazard rate function alternatively 0 2 4 6 8 10

may be estimated as minus the logarithm of the Years since randomization

Kaplan–Meier estimator. Even though this estima-

Figure 1 Nelson–Aalen estimate of the cumulative haz-

tor numerically will be close to the Nelson–Aalen

ard rate function for death for 138 placebo-treated male

estimator, the latter is the canonical one from a theo- patients with liver cirrhosis, with 95% log-transformed con-

retical point of view. Furthermore, the Nelson–Aalen fidence intervals

estimator may be used in a number of different sit-

uations (see below) while the alternative estimator

highest just after randomization. (This may, at least in

applies only to the survival data situation.

part, be due to heterogeneity which is not accounted

for in our simple analysis.) The hazard rate func-

tion is approximately 0.3 per year for the first 9–10

An Illustration months and slightly below 0.2 per year thereafter

when estimated as the average slope of the curve

To give an illustration of the Nelson–Aalen esti-

over the relevant time periods. More formal proce-

mator we use data from a randomized clinical trial

dures for smoothing the Nelson–Aalen estimate in

for patients with histologically verified liver cirrho-

order to obtain an estimate for the hazard rate func-

sis. Patients were recruited from several hospitals

tion itself are available but will not be considered here

in Copenhagen between 1962 and 1969 and were (see Smoothing Hazard Rates). A further discussion

followed until death, lost to follow-up or until the and analysis of the cirrhosis data is given in [12]. The

closing date of the study, October 1, 1974. The time data were also used for illustrative purposes in [3].

variable of interest is time since entry into the study.

Patients are right censored if alive on October 1,

1974, or if lost to follow-up before that date. Multi-state Models and Recurrent Events

We consider only the 138 placebo-treated male

patients. Their median age at entry was 57 years, The survival analysis setup considered above may

while the lower and upper quartiles were 51 and be generalized in two directions. More than one

66 years, respectively. Of the 138 patients, 88 died type of event may be considered for each individual

during the study. The Nelson–Aalen estimate for under study, and/or the event in question may happen

these patients is shown in Figure 1 with 95% con- more than once for each individual. Examples of

fidence intervals computed according to (4). Even the first type are competing risks with two or more

though the cumulative hazard rate function provides causes of death and the Markov illness–death model

a useful summary measure (e.g. [6, Section 2.3]), it with the states “healthy”, “diseased”, and “dead”

is usually the hazard rate function itself which is the (see Counting Process Methods in Survival Ana-

entity of real interest. So when interpreting the esti- lysis). More generally, we may consider any Markov

mate in Figure 1, we mainly focus on the “slope” of process with a finite number of states which may be

the curve. The estimate of the cumulative hazard rate used to model the life history of an individual. An

function is steeper for the first 9–10 months after example of the second type is an inhomogeneous

randomization than at later times. Therefore we have Poisson process with intensity α(t) modeling the

evidence that the risk of dying for these patients is occurrence of some recurrent event like episodes of

Nelson–Aalen Estimator 3

angina pectoris in patients with coronary heart disease as individual i). Typically the µi (t) will be known

or infections in AIDS patients. For both of these from published life tables for the general population.

two types of situations we observe the times when In this situation the Nelson–Aalen estimator may be

events occur for a number of individuals (modeled used to! estimate the cumulative relative mortality

t

as iid copies of the relevant process) who need A(t) = 0 α(s) ds. All that is required is that rj in

not all be observed over the same interval of time. (1) be taken to denote the sum of the external rates

The Nelson–Aalen estimator may then be applied to µi (tj ) for all individuals at risk just prior to tj . An

estimate cumulative intensities. illustration of this use of the Nelson-Aalen estimator

To be specific, consider a finite-state Markov is provided by Breslow & Day [7, Chapter 5].

process with transition intensities αgh (t) for g # = h.

Focusing on fixed g and h in what follows, we

drop the subscripts and write just α(t) for the g → An Epidemic Model

h transition intensity. Furthermore, we denote by

t1 < t2 < · · · the times when transitions from g to A simple model for the spread of an infectious dis-

h are observed. Let dj be the number of individ- ease in a community is the following (see Epidemic

uals who experience a g → h transition at tj , and Models, Stochastic). At the start of the epidemic, i.e.

write rj for the number of individuals in state g at time t = 0, some individuals make contact with

(i.e. at risk for a g → h transition) just prior to time individuals from elsewhere and are thereby infected

tj . Then! the cumulative g → h transition intensity with the disease. There are no further infections

t

A(t) = 0 α(s) ds may be estimated by (1) and its from outside the community during the course of the

variance by (2). Similarly, the integrated intensity of epidemic. Let S(t) and I (t) denote the number of

an inhomogeneous Poisson process may be estimated susceptibles and infectives, respectively, just prior to

with the tj s denoting the times of observed events, time t. Assuming random mixing, the infection inten-

and the dj s and rj s being the corresponding num- sity in the community at time t becomes α(t)S(t)I (t),

bers of events and numbers at risk, respectively. An where α(t) is the infection rate per possible con-

illustration of the use of the Nelson–Aalen estimator tact. We denote by 0 < t1 < t2 < · · · the times when

to estimate integrated Markov transition intensities is individuals are infected and let dj denote the num-

given by Keiding & Andersen [9]. ber infected !at tj . Then the cumulative infection

t

rate, A(t) = 0 α(s) ds, may be estimated by the Nel-

son–Aalen estimator (1) where now rj = S(tj )I (tj );

Two Other Applications see Becker [4, Section 7.6] for an illustration.

For the situations considered so far, (1) and (2) apply

with rj the number at risk at tj for the event in

question. The use of the Nelson–Aalen estimator is, Counting Process Formulation and Small

however, not restricted to such situations. We mention Sample Properties

here two other applications and return to a general

discussion below. In general we consider the occurrences of some

events of interest (e.g. deaths, occurrences of a dis-

ease, infections), and denote by 0 < t1 < t2 < · · · the

Relative Mortality

times when an event is observed. We assume that two

Our first example considers right-censored and/or or more events cannot occur at the same time, so that

left-truncated survival data, but they no longer come there are no tied observations. (The handling of ties

from a homogeneous population. Rather, we assume is discussed briefly below.) Then the process N (t)

that the hazard rate function of the ith individual may counting the number of observed events in the time

be written as the product α(t)µi (t), where α(t) is interval [0, t] is a (univariate) counting process. The

a relative mortality common to all individuals and behavior of N (t) is governed by its intensity process

µi (t) is the hazard rate function at time t for a person λ(t) given heuristically by

from an external standard population corresponding

to the ith individual (e.g. of the same sex and age λ(t) dt = Pr(event occurs in [t, t + dt)|Ft− ).

4 Nelson–Aalen Estimator

!t

Here Ft− represents all the information available to 2

0 [J (s)/Y (s) ] dN (s). Thus the variance estimator

the researcher just before time t. The counting pro- (2) is almost unbiased when there are no ties. Finally,

cess satisfies Aalen’s multiplicative intensity model a martingale has uncorrelated increments, and by

if we may write its intensity process as (7) this is (almost) the case for the Nelson–Aalen

estimator as well.

λ(t) = α(t)Y (t), (5) In the presence of ties, i.e. when the number of

events dj at tj exceeds one, the process N (t) count-

for some unknown function α(t) and some observable

ing occurrences of events in [0, t] may have jumps of

process Y (t) whose value at time t is known just

size two or larger and is therefore no longer a count-

prior to t. All the situations considered above give

counting processes which fulfill (5). Survival data 'n process. Often, however, we may write N (t) =

ing

from a homogeneous population, finite-state Markov i=1 Ni (t), where Ni (t) is a counting process reg-

istering the events for individual i. If we consider a

processes, and the inhomogeneous Poisson process,

homogeneous population where the rates of occur-

all give a Y (t) process which is the number at

rence of the events are the same for all individuals,

risk just prior to time t. For the model for relative

we may adopt the discrete extension of the model

mortality, Y (t) is the sum of the µi (t) for those

described in [3, pp. 180–181]. For this extended

at risk just before t, while for the epidemic model,

model, the arguments of [8, pp. 94–96], apply, to

Y (t) = S(t)I (t). The common structure of all these

show that the variance estimator (2) is almost unbi-

models when formulated as counting processes is

the reason why the Nelson–Aalen estimator may be ased also in the presence of ties. This justifies the

applied to all these diverse problems. use of the tie-corrected estimator (2) for all situa-

In fact, the counting process formulation provides tions considered above, except for the model with

a framework which makes it simple to study the sta- relative mortality and the epidemic model. Within the

tistical properties of the Nelson–Aalen estimator. We framework of the extended model the Nelson–Aalen

briefly indicate a few main steps and refer to [3, estimator is a nonparametric maximum likelihood

Section IV.1.1] for a thorough treatment. First, we estimator; see [3, Section IV.1.5] for details and fur-

note that, with rj = Y (tj ), we may write the Nel- ther discussion.

son–Aalen estimator (1) as

& t

J (s)

" =

A(t) dN (s), (6) Weak Convergence and Confidence Bands

0 Y (s)

where J (s) = I (Y (s) > 0) and 0/0 is interpreted as By (7) the martingale central limit theorem may be

used to prove that, considered as a stochastic process,

! t Then using (5), (6), and the decomposition N (t) =

0.

the Nelson–Aalen estimator (properly normalized)

0 λ(s) ds + M(t) of a counting process into a sum

of its integrated intensity process and a local square converges weakly to a mean zero Gaussian martin-

integrable martingale M(t), we obtain gale. In particular, for a fixed t it is asymptotically

normally distributed, a fact that was used in con-

" = A∗ (t) + M ∗ (t).

A(t) (7) nection with the confidence intervals (3) and (4).

!t The weak convergence result also makes it possible

Here A∗ (t) = 0 J (s)α(s) ds is almost the same to derive confidence bands for A, i.e. limits which

as A(t) when there is only a small probability contain A(t) for all t in an interval [τ1 , τ2 ] with a

!that Y (s) = 0 for some s ≤ t, while M ∗ (t) = prespecified probability.

t

0 [J (s)/Y (s)] dM(s) is a stochastic integral and as One important class of such confidence bands are

such is a local square integrable martingale. Relation the equal precision bands. The standard and log-

(7) is the key to studying the statistical properties transformed equal precision bands are obtained by

of the Nelson–Aalen estimator. Since M ∗ (t) has replacing z1−α/2 in (3) and (4) by d1−α , the 1 − α

expected value zero for any given t, we have EA(t) " = fractile in the distribution of the supremum of the

∗ absolute value of a standardized Brownian bridge

EA (t), so the Nelson–Aalen estimator is almost

unbiased. Furthermore, an unbiased estimator for the (over a certain time interval). This fractile may be

variance of M ∗ (t) is its optional variation process found (approximately) by solving (with respect to d)

Nelson–Aalen Estimator 5

( ) $ %

4φ(d) 1 "

σ (τ2 ) [1] Aalen, O.O. (1978). Nonparametric inference for a

+ 2φ(d) d − ln = α,

d d "

σ (τ1 ) family of counting processes, Annals of Statistics 6,

701–726.

where φ(d) is the standard normal density function. [2] Altshuler, B. (1970). Theory for the measurement of

competing risks in animal experiments, Mathematical

The equal precision bands require " σ (τ1 ) > 0, so they Biosciences 6, 1–11.

cannot be extended all the way down to t = 0. [3] Andersen, P.K., Borgan, Ø., Gill, R.D. & Keiding, N.

Typically, one will also omit the largest values of (1993). Statistical Models Based on Counting Processes.

t. The standard equal precision band has poor small Springer-Verlag, New York.

sample properties, so even with sample sizes in the [4] Becker, N.G. (1993). Analysis of Infectious Disease

hundreds the use of the log transformed confidence Data. Chapman & Hall, London.

[5] Bie, O., Borgan, O. & Liestøl, K. (1987). Confidence

band is recommended [5]. As an illustration we use

intervals and confidence bands for the cumulative hazard

once more the liver cirrhosis example. Considering rate function and their small sample properties, Scandi-

the interval from 4 months (1/3 year) to 8 years, navian Journal of Statistics 14, 221–233.

we have " σ (1/3) = 0.027 and " σ (8) = 0.163, so that [6] Breslow, N.E. & Day, N.E. (1980). Statistical Methods

d0.95 = 2.99. Therefore the 95% log transformed in Cancer Research. Vol. 1: The Analysis of Case-

equal precision band for the cumulative hazard rate Control Studies, IARC Scientific Publications, Vol. 32.

function between 4 months and 8 years may be International Agency for Research on Cancer, Lyon.

[7] Breslow, N.E. & Day, N.E. (1987). Statistical Methods

obtained from (4) by using the fractile 2.99 instead

in Cancer Research. Vol. 2: The Design and Analysis of

of the value 1.96 used for the pointwise confidence Cohort Studies, IARC Scientific Publications, Vol. 82.

intervals in Figure 1. A detailed study of the weak International Agency for Research on Cancer, Lyon.

convergence of the Nelson–Aalen estimator and the [8] Fleming, T.R. & Harrington, D.P. (1991). Counting

derivation of confidence bands are provided by [3, Processes and Survival Analysis. Wiley, New York.

Section IV.1.2-3]. Here another class of confidence [9] Keiding, N. & Andersen, P.K. (1989). Nonparametric

bands, the Hall–Wellner bands, is also discussed. estimation of transition intensities and transition prob-

abilities: a case study of a two-state Markov process,

We finally note that semi-Markov processes (or

Applied Statistics 38, 319–329.

Markov renewal processes), where the transition [10] Nelson, W. (1969). Hazard plotting for incomplete

intensities (only) depend on the sojourn times in the failure data, Journal of Quality Technology 1, 27–52.

states, do not give rise to counting processes which [11] Nelson, W. (1972). Theory and applications of hazard

fulfill the multiplicative intensity model (5). Thus the plotting for censored failure data, Technometrics 14,

results outlined above do not immediately extend to 945–965.

such models. However, it turns out that enough of [12] Schlichting, P., Christensen, E., Andersen, P.K., Fauer-

holdt, L., Juhl, E., Poulsen, H. & Tygstrup, N., for The

the above structure is preserved to be able to define

Copenhagen Study Group for Liver Diseases (1983).

Nelson–Aalen estimators also for such semi-Markov Prognostic factors in cirrhosis identified by Cox’s regres-

processes and to derive identical asymptotic results sion model, Hepatology 3, 889–895.

for these as for the case of Markov processes; see [3,

Section X.1] for a discussion and further references. ØRNULF BORGAN

- Frequency Analysis of Extreme EventsDiunggah olehburreiro
- Process CapabilityDiunggah olehAl Engg
- sppsurvivalDiunggah olehselinblue
- jurnal reading 2 rikojumat11 - Copy.pptDiunggah olehRiko Jumattullah
- OLSDiunggah olehNiki
- Review of A Large Clinical Series: Predicting Death for Patients With Abdominal Septic ShockDiunggah olehProf Dr Dr Ernst Hanisch
- SSCE2193 Ch3 ExercisesDiunggah olehAbdullah Belasegak
- Ratio and Product Type Estimators Using Stratified Ranked Set SamplingDiunggah olehinventionjournals
- Manual Cost Cultivation Surveys 23july08Diunggah olehdushyantrocky
- Back MatterDiunggah olehOleg Sofrygin
- Meng 381 Assignment 1Diunggah olehAdnan Noor
- Bankruptcy Prediction With Industry EffectsDiunggah olehtjtjsqh
- mit6 041f10 l23Diunggah olehapi-246008426
- How Durable is the Effect of Low IntensityDiunggah olehZulvikar Umasangadji
- Van Der Meer_Code for Dike Height Design & Examination Based on Wave Run-up and Wave Over ToppingDiunggah olehapi-3709579
- Probsta Chapter 8Diunggah olehjimii
- Comparison of Fatigue Data Using the Maximum Likelihood MethodDiunggah olehtommasobrugo
- St 205_lecturer NotesDiunggah olehPeter Ng'orogita
- Quadratic mean differentiability ExampleDiunggah olehmamurtaza
- STAT2303 UARK Week 03 Lesson 01 Confidence IntervalsDiunggah olehTodd Thomas
- Bootstrapping.pdfDiunggah olehkakonauta
- lec2-14Diunggah olehmohamed
- Questions - CQE Test (Part02) - AKDiunggah olehShirley
- 663270_663268_171169_MODULE _ 15,16 B7Diunggah olehLong Lie
- Public Open Space and WalkingDiunggah olehDesarts Sonnants
- Stochastic Solutions ManualDiunggah olehAftab Uddin
- polya note pdfDiunggah olehNashitah Abd Kadir
- Ug2Diunggah olehKosachtraves
- Formulae and theorems.pdfDiunggah olehPRUDHVI me15b053
- Clinical Features of Primary Brain Tumours uDiunggah olehLady Kwee

- Summary EconometricsDiunggah olehFatimah Ahmad
- [Bird] Analysis of variance via confidence intervals.pdfDiunggah oleheman_tenan2220
- SchoukensSystemIdentification2013.pdfDiunggah olehDev Ganatra
- OPRE 6301-SYSM 6303 Chapter 10 Slides_studentsDiunggah olehSrinivas Reddy
- ASCE_7690_9780784480038Diunggah olehshadabg04
- Exam c Syllabus June 2016Diunggah olehphulam146
- Solution Manual for Introduction to Econometrics - Gary KoopDiunggah olehMasoomeh Akbarzadeh
- Lecture 15-3 Cross Section and Panel (Truncated Regression, Heckman Sample Selection)Diunggah olehDaniel Bogiatzis Gibbons
- Sampling Error and BiasDiunggah olehJessica Angelina
- KernelDiunggah olehwillrent
- chapters1n2Diunggah olehSharon Huang Zihang
- Chapter 2 Sampling and MeasurementDiunggah olehPhillipBattista
- Book Mixed Model HendersonDiunggah olehHoracio Miranda Vargas
- How to Write a Scientific Paper—–Writing the Methods SectionDiunggah olehJames Cojab Sacal
- Teachers Resource Binder - Chapter 07Diunggah olehADAM ASHLEY
- Are Fallen Angels SpecialDiunggah olehAnamaria Cociorva
- 04_Jackknife.pdfDiunggah olehuniversedrill
- Chap 005Diunggah olehmallick5051rajat
- tirc_80Diunggah olehregreg88regreg
- IJC JC2 H1 Maths 2012 Prelims 1 Question PaperDiunggah oleh遠坂凛
- Ch_03_Wooldridge_5e_PPT.pdfDiunggah olehMelissa Moreno
- 3The Exponential Smoothing ModelDiunggah olehNsr Murthy
- 1030 Data QualityDiunggah olehDiana Andrea Cardona Peña
- 02 SimpleLinearRegression2017forStudentsDiunggah olehLEWOYE BANTIE
- Candidate Weak PRFDiunggah olehdora
- Quantitative Mineral Resource Assessments_An Integrated Approach--Donald Singer, W. David MenzieDiunggah olehparagjdutta
- Chapter2.pdfDiunggah olehPatrick Mfungwe
- SOA Exam CDiunggah olehNoDriver
- DatabaseDiunggah olehEric Li
- LS vs MAD Wilson 1978 Decision_SciencesDiunggah olehTf Chung

## Lebih dari sekadar dokumen.

Temukan segala yang ditawarkan Scribd, termasuk buku dan buku audio dari penerbit-penerbit terkemuka.

Batalkan kapan saja.