Anda di halaman 1dari 5

2014

2014
13th
International
International
Conference
Conference
onon
Information
Information
Technology
Technology

A Model For Prediction Of Human Depression Using Apriori Algorithm

Lambodar Jena
Utkal University, Narendra K. Kamila
Bhubaneswar, India C.V.Raman College of Engineering,
e.mail:lmjena@yahoo.com Bhubaneswar, India

Abstract- Depression a latest epidemic of modern era test for diagnosing most forms of mental illness;
has always drawn the attention of researchers to typically, the diagnosis is based on the patients
find & evaluate the level, causes & prevention. self-reported experiences, behaviors reported by
According to psychiatrists its not a psychological relatives or friends, and a mental status
disorder but it creates the stimulation & simulation of
co-ordination failure. The worst case of the leading
examination[3,4]. In this paper the main objective is
depression level may contemplates a person to to predict & prevent the level of depressional
attempt suicide, loss of energy, insomnia etc. This feelings which acts as a silent killer in human body;
epidemic causes severe personal and public health which ultimately can be able to protect wastage of
challenges. Each year thousands of millions of people lives, and helps in sustainable development. Here
are suffering from depression and a few receives we review data on various aspects of depression
adequate treatment. Keeping view at the trend of originating in India. The code words like FA
predicting, this paper has been proposed & tried to (Family Problem), FP (Financial), ORG
find out the persons affected by depression and their (Organization), RL (Relationship), ADD
level of illness.The work is self motivated. Here the
concept of apriori algorithm & association rule
(Addiction), CD (Congenital diseases),
mining is used in order to extract the information HL(Hallucination) etc. are used for representing
from a large collection of database of concerned different depressional factors. The updation value
persons. can be added & the respective graph can show in
addition with the previous data value for a person.
Keywords-Depression, Apriori Algorithm, Data For each transaction the creation of nodes takes
mining place; each node gives the transactional level of all
the respective persons on the basis of the
I. INTRODUCTION depressional factors. Here we tried to use the
Depression is nothing but a low mood, aversion to concept of Apriori algorithm in the field of
activity that affects a persons attitude, behavior & diagnosis of diseases which is completely new
feeling of well being. There are different factors introduction in medical field rather than using in the
which are responsible for the origin of depression in market analysis sector. From the gathered
psychological state of a person. It is a disorder of information a relational database is created and the
major public health importance, in terms of its sample data is being stored for implementation. As
prevalence and the suffering, dysfunction, the inference of the proposed model the scenario of
morbidity, and economic burden[1]. It also causes the depressed patients can be obtained & safety
the contemplates to suicide, loss of interest, measures can be taken in order to restrict the global
helpless, hopeless feelings & psychological major issue of this severe health hazard. Association
disorder. This paper aimed at explaining, predicting rule mining is implemented on the basis of the
& estimating the population having depressed factors provided in the database. Data item set are
feelings whether knowingly/unknowingly. Mental generated & are implemented as per the algorithm
illness is a leading cause of disability worldwide. It specified. This paper aimed at three basic aspects.
is estimated that nearly 300 million people suffer As per the requirement for related data collection a
from depression (World Health Organization, field survey has been adopted through a
2001). Reports on lifetime prevalence show high questionnaire round among the individuals of a
variance, with 3% reported in Japan to 17% in the community. Questions asked to them were derived
US. In North America, the probability of having a from the basic factors mentioned which causes
major depressive episode within a one year period depression. After having the sample data a model is
of time is 35% for males and 810% for females designed by implementing the concepts; when data
[2]. However, global provisions and services for is entered then respective graphs are formed. Out of
identifying, supporting, and treating mental illness the total number of data inputs we can get the
of this nature have been considered as insufficient details of the individuals having greater than and
[3]. Although 87% of the worlds governments offer equal to the cut off value of depressional factors and
some primary care health services to tackle mental from the graph itself the inference can be drawn.
illness, 30% do not have programs, and 28% have
no budget specifically identified for mental
health[3] . In fact, there is no reliable laboratory

978-1-4799-8084-0/14 $31.00 2014 IEEE 240


DOI 10.1109/ICIT.2014.65
II. LITERATURE SURVEY across completely different regions of the country.
The survey focuses on analysis done by numerous Studies have additionally tried to tell apart the
researchers on depressive disorders in Asian nation. philosophy in depression from negative symptoms
For this, an intensive net search was done of schizophrenic psychosis and therefore the
victimization key words like depression, life events, philosophy in clinical depression.
prevalence, classification[1], cultural problems, III. RELATED RESEARCH WORK
outcome, prevention, incapacity and burden etc in
numerous combos. The varied search engines were Rich bodies of work on depression in psychiatry,
utilized in addition thorough search of all the psychology, medicine, and sociolinguistics describe
problems of Indian Journal of Psychiatry[1] on the efforts to identify and understand correlates of
market on-line was done. Hand search of a number MDD in individuals. Cloninger et al.,[22] examined
of the missing problems was conjointly tried and the role of personality traits in the vulnerability of
this yielded a couple of a lot of articles. A study individuals to a future episode of depression,
supported the globe Health Organization's World through a longitudinal study. On the other hand,
mental state Survey Initiative has aforesaid that found support for the claim that negative processing
Asian nation has the very best rate of major biases, particularly (cognitive) biases in resolving
depression within the world. The typical life rates of ambiguous verbal information can predict sub-
depression, consistent with the study, were found to sequent depression. Robinson and Alloy[6]
be 14.6% in ten high financial gain countries, and similarly observed that negative cognitive styles and
11.1% in eight low- to middle-income countries. stress-reactive rumination were predictive of the
However life incidents of what was known as Major onset, number and duration of depressive episodes.
Depressive Episodes (MDE), were highest among Finally, Brown et al., found that lack of social
Indians at 35.9%, whereas China was at the bottom support and lowered self-esteem are important
at 12%. Average share of MDE was, however, factors linked to higher incidences of depression.
significantly higher in high-income countries at Among a variety of somatic factors, reduced energy,
28.1%, compared to 19.8% within the low- to disturbed sleep, eating disorders, and stress and
middle-income countries. Those that supported tension have also been found to be correlates of
anybody of those queries were place through a form depressive disorders[7]. In the field of
to find MDE. To be positive for MDE, a respondent sociolinguistics[8] showed that linguistic analysis of
had to qualify for 5 of the "nine cardinal speech could classify patients into groups suffering
symptoms," together with indicators in craving, from depression and paranoia. Computerized
sleep, mood, lack of concentration and tendency of analysis of written text through the LIWC program
dangerous thoughts. Many studies drained subjects has also been found to reveal predictive cues about
with depression have shown high level of co neurotic tendencies and psychiatric disorders[9].
morbidity of each medical specialty and physical Although studies to date have improved our
sicknesses, particularly in aged people with understanding of factors that are linked to mental
depression. On the opposite hand studies on the disorders, a notable limitation of prior research is
prevalence of medical specialty morbidity in that it relies heavily on small, often homogeneous
physical disorders have shown that depression is samples of individuals, who may not necessarily be
sort of rife in these conditions. Studies drained representative of the larger population. Further,
subjects with intentional self damage, subjects with these studies typically are based on surveys, relying
serious suicide makes an attempt, divorce seeking on retrospective self-reports about mood and
couples, earth quake victims, girls when birthing, observations about health: a method that limits
obsessional compulsive disorder, alcohol dependent temporal granularity. That is, such assessments are
subjects, medical in-patients, medicine disorders, designed to collect high level summaries about
finish stage nephritic sickness cancer chronic and experiences over long periods of time. Collecting
disfiguring skin disorders age connected finer-grained longitudinal data has been difficult,
degeneration HIV-infected heterosexuals, industrial given the resources and invasiveness required to
population have shown that depression is that the observe individuals behavior over months and
most typical or one among the foremost common years. As depression causes the tendencies,
medical specialty designation in such patient teams attempts, contemplates to suicide which affects the
and prevalence rate as high as 86.7% has been socio-economic and sustainable development, it is
reported. very much required to predict and evaluate the
Many studies have reported the symptom profile of psychological behavior of individuals from a
subjects with depressive disorders. The findings of group/community. This became an emerging
symptomatology generally may be understood with problem of modern era which also causes another
regard to bodily symptoms, guilt and different health challenge can be predicted. Thus identifying
depressive ideations, dangerous behavior, the individuals from a group/community possessing
philosophy of delusions and sleep design. Studies the depressional tendencies is a major concern.
have additionally compared the symptomatology Implementation of Apriori algorithm in medical

241
concern draws out the inference from a large TABLE-1
gathered data by using data mining. Transaction ID Dipression Factors
IV. PROPOSED WORK T1 {FA, ADD, REM, CD, FP, HL}
The concept of data mining i.e Apriori algorithm is T2 {UE, ADD, REM, CD, FP, HL}
being used here for the extraction of information T3 {FA, AP, CD, FP}
from a large collection of sample data. Association T4 {FA, SLP, RL, CD, HL}
rule mining is properly implemented, related and
demonstrated in accordance with the necessity of T5 {RL, ADD, ADD, CD, ORG, FP}
the work. Apriori algorithm is used for frequent There is a simple golden rule that: an item/itemset
item set mining and association rule learning over is frequently occurred if it is occurring at least 60%
transactional databases. of times which is the support. So for here it should
Apriori is an algorithm for frequent itemset mining be occurred at least 3 times. Count the number of
and association rule learning over transactional transactions in which each factor occurs(shown in
databases. It proceeds by identifying the frequent Table-2).
individual items in the database and extending them
to larger and larger item sets as long as those item
TABLE-2
sets appear sufficiently often in the database. The
frequent item sets determined by Apriori can be Item No of transactions
used to determine association rules which highlight FA 3
general trends in the database.
ADD 3
Algorithm: REM 2
Join Step: Ck is generated by joining Lk-1 with CD 5
itself FP 4
Prune Step: Any (k-1)-itemset that is not frequent
HL 3
cannot be a subset of a frequent k-itemset
UE 1
Pseudo-code: AP 1
Ck: Candidate itemset of size k SLP 1
Lk: frequent itemset of size k
L1= {frequent items}; RL 2
for(k= 1; Lk!=; k++) do begin ORG 1
Ck+1= candidates generated from Lk;
for each transaction t in database do The item is said frequently occurred if it is
increment the count of all candidates in Ck+1 that are occurring at least 3 times. So in this step remove all
contained in t the factors that are occurring less than 3 times from
Lk+1= candidates in Ck+1 with min_support Table-2 and we are left with
end
return kLk;
TABLE-3
A. Implementation
The above algorithm is implemented in JAVA Item Number of
language to get the output and inference. Here about transactions
500 records has been taken as test data for the FA 3
model. The model is tested on 500 individuals and ADD 3
successfully predicted the percent of individuals are CD 5
suffering depression. We have considered the
FP 4
following factors of depression: Lifestyle, Life
events, Non-psychiatric illness, Acquired infection, HL 3
Medical treatments, Professional activities, Stress, Now lets find a pair of factors(shown in Table-4)
and Relationship Status etc. The questions were that are occurring frequently from the Table-3.
based on Family problem(FA), Financial
TABLE-4
problem(FP),Unemployed(UE),emuneration
(REM),Addiction(ADD),Workplace(ORG), Factor pairs
Relationship(RL),Congenital diseases(CD),
{FA,ADD}
Apprehension(AP),Hallucination(HL), and Sleeping
problem(SLP). {FA,CD}
Suppose there are records(shown in Table-1) of {FA,FP}
large number of transactions at a sample data set as
follows:

242
{FA,HL} On the same way
{ADD,CD} {CD} => { ADD,FP} : Confidence is 3/5 x 100% =
60%
{ADD,FP}
{FP} => {ADD,CD} : Confidence is 3/4 x 100% =
{ADD,HL}
75%
{CD,FP} {CD,FP} => {ADD} : Confidence is 3/3 x 100% =
{CD,HL} 100%
{FP,HL} {ADD,FP} => {CD} : Confidence is 3/3 x 100% =
Now count how many times each pair is occurring 100%
together. Remove all the item pairs with number of {ADD,CD} => {FP} : Confidence is 3/4 x 100% =
transactions less than three and we are left with 100%
table-5.
V. TESTING OF THE MODEL
TABLE-5
In this work we have taken the various features of
Item Pairs Number of the individuals as input. Based on the input features
transactions transaction graph is generated. The sample output
{FA,CD} 3 graph of the model is shown in Fig-1 and Fig-2 for a
{ADD,CD} 3 small set of data. This shows the association of
{ADD,FP} 3 different individuals affected by different factors of
depression based on multiple features. The inputs
{CD,FP} 4
are supplied to the system via user interface and
{CD,HL} 3 processed after clicking the process button. Then
transaction lines are drawn with respect to the
These are the pairs of factors frequently occurring inputted depression factors. Common depression
together. Now lets find a set of three factors that factors of each input sequence are joined together in
are occurring together. We use the above table the graph. To detect the persons who are affected by
(table 5) and make a set of 3 factors(table-6). potential depression factors a cut of value has to be
entered in to the system through the user interface.
TABLE-6 Fig-2 shows the transactions with cut of value for a
Item Set Number of small set of data . The cut of value is the number of
transactions depressional factors common to a number of
{ADD,CD,FP} 3 individuals belonging to a group/community.
{CD,FP,HL} 2
Now after this the confidence can be measured to
predict the dipressional level. From the above table-
6 it is observed that the dipressional factors
ADD,CD,FP are occurring together frequently.
Now find the association between them.To do this,
list all the subsets of frequently occurring factors
{ADD,CD,FP} and obtain the following subsets,
{ADD}
{CD}
{FP}
{ADD,CD}
{CD,FP}
{ADD,FP}
Now, find the association among all the subsets

{ADD} => {CD,FP} : (if 'ADD' is occurring, what Fig.1 Plot for the transactions of factors of different individuals
is the probability that 'CD' and 'FP' would be occure
in the same transcation) ADD is occurring in 3
transactions and in all those transactions 'CD' and
'FP' are also occurring, i.e. Confidence = 3/3 x 100
= 100%. We can say with 100% confidence that if
ADD is occurring CD and FP would also be
occurring. So, if a person having Addiction(ADD),
then he/she also have Congenital Diesease(CD) and
Financial Problem(FP).

243
Intelligent Agent Technology (WI-IAT), volume 1,
pages 492499, 2010.
[6] Robinson, M. S., & Alloy, L. B.. Negative cognitive
styles and stress-reactive rumination interact to
predict depression: A prospective study. Cognitive
Therapy & Research, 27(3), 275-292, 2003.
[7] Al-Yamani, M. H., & Abdel-Khalek, A. M..Somatic
symptoms in anxiety and depression patients. Derasat
Nafseyah, 14(1), 123-152,2004.
[8] Fox AJ, Goldblatt PO, Jones DR. Social class
mortality differentials: Artefact, selection or life
circumstances? Journal of Epidemiology and
Community Health. 1985;39:18.
[9] M. De Choudhury, S. Counts, and E. Horvitz. Major
life events and behavioral markers in social media:
Case of childbirth. In 16th ACM Conference on
Computer Supported Cooperative Work (CSCW
Fig.2 Graph shows the transactions with cut of value=2 2013), 2013.
[10] M. De Choudhury, S. Counts, and E. Horvitz.
VI. CONCLUSION Predicting postpartum changes in emotion and
The overall objective of the work is to predict more behavior via social media.In ACM SIGCHI
Conference on Human Factors in Computing
accurately the presence of depression and tried to Systems (CHI 2013), 2013.
maintain the socio-economic development up to [11] R. Chunara, J.R. Andrews, and J.S. Brownstein.
some extent by eliminating this epidemic. This Social and news media enable estimation of
paper aimed its proper utilization at different levels epidemiological patterns early in the 2010 Haitian
cholera outbreak. The American Journal of Tropical
and communities by predicting the despressional Medicine and Hygiene, 86(1):3945, 2012.
level. It will provide the smooth and reliable [12] M. Krieck, J. Dreesman, L. Otrusina, and K.
environment of work place and successful Denecke. A new age of public health: Identifying
professional life. The work can be extended in disease outbreaks by analyzing tweets. Proceedings
future for prediction at individual level and of Health WebScience Workshop, ACM Web
Science Conference, 2011.
management with security, and providing
[13] Posternak MA; Solomon DA; Leon AC.. The
prevention techniques. Other data mining natural-istic course of unipolar major depression in
techniques can also be used for prediction e.g. the absence of somatic therapy. J.Nerv & Mental
clustering, time series, association rules etc. The Disease 194(5):32429, 2006.
text mining can be used to mine and analyze huge [14] M.J. Paul and M. Dredze. A model for mining public
amount of unstructured data available in healthcare health topics from Twitter. Technical report, Johns
Hopkins University,2011.
industry database.
[15] Adam Sadilek and Henry Kautz. Modeling the
impact of lifestyle on health at scale. In Sixth ACM
REFERENCES International Conference on Web Search and Data
Mining, 2013.
[1] Grover S, Dutt A, Avasthi A. An overview of [16] Park, M.; Cha, C.; & Cha, M.. Depressive Moods of
Indian research in depression. Indian J Psychiatry Users Captured in Twitter. In Proc. ACM SIGKDD
2010;52:S178-S188. Work-shop on Healthcare Informatics (HI-KDD),
[2] Andrade L e tal.. The epidemiology of major 2012.
depressive episodes: results from the International [17] Moreno, M.; Jelenchick, L.; Egan, K.; Cox, E. et al..
Consortium of Psychiatric Epidemiology (ICPE) Feeling bad on Facebook: depression disclosures by
Surveys. International Journal Methods Psychiatric college students on a social networking site.
Research. 2003;12(1):3-21. Depression and Anxiety 28(6):447455, 2011.
[3] Choudhury Munmun de, Gamon Michael, Hoff [18] Kotikalapudi, R.; Chellappan, S.; Montgomery, F.;
Aaron, and Roseway Asta, "Moon Phrases": A Wun-sch, D.; & Lutzen, K.. Associating depressive
Social Media Facilitated Tool for Emotional symptoms in college students with internet usage
Reflection and Wellness., European Alliance for using real Internet data. IEEE Technology and
Innovation, May 2013 Society Magazine, 2012.
[4] Wang Y, Yang Y, Hui L, Tie C, Li F, Xu ZQ, Wang [19] Lustberg L; & Reynolds CF. Depression and insom-
C (2013a) A neuropeptide Y variant (rs16139) nia: questions of cause and effect. Sleep Medicine
associated with major depressive disorder in replicate Reviews 4 (3): 253262, 2000.
samples from Chinese Han Population. PLoSOne [20] Roger Detels. Epidemiology: the foundation of
8:e57042 public health. Oxford Textbook of Public Health.
[5] S. Asur and B.A. Huberman. Predicting the future 978-0-19-969347-4; June 2011
with social media. In 2010 IEEE/WIC/ACM [21] Cloninger, C. R. "The science of well-being: an
International Conference on Web Intelligence and integrated approach to mental health and its
disorders." World Psychiatry 5: 71-76,2006.

244

Anda mungkin juga menyukai