International Journal of Computation and Applied Sciences IJOCAAS, Volume2, Issue 2, April 2017, ISSN: 2399-4509

A Review of Chronic Diseases Map and Data Trends Analysis
Mustafa Yousef Mustafa ZGHOUL

Abstract: This paper reviews and analyzes the status of chronic diseases in the Gulf region. It also illustrates methods for visualizing information about chronic diseases, which are the major disease burden in the Gulf region. Methods for analyzing chronic disease data are reviewed and discussed with reference to the univariate time series model. A forecasting model based on simple linear regression is proposed and implemented. The visualization of digital data is considered one of the most important techniques for presenting the distribution of statistical data within a small space. An interactive disease map is proposed and implemented to show the numbers and locations of chronic diseases.

Keywords: Chronic diseases, visualization, statistical data analysis, linear regression, time series, forecasting, mapping

I. INTRODUCTION
Chronic diseases (CD) pose a constant and substantial danger because of the growing risk they present to human life and to national economies. Statistics issued by the World Health Organization (WHO) indicate that the number of people living with chronic disease is increasing in all parts of the world, as illustrated in Figure 1.

Fig. 1: The total NCD deaths by region, 2012 (AFR: Africa; AMR: America; SEAR: South East Asia; EUR: European; EMR: Eastern Mediterranean; WPR: Western Pacific; Source: WHO)

In addition, WHO reported that chronic diseases are the major disease burden in the Gulf region: they cause more than 60% of all deaths in the Gulf Cooperation Council (GCC) countries [1], as shown in Figure 2.

Fig. 2: The Diabetes Growth Rate and Affected Population (in 000) during 2000 to 2030. Source: WHO.

Moreover, according to Ministry of Health statistics in the Sultanate of Oman for the period 1990 to 2005, approximately 75% of the disease burden is attributable to chronic diseases [2]. The total number of deaths caused by chronic diseases is double the number caused by infectious diseases such as pulmonary tuberculosis, viral hepatitis (A), malaria, and AIDS (HIV). In addition, chronic diseases affect women and men at younger ages. Several risk factors cause chronic diseases, including physical inactivity, unhealthy diet, and tobacco use. Chronic disease is a real danger that is growing rapidly over time; therefore, many governments have spent billions of dollars on controlling and preventing the spread of CD [3]. WHO reports indicate that the occurrence of CD will rise dramatically by 2023, as shown in Figure 3.

For this reason, one way to reduce the rate of chronic diseases is to increase health awareness in the community. Therefore, a web application for chronic disease surveillance is much needed; it will help raise awareness of culture-related chronic diseases and of methods of reducing their effects.

Master's student in Computer Science, Faculty of Computing and IT, Sohar University, Sohar, Sultanate of Oman, mzghoool@gmail.com

Fig. 3: Expected rate of CD in 2023 (Source: WHO)

A chronic disease (or non-communicable disease) is a disease that persists for a long period of time, typically three months or more. Generally, these diseases cannot be prevented by a treatment or by taking medicine. The Ministry of Health in the Sultanate of Oman is working to educate members of the community about chronic diseases such as diabetes, blood pressure, etc., in order to increase health awareness. An infectious disease (or communicable disease) is a disease caused by microorganisms such as bacteria, parasites, fungi, and viruses. It can spread to other people quickly through sneezing and coughing or through physical contact. Examples of infectious diseases are malaria, pulmonary tuberculosis, viral hepatitis (A), and AIDS (HIV) [4]. Governments therefore work hard to reduce the spread of chronic and infectious diseases by offering intensive programs and studies of injury prevention methods [5]. The main objective of this paper is to promote awareness of the spread of chronic diseases in the GCC. Therefore, a web application for analyzing, forecasting, and visualizing chronic diseases is much needed. This system will collect chronic disease data and present the analyzed statistical rates for all GCC countries. In addition, it will provide clear visualizations based on an interactive disease map.

II. VISUALIZATION METHODS AND TECHNIQUES

Many techniques are used to visualize and analyze data. The interactive disease map is one of the most powerful methods for data visualization. It is used to present information clearly and efficiently through plots and statistical and information graphics. The visualization of statistical data aims to present a large amount of information in a short time. Numerical data can be graphed using lines, bars, or dots to communicate a quantitative message visually. Recent studies have shown that the interactive disease map helps users analyze and understand the meaning of complex data easily. Generally, tables are used where users look up a specific measurement, while charts and maps are used to present patterns or relationships in multivariate data. The disease map is a method for representing different diseases in order to track their spread in real time and to understand their causes [6]. A time series is a group of observations or statistics recorded at regular intervals and ordered by time [7]. The proposed system will therefore provide an analysis for predicting unseen data over time. Moreover, trend analysis is a type of technical analysis that tries to predict future values of data (information in sequence over time) based on past data [8],[9]. The analysis of statistical data based on the trend analysis method is called linear regression analysis, and it will help decision makers in the Ministry of Health to determine the spread of chronic diseases and the associated needs [10],[11].

Cartographic visualization uses symbolism techniques in which symbols are placed at locations inside the map and represent multiple data values for those locations [12]. In addition, it allows users to select statistical and geographical subsets. Local statistics of univariate distributions can be calculated and visualized dynamically for exploration, for example as scatterplots, dot-plots, population cartograms, choropleths, parallel coordinates plots, and polygon maps, as shown in Figure 4.

Fig. 4: Cartographic visualization (source: [12])

According to Statistics Netherlands (2012), "data visualization is the art of presenting data in a visual manner. So the data becomes apparent. Data visualization is a helpful tool for all phases of the statistical process. It has two goals in statistics: data exploration and communication" [13]. Diagrams can reveal patterns in large amounts of data and present the major findings to users. There are many types of diagrams, such as the radar plot, bar chart, and line chart. They are used to represent a time series or to compare two variables. The proportional symbol map, on the other hand, is a good way of viewing statistical data. It aims to show the characteristics of a subject on the map. "In a proportional symbol map, a symbol is plotted on the center of a region, and the surface area of the symbol is scaled with the value of the variable" [13]. For example, consider the proportional symbol map shown in Figure 5.
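The area-scaling rule quoted from [13] can be illustrated with a minimal Python sketch. It is not taken from the paper: the function name `symbol_radius`, the maximum radius of 30 units, and the region names and case counts are all hypothetical. Because the area of a circle grows with the square of its radius, the radius has to scale with the square root of the value for the area to be proportional to it.

```python
import math


def symbol_radius(value, max_value, max_radius=30.0):
    """Return a circle radius whose AREA is proportional to `value`.

    The largest value in the data set is drawn with `max_radius`
    (an arbitrary, assumed size in map units).
    """
    if max_value <= 0:
        raise ValueError("max_value must be positive")
    if value < 0:
        raise ValueError("value must be non-negative")
    # Area ~ radius^2, so the radius scales with sqrt(value).
    return max_radius * math.sqrt(value / max_value)


# Hypothetical case counts per region (illustrative only, not real data).
cases = {"Region A": 400, "Region B": 100, "Region C": 25}
largest = max(cases.values())
radii = {name: symbol_radius(count, largest) for name, count in cases.items()}

for name, radius in radii.items():
    print(f"{name}: {cases[name]} cases, radius {radius:.1f}")
```

Scaling the radius linearly with the value instead would exaggerate large values, since a region with four times the cases would then be drawn with sixteen times the area.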

Fig. 5: Proportional symbol map (source: [13])

According to Heinrich Hartmann (2016), "statistical techniques are the art of extracting information from data. Moreover, one of essential data analysis methods is visualization. The human brain can process geometric information much more rapidly than numbers." [14]. There are many methods for visualization, such as rug plots and histograms (for one-dimensional data), scatter plots (for two-dimensional data), and line plots [14]. As an example, consider the visualization methods shown in Figure 6.

Fig. 6: Visualization methods (source: [14])

The histogram is one of the most popular visualization methods. It is commonly used to show the distribution of numerical data [14]. According to Ben Fry (2008), "within a small space, the visual can communicate (or announce) more information than the table." [15]. Mapping is one of the modern techniques used to visualize information. The mapping process starts with collecting the data values and storing them in a data set (database). The map on which the data values will be represented is then designed (or drawn). After that, locations (or points) are chosen on the map; usually these are the centers of the governorates. Finally, functions load the data values from the database and display them on the map itself. As an example, consider the visualization of varying data by symbol size on the map shown in Figure 7 [15].

Fig. 7: Visualization of varying data by symbol size on map (source: [15])

According to Adriana REVEIU and Marian DARDALA (2011), cartographic visualization provides the facilities to represent statistical data. It is one of the most important tools in geographical information systems (GIS). It aims to show the distribution of statistical data inside the regional map. Moreover, it uses different symbols to show more information at the same time inside the map. These symbols present quantitative details to the users [16].

III. ANALYSIS OF CHRONIC DISEASES

The analysis of chronic diseases often depends on longitudinal cohort (or group) studies, and there are many methodological issues related to chronic disease studies. The first issue is that the definitions of risk factors and outcomes change over time. The second issue is missing data.

There are several procedures for performing an analysis in the presence of missing data. The first is to analyze only the individuals whose data are complete. The second is to fill in values for the individuals whose data are incomplete and then analyze the dataset. The second technique is the more appropriate. There are many analytic techniques for chronic disease modeling [17]:
1. Logistic regression analysis: this can examine the effects of risk factors on the development of the disease.
2. Survival analysis (time to event data): this deals with the time until the event of interest occurs.
3. Neural networks.
4. Tree-based classification methods.
5. Longitudinal data analysis (mixed models, generalized linear models and generalized estimating equations).

The formula for representing the regression analysis model is:

$$Y_i = a + b X_i$$

where the regression parameters are a, the intercept (on the y axis), and b, the slope of the regression line.
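As an illustration only, the following minimal Python sketch shows one way the intercept a and slope b of this model could be estimated by ordinary least squares and then used to extrapolate a trend. It is not the paper's implementation: the function names `fit_line` and `r_squared` are hypothetical, and the yearly counts are synthetic numbers chosen to keep the arithmetic easy to follow.

```python
def fit_line(xs, ys):
    """Estimate intercept a and slope b of Y = a + b * X by least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    b = sxy / sxx            # slope
    a = y_mean - b * x_mean  # intercept
    return a, b


def r_squared(xs, ys, a, b):
    """Share of the variance in ys explained by the fitted line."""
    y_mean = sum(ys) / len(ys)
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_mean) ** 2 for y in ys)
    if ss_tot == 0:
        raise ValueError("all y values are identical; R-squared is undefined")
    return 1 - ss_res / ss_tot


# Synthetic yearly counts (illustrative only, not data from the paper).
years = [1, 2, 3, 4, 5]
counts = [12.0, 15.0, 14.0, 18.0, 21.0]

a, b = fit_line(years, counts)
r2 = r_squared(years, counts, a, b)
forecast = a + b * 10  # extrapolate the trend to year 10

print(f"Y = {a:.2f} + {b:.2f} * X, R-squared = {r2:.3f}, forecast = {forecast:.1f}")
```

An R-squared value close to 1 means the points lie near the fitted line; a low value is a warning that a straight-line forecast should be read with caution.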

The least squares method is used to estimate the slope and intercept regression parameters, as follows.

The formula for calculating the slope regression parameter (b) is:

$$b = \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (X_i - \bar{X})^2}$$

The formula for calculating the intercept regression parameter (a) is:

$$a = \bar{Y} - b\,\bar{X}$$

where $\bar{X}$ and $\bar{Y}$ are the means of the observed $X_i$ and $Y_i$, and $n$ is the number of observations.

IV. LITERATURE SURVEY
This section presents the literature survey of chronic disease data analysis. Researchers have implemented different computing techniques for forecasting and visualizing unseen data. The survey covers the global growth rate of chronic diseases, as well as forecasting models and visualization techniques. Statistics in the WHO global report [3] show that there were about 58 million deaths in 2005, of which approximately 35 million resulted from chronic diseases. Chronic diseases thus account for about 60% of global deaths; "only 20% of chronic disease deaths occur in high-income countries, while 80% occur in low and middle-income countries, where most of the world's population lives". Chronic diseases are expected to account for about 70% of total deaths worldwide by 2030. Khatib O. [18] stated, "chronic diseases are the major disease burden in the Eastern Mediterranean Region. There are many risk factors associated with chronic diseases, most of them are related to the lifestyle and can be controlled. Such as low vegetable and fruit intake, physical inactivity, high fast food consumption and high cholesterol are dominant causes of cardiovascular disease and some types of cancer. Also obesity and overweight can raise the risk of chronic diseases, like heart disease and diabetes". Many approaches can help to deal with this problem, such as developing national strategies, policies, and plans for prevention and health care, and implementing and enhancing community participation in prevention and health care. Most of the solutions for preventing chronic diseases are expensive [18].

Gregory Hartl and Menno van Hilten (2012) [1] explain that non-communicable diseases (NCDs) cause more than 60% of all deaths in the Gulf Cooperation Council (GCC) countries. The risk factors are an unhealthy diet, tobacco use, and physical inactivity. The GCC countries (Saudi Arabia, Bahrain, Sultanate of Oman, Kuwait, Qatar, and United Arab Emirates) are adopting a regional strategy to address, prevent, and control NCDs such as diabetes, cancer, and chronic respiratory disease. The strategy aims to reduce people's exposure to the different risk factors and to improve the services for preventing and treating these health problems [1]. Hill A.G., et al. (2000) mentioned that awareness of chronic diseases has grown in Oman. Morbidity for diagnoses related to chronic diseases, especially cancer, cardiovascular disease, and endocrine disease, was growing and becoming a significant share of Oman's burden of disease [19]. Jawad A. Al-Lawati et al. (2008) believe that chronic diseases pose the main challenge for the Omani population [2]. According to Ministry of Health statistics in Oman for the period 1990 to 2005, approximately 75% of the disease burden is attributable to chronic diseases. "The distribution of chronic diseases and related risk factors among the general population is similar to that of industrialized nations: 12% of the population has diabetes, 30% is overweight, 20% is obese, 41% has high cholesterol, and 21% has the metabolic syndrome". They conclude that chronic diseases are the major drain on human and financial resources for the Sultanate of Oman, and that this will affect the advances that have been achieved in the health care system.

Similarly, some related works focus on visualization techniques such as the interactive map. Jason Dykes (1998) implemented a cartographic visualization that locates symbols on a plane to show the statistical distributions of one or more variables [12].

V. RESULTS AND DISCUSSIONS
Statistical data collected from the Ministry of Health in the Sultanate of Oman show that the prevalence of diabetes in Oman is increasing over the years, as illustrated in Figure 8.

Figure 8: Diabetes in Oman (source: MoH Oman)

After applying simple linear regression, the fitted equation is Y = 4319.3 + 127.9 * X, and the R-squared value equals 0.46; this value describes how well the data fit the regression line, as shown in Figure 9.

Figure 9: Forecasting results for diabetes in Oman after 5 years

The results of this forecasting model show that the prevalence of diabetes in the Sultanate of Oman will increase. This will help decision makers to pay attention to the spread of chronic diseases.

This research paper implements the simple linear regression method for statistical data analysis and forecasting because the collected data consist of one variable that changes over time. In addition, this paper selects the cartographic visualization method to show the distribution of the data among different geographic locations. It uses symbols to represent the data; each symbol is scaled with the value of the variable.

CONCLUSION & FUTURE RESEARCH DIRECTION

This paper provided an overview of chronic diseases in the GCC. First, it presented the risk factors that cause chronic diseases. Then it explained the different methods for statistical data analysis and forecasting models. Moreover, different methods for data visualization were presented and reviewed. The cartographic technique is appropriate for visualizing chronic disease data because it illustrates the distribution of the data among different geographic locations. The literature survey indicated that simple linear regression is an appropriate method for analyzing and forecasting data with one variable only.

REFERENCES

[1] Gregory Hartl, Menno van Hilten (January 2012), "WHO recognizes progress of Gulf States for adopting regional strategy to address non-communicable diseases", [Online], available at: http://www.who.int/mediacentre/news/statements/2012/ncds_20120106/en/ (accessed 15 September 2016)
[2] Al-Lawati JA, Mabry R, Mohammed AJ (July 2008), "Addressing the Threat of Chronic Diseases in Oman", CDC, Volume 5, No. 3.
[3] World Health Organization (2005), WHO global report: "Preventing chronic diseases: A vital investment". Geneva: ISBN 92 4 156300 1.
[4] "What are infectious diseases?", [Online], available at: http://www.yourgenome.org/facts/what-are-infectious-diseases (accessed 29 July 2016)
[5] Directorate General for Disease Surveillance and Control in Ministry of Health - Sultanate of Oman, [Online], available at: https://www.moh.gov.om/en/web/directorate-general-of-disease-surveillance-control/ (accessed 02 June 2016)
[6] "Mapping Disease", [Online], available at: http://www.the-scientist.com/?articles.view/articleNo/35349/title/Mapping-Disease/ (accessed 23 July 2016)
[7] "Time Series", [Online], available at: http://www.statslab.cam.ac.uk/~rrw1/timeseries/t.pdf (accessed 03 October 2016)
[8] "Trend Analysis", [Online], available at: http://pubs.usgs.gov/twri/twri4a3/pdf/chapter12.pdf (accessed 15 June 2016)
[9] "Time trends analysis", [Online], available at: https://ec.europa.eu/jrc/sites/default/files/epaac-wp9-session2-crocetti.pdf (accessed 02 July 2016)
[10] Jabar H. Yousif (May 2015), "Classification of Mental Disorders Figures based on Soft Computing Methods", International Journal of Computer Applications, 117(2):5-11.
[11] Jabar H. Yousif and Mabruk A. Fekihal (January 2012), "Neural Approach for Determining Mental Health Problems", Journal of Computing, Vol. 4, Issue 1, pp. 6-11, ISSN 2151-9617, NY, USA.
[12] Jason Dykes (1998), "Cartographic visualization: exploratory spatial data analysis with local indicators of spatial association using Tcl/Tk and cdv", UK, The Statistician, 47, Part 3, pp. 485-497.
[13] Edwin de Jonge (2012), "Data Visualization", Netherlands, Statistics Netherlands, ISSN: 1876-0333.
[14] Heinrich Hartmann (2016), "Statistics for Engineers", USA, ACM Queue, DOI: 10.1145/2890780.
[15] Ben Fry (2008), "Visualizing Data", USA, O'Reilly, ISBN-10: 0-596-51455-7.
[16] Adriana REVEIU, Marian DARDALA (2011), "Techniques for Statistical Data Visualization in GIS", Romania, Informatica Economica, vol. 15, no. 3/2011.
[17] Ralph D'Agostino, Lisa M. Sullivan (January 2002), "Chronic Disease Data And Analysis: Current State Of the Field", Boston (US), Digital Commons, Vol. 1: No 2, 228-239, Article 32.
[18] Khatib O. (2004), "Non-communicable diseases: risk factors and regional strategies for prevention and care", East Mediterr Health J 2004; 10(6):778-88.

[19] Hill AG, Muyeed AZ, Al-Lawati JA (2000), "The mortality and health transition in Oman: patterns and processes". Muscat (OM): World Health Organization, Regional Office for the Eastern Mediterranean, UNICEF Oman.
[20] "Definition of Chronic disease", [Online], available at: http://www.medicinenet.com/script/main/art.asp?articlekey=33490 (accessed 07 June 2016)
[21] "Histogram", [Online], available at: http://searchsoftwarequality.techtarget.com/definition/histogram (accessed 03 October 2016)
[22] Astrid Schneider, Gerhard Hommel, and Maria Blettner (2010), "Linear Regression Analysis", Germany; Medicine. 107(44): 776-82.
[23] Christoph Klose, Marion Pircher, Stephan Sharma (May 2004), "Univariate Time Series Forecasting", UK.
[24] Choong-Yeun Liong and Sin-Fan Foo (2013), "Comparison of Linear Discriminant Analysis and Logistic Regression for Data Classification", Malaysia, LLC 978-0-7354-1150-0.
[25] Suzilah Ismail, Rohaiza Zakaria and Tuan Zalizam Tuan Muda (2014), "Univariate Time Series Forecasting Algorithm Validation", Malaysia, LLC 978-0-7354-1274-3.
[26] Adela SASU (2013), "A Quantitative Comparison of Models for Univariate Time Series Forecasting", Romania, University of Brasov, 117-124.
[27] Kelly H. Zou, Kemal Tuncali, Stuart G. Silverman (2003), "Correlation and Simple Linear Regression", USA, Radiology; 227:617-628.
[28] Mohammad S. Alam (2013), "Analytical Review of Data Visualization Methods in Application to Big Data", Russia, Journal of Electrical and Computer Engineering, Article ID 969458, 7 pages.
[29] Peter Filzmoser, Karel Hron, Clemens Reimann (2009), "Univariate statistical analysis of environmental (compositional) data: Problems and possibilities", Austria, ScienceDirect, STOTEN-11466.


| S | Author | Year | Place | Method of study | Major Findings | Merits | Limitations |
|---|---|---|---|---|---|---|---|
| 1 | Hill A.G., Muyeed A.Z., Al-Lawati J.A. [19] | 2000 | Oman | Numerical and analytical | Diabetes has become a major chronic (or non-communicable) disease problem in Oman. The prevalence of diabetes varies among the governorates of Oman; the percentage for diabetes in 2000 is 13%. | Awareness of chronic disease has grown in Oman. | The percentage for diabetes in Oman was the highest reported in the Arab countries. |
| 2 | Khatib O. [18] | 2004 | Oman | Analytical | The incidence of chronic diseases is rising in the Middle East region. In 2000, 47% of the region's disease load was due to non-communicable diseases, and this is expected to rise to 60% by the year 2020. | There are many strategies: developing national strategies, policies and plans for prevention and care, and implementing and enhancing community participation in prevention and care. | The risk factors associated with chronic diseases are physical inactivity, unhealthy diet and smoking. |
| 3 | World Health Organization [3] | 2005 | Geneva | Analytical and numerical | There were about 58 million deaths in 2005, of which approximately 35 million resulted from chronic diseases, about 60% of global deaths. If current trends continue, chronic diseases will account for about 70% of total deaths globally by 2030. | Many governments around the world have spent billions of dollars on medication to control and prevent the spread of chronic diseases. | Chronic diseases are a threat that is growing over time and will affect all countries. |
| 4 | Jawad A. Al-Lawati, Ruth Mabry and Ali Jaffer Mohammed [2] | 2008 | Oman | Analytical | Chronic diseases pose the main challenge for the Omani population. During the period 1990 to 2005, approximately 75% of the disease burden is attributable to chronic diseases. | Health planners and decision makers in Oman have a greater commitment to the provision of services for people with chronic diseases. | Chronic diseases are the major drain on human and financial resources for Oman, and this will affect the advances that have been achieved in the health care system. |
| 5 | Gregory Hartl, Menno van Hilten [1] | 2012 | Geneva | Analytical | Chronic diseases cause more than 60% of all deaths in the Gulf Cooperation Council (GCC) countries. All the GCC countries are adopting a regional strategy to address, prevent and control chronic diseases. | The regional strategy in all the GCC countries aims to reduce people's exposure to risk factors and to improve the services to prevent and treat these health problems. | There are many risk factors related to chronic diseases, such as unhealthy diet, tobacco use and physical inactivity. |
| 6 | Ralph D'Agostino, Lisa M. Sullivan [17] | 2002 | Boston (US) | Logistic regression analysis and survival analysis | The model for chronic disease consists of four stages or phases: disease free, pre-clinical (latent period), clinical manifestation, and follow-up. Good statistical approaches involve hypothesizing models for these stages, collecting appropriate data, and then fitting and testing the appropriate models. The risk factors are age, gender, smoking status, blood pressure and cholesterol. | Logistic regression analysis can examine the effects of risk factors on the development of disease. Survival analysis (time to event data) deals with the time until the event of interest occurs. | There are many methodological issues related to chronic disease analysis. The first is that the definitions of risk factors and outcomes change over time. The second is missing data. |
| 7 | Kelly H. Zou, Kemal Tuncali, Stuart G. Silverman [27] | 2003 | USA | Simple linear regression | Simple linear regression measures the linear relationship between a predictor variable and an outcome variable. | The values of dependent variables can be estimated from the observed values; it summarises the real observations of the model. | Missing values are a common problem. |
| 8 | Christoph Klose, Marion Pircher, Stephan Sharma [23] | 2004 | UK | Autoregressive moving average (ARMA) linear model | Time series can be forecast with different linear models, including the autoregressive (AR), moving average (MA), and ARMA models. Building an ARMA model has three phases: model identification, estimation (or fitting) and validation. | The ARMA model process is stationary; it can also minimize the number of parameters. | The ARMA model is suitable only for stationary time series (for example, the mean and variance should be constant over time); at least 40 observations are required in the input data; the values of the estimated parameters are assumed to be constant during the series. |
| 9 | Peter Filzmoser, Karel Hron, Clemens Reimann [29] | 2009 | Austria | Multivariate data analysis | Statistical data analysis starts with looking at the data with appropriate graphical tools. For instance, a histogram gives an idea of the data distribution. | It can estimate and show the relationship between different variables. | The method is complex and involves a high level of mathematical calculation; the results are sometimes not easy to interpret; it needs a large amount of data. |
| 10 | Astrid Schneider, Gerhard Hommel, and Maria Blettner [22] | 2010 | Germany | Linear regression analysis | There are three types of regression analysis: linear regression, logistic regression, and Cox regression. Linear regression is the most common approach or tool used for modelling univariate time series, and it is used for statistical and predictive analysis. | It describes the relationships between dependent and independent variables; the values of dependent variables can be estimated from the observed values; the risk factors that affect the outcome can be identified; it summarises the real observations of the model. | The most common problem in medical data analysis is missing values. |
| 11 | Choong-Yeun Liong and Sin-Fan Foo [24] | 2013 | Malaysia | Logistic regression (LR) analysis and linear discriminant analysis (LDA) | Two methods are used for classifying data, LR and LDA; both depend on a set of predictor variables. LR is more robust than LDA. | LR performs better with respect to the distribution of the data; LDA needs short computing time with a large sample size. | LR needs long computing time with a large sample size; LDA gives low performance when the prior probability is equal for all groups. |
| 12 | Adela SASU [26] | 2013 | Romania | ARIMA; linear regression (LR); multilayer perceptron network (MLP) | The purpose of the forecasting process is to observe or model the existing data series in order to predict future unknown data values accurately, for example after five years. | LR and MLP give more accurate predictions. | The ARMA model is suitable only for stationary time series; ARIMA gives less accurate prediction results. |
| 13 | Suzilah Ismail, Rohaiza Zakaria and Tuan Zalizam Tuan Muda [25] | 2014 | Malaysia | Quantitative (projective) forecasting technique | Forecasting is not simple, because it requires expert and implicit knowledge to produce precise forecast values. For statistical data, the quantitative forecasting technique is suitable. | The technique gives valuable judgment; it also gives more weight to recent data. | The technique requires a large amount of data in order to formulate the mathematical model. |
| 14 | Jason Dykes [12] | 1998 | UK | Cartographic visualization | Cartographic visualization is a map-based method. It locates symbols on a plane to show the statistical distributions of one or more variables. Local statistics and indicators can also be calculated and visualized dynamically for exploration. | It uses locations inside the map to show data values; it is interactive. | Very hard for automatically created dynamic maps. |
| 15 | Ben Fry [15] | 2008 | USA | Mapping | Within a small space, the visual can communicate (or announce) more information than the table. | It allows comparison between different areas; it presents the geographic location and the distribution of the data. | The size of a symbol may hide the location on the map; it takes more time to execute, depending on the size of the data. |
| 16 | Adriana REVEIU, Marian DARDALA [16] | 2011 | Romania | Cartographic visualization | Cartographic visualization provides the facilities to represent statistical data inside the map. | It shows the distribution of statistical data inside the regional map; it uses different symbols to show more information at the same time inside the map. | It takes more time to execute when the volume of data increases. |
| 17 | Statistics Netherlands [13] | 2012 | Netherlands | Proportional symbol map | The proportional symbol map is a good way of viewing statistical data. It aims to show the characteristics of a subject on the map. | It presents the geographic location and the distribution of the data; it allows comparison between different areas; it summarises large amounts of data. | It is difficult to calculate the actual value (if not shown); it takes more time to execute; the size of a symbol may hide the location on the map. |
| 18 | Mohammad S. Alam [28] | 2013 | Russia | Tree-map; Circle Packing; Sunburst; Circular Network Diagram | One of the main requirements that analyst tools should meet is to show more than one view per representation display. The Circular Network Diagram is the most appropriate method for visualizing statistical data. | Tree-map: hierarchical grouping clearly shows data relations. Circle Packing: space-efficient. Sunburst: easily perceptible by most humans. Circular Network Diagram: allows relative data representation; inside the circle, the resolution varies linearly, increasing with radial position. | Tree-map: not suitable for examining historical trends and time patterns. Circle Packing: same as for the Tree-map method. Sunburst: same as for the Tree-map method. Circular Network Diagram: objects with the smallest parameter weight can be suppressed by larger ones. |
| 19 | Heinrich Hartmann [14] | 2016 | USA | Histograms; line plots; rug plots; scatter plot | Visualization is one of the most essential data analysis methods. The histogram is a popular visualization method, commonly used to show the distribution of numerical data. | The histogram shows the number of values within an interval; the line plot shows changes in data over time; the scatter plot shows the relationship between two variables. | The histogram is used only for numerical data; the line plot is difficult to read accurately if there is a wide range of data; the scatter plot cannot show the relation of more than two variables. |
