
Location Database Clustering to Achieve Location Management Time Cost Reduction in a Mobile Computing System
Chen Jixiong, Li Guohui, Xu Huajie, Cai Xia*, Yang Bing
School of Computer Science & Technology, Huazhong University of Science & Technology
Department of Computer Science, Huazhong Normal University*
Wuhan, P.R.China, 430074

Abstract-An important issue in the design of a mobile computing system is how to manage the location information of mobile clients. Existing commercial cellular mobile computing systems adopt a two-tier architecture, which is not scalable. In the literature, a hierarchical database structure has been proposed in which the location information of mobile clients within a cell is managed by the location database responsible for the cell. The location databases of different cells are organized into a tree-like structure to facilitate the search of mobile clients. This architecture can distribute the workload among the location databases in the system. The organization of the location databases has a large impact on the total location management cost. However, none of the previous work addresses this problem. In this paper, considering the mobility patterns of mobile clients in different cells, we give a heuristic algorithm to cluster the location databases with the objective of reducing the location management cost. Extensive experiments are designed to evaluate the performance. The comparison shows that our location database clustering method reduces the location update cost.

Index Terms- Mobile computing system, location management time cost, location database, location database clustering, set cover

I. INTRODUCTION

One of the most important issues in the design of mobile computing systems is the location management of mobile clients [1, 2, 3, 4]. This is not only essential for providing efficient mobile communication services among mobile clients but also important to many new mobile computing applications, such as systems that support location dependent queries [5].

In the existing cellular mobile networks, a two-tier location database architecture is adopted to manage the locations of mobile clients [6]. This two-tier architecture is simple and easy to implement. However, it has two serious performance problems, which make it unsuitable for many new mobile computing applications. Firstly, the number of mobile clients in a mobile computing system can be very large, and the system may need to maintain a large amount of location information for its mobile clients, so the two-tier architecture is not scalable. Secondly, because a mobile client is permanently associated with a Home Location Register (HLR), the overheads for maintaining the locations of mobile clients can be very heavy if the mobility of clients is high. The cost of locating a mobile client depends greatly on the locations of the calling mobile client and the called mobile client. If a called mobile client is far away from its HLR, locating it can be very expensive.

In order to improve the system performance and to reduce the total cost of locating a mobile client, various strategies have been proposed. One of the efficient methods is to organize the location databases in a hierarchical structure [1, 7]. Although the hierarchical structure of the location databases can improve the performance of searching for mobile clients, the total location update overheads for maintaining the locations of mobile clients can also be heavy. In the organization of the hierarchical databases, it is important to consider the update cost as well as the movements of mobile clients, i.e., the mobility patterns. Since different mobile clients in the system can have very different mobility patterns, the location database organization has a large impact on the location management cost. However, to the best of our knowledge, there is still no detailed study of the relationship between the location database organization and the location management cost.

In this paper, we study the location database (LDB) organization problem in hierarchical databases for mobile computing systems over cellular networks. Our objective is to design a nearly ideal location database topology structure that reduces the location management cost. Designing an ideal LDB topology structure is NP-hard. We therefore give a heuristic algorithm, based on set covering theory, to design a nearly ideal topology structure for the LDB organization.

The remaining parts of this paper are organized as follows. Section II defines the system model. Section III gives the basic definitions used in the paper. Section IV presents a heuristic LDB organization method based on set covering theory. Section V presents the performance evaluation of the algorithm, and Section VI is the conclusion.

II. CELLULAR MOBILE NETWORK AND LOCATION DATABASES MODEL

0-7803-9335-X/05/$20.00 ©2005 IEEE 1328


A cellular mobile network is based on the concept of frequency reuse. The whole service area is divided into a number of connected cells with a base supporting station (BSS) in each cell. The limited wireless bandwidth is partitioned into channels. The mobile clients within a cell communicate with the BSS of the cell through a wireless channel.

Each BSS maintains a location database. The location databases of different neighboring BSSs are organized into a hierarchical tree structure, as shown in Figure 1. Note that in an existing GSM mobile network, the location databases are managed by mobile switching centers. There are a large number of mobile clients in the system. They may make a call or submit a query on the locations of other mobile clients. It is assumed that different mobile clients can have different mobility patterns.

Figure 1. Hierarchical databases to store the mobile clients' locations (leaf location databases for Cell 1 to Cell 9)

III. PRINCIPLES AND DEFINITIONS

A. Terminology

Since the location databases are organized into a tree-like structure, we define the least common ancestor of (leaf) location databases DBi and DBj as LCA(DBi, DBj). The height of LCA(DBi, DBj) above the leaf nodes of the tree is denoted as lca(DBi, DBj). (We assume that all of the leaf nodes of the location databases are at the same level in the hierarchical location database tree.) Let the leaf location database responsible for cell i be LDB(i).

Definition 1: The distance between cells i and j, termed dis(i, j), is defined as the height of the least common ancestor of the responsible leaf location databases, that is

    dis(i, j) = lca(LDB(i), LDB(j))

If i = j then dis(i, j) = 0.

Let cur_cell(x) be the cell in which mobile client x is now residing.

Definition 2: The distance between mobile clients x and y, termed dis(x, y), is defined as the distance between the two cells in which x and y are now residing, namely dis(x, y) = dis(cur_cell(x), cur_cell(y)).

The above definition captures the locality of two mobile clients. A mobile client incurs a smaller cost to find the location information of another mobile client if the distance between them is smaller [1].

When calculating the location management cost, as in [1, 2], we consider the following related time costs in the location update and lookup procedures:

tL: The time cost of following a link in the tree of the location databases, i.e., sending a message to the parent or the child node of a location database;
tU: The time cost of a database update;
tQ: The time cost of a database query.

B. Problem

When a mobile client x crosses the boundary between two cells and moves from its formerly residing cell, old_cell(x), into the currently residing cell, cur_cell(x), the total location update cost is:

    Update_cost = 2 dis(old_cell(x), cur_cell(x)) × tL + (2 dis(old_cell(x), cur_cell(x)) + 1) × tU    (1)

As can be seen from the above equation, the location update time cost grows linearly with the distance between the starting and destination cells of a mobile client. However, the distance between two cells in a location database hierarchical tree is not necessarily the same under different LDB topological structures. The hierarchical structure of the location databases can therefore have a significant impact on the location management cost. Before constructing the location database tree, we should consider the mobility patterns of all the mobile clients to optimize the total location management cost.

Figure 2. A mobile computing system consisting of 9 cells

Let us consider a more detailed example to illustrate the problem. Suppose there are 9 cells in the system as in Figure 2, and the 9 location databases responsible for the 9 cells are organized as in Figure 1. During a time period, there are 200 border-crossings between Cell 3 and Cell 4, and 50 border-crossings between Cell 2 and Cell 3. According to Equation (1), the total time cost to update the 200 border-crossings between Cell 3 and Cell 4 is:

    200 × (2 × dis × tL + 2 × dis × tU + tU) = 200 × (2 × 2 × tL + 2 × 2 × tU + tU) = 800tL + 1000tU

where dis is the distance (see Definition 1) between the two leaf location databases responsible for Cell 3 and Cell 4, namely 2.
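Definition 1 and Equation (1) can be checked with a short sketch. It assumes a complete tree with branching factor d whose leaves are numbered Cell 1, Cell 2, ... from left to right (which matches the distances quoted for Figure 1 when d = 3). The function names `dis` and `update_cost` are illustrative and not taken from the paper.

```python
def dis(i, j, d=3):
    """Height of the least common ancestor of the leaves for cells i and j.

    Assumes a complete d-ary tree with leaves numbered 1, 2, ... left to right.
    """
    a, b = i - 1, j - 1
    height = 0
    while a != b:
        # Move both leaves one level up until they meet at a common ancestor.
        a //= d
        b //= d
        height += 1
    return height


def update_cost(crossings, distance):
    """Coefficients (of tL, of tU) from Equation (1), summed over all crossings."""
    return (crossings * 2 * distance, crossings * (2 * distance + 1))


# 200 border-crossings between Cell 3 and Cell 4, which are at distance 2.
print(dis(3, 4), update_cost(200, dis(3, 4)))  # 2 (800, 1000)
```

The returned pair is read as 800tL + 1000tU, the value derived above.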
The location update time cost for the 50
border-crossings between Cell 2 and Cell 3 is: 100tL + 150tU. Therefore, the total location update time cost is 900tL + 1150tU.

If we reorganize the location databases by switching the positions of DB5 and DB7 with each other in the hierarchical tree, the total time cost to update the 200 border-crossings between Cell 3 and Cell 4 becomes:

    200 × (2 × dis × tL + 2 × dis × tU + tU) = 200 × (2 × 1 × tL + 2 × 1 × tU + tU) = 400tL + 600tU

The total time cost to update the 50 border-crossings between Cell 2 and Cell 3 becomes:

    50 × (2 × dis × tL + 2 × dis × tU + tU) = 50 × (2 × 2 × tL + 2 × 2 × tU + tU) = 200tL + 250tU

The total location update time cost in the reorganized case is 600tL + 850tU, and the location update time saving is 300tL + 300tU. So the LDB topology structure has a great impact on the location management cost.

IV. LOCATION DATABASE ORGANIZATION

Before the LDB organization, we construct a mobility matrix, M, in which Mij is the number of boundary crossings between cell i and cell j during a time period. Table I is an example mobility matrix.

TABLE I
AN EXAMPLE OF MOBILITY MATRIX

      C1   C2   C3   C4   C5   C6   C7   C8   C9
C1     -  100    0   30   25    0    0    0    0
C2   100    -   80   40   20   30    0    0    0
C3     0   80    -    0   10   30    0    0    0
C4    30   40    0    -  120    0   80   25    0
C5    25   20   10  120    -   15  160   30   50
C6     0   30   30    0   15    -    0  100   60
C7     0    0    0   80  160    0    -   40    0
C8     0    0    0   25   30  100   40    -   75
C9     0    0    0    0   50   60    0   75    -

In the following sections, we give a set-covering algorithm to organize the location databases with the objective of minimizing the total location management time cost.

Definition 3: A set of sets S1, S2, ..., Sn is called a cover of a set S if and only if

$$S = \bigcup_{i=1}^{n} S_i$$

Definition 4: A set of sets S1, S2, ..., Sn is called a non-overlapping cover of a set S if and only if

$$\text{(1)}\quad S = \bigcup_{i=1}^{n} S_i$$

$$\text{(2)}\quad \forall i, j,\ 1 \le i, j \le n,\ i \ne j:\ S_i \cap S_j = \emptyset$$

In general, a (non-overlapping) cover of a set is associated with a cost, and we try to find an algorithm to get a cover of a set with the minimum cost. Finding a (non-overlapping) cover of a set with the optimal cost is NP-hard [8]. Under different circumstances, heuristic algorithms are given to get a nearly ideal result.

In the following, the problem of location database structuring is transformed to the problem of getting a non-overlapping cover of the set of location databases (the associated cost is explained later) where (1) Set S is the set of leaf location databases in the system; (2) The objective function is the total location update time cost in the given location database topological structure.

To get a location database organization with a nearly ideal location update time cost, a set of sets is constructed as follows:

(1) For ∀i, 1≤i≤n, Si = {Cell i}
(2) For each Cell j, if the number of border-crossings between Cell i and Cell j (denoted as Mij) is larger than 0, then Si = Si ∪ {Cell j}

Obviously, S1, S2, ..., Sn is a cover (with overlapping) of the set S = {Cell 1, Cell 2, ..., Cell n}. Here n is the total number of cells in the system. Based on this cover of S, we get a non-overlapping cover of S. Note that here the cost of the set covering of the n leaf location databases corresponding to the n cells in the system is the total number of boundary crossings between two cells in two different sets. That is

$$\text{num-crossing} = \sum \{\, M_{ij} \mid \exists p, q,\ p \ne q,\ \text{Cell } i \in S'_p \wedge \text{Cell } j \in S'_q \,\}$$

where S'p and S'q are two different sets in a covering of the set of the n cells. After the construction of the above objective function, we give a heuristic algorithm to get a nearly ideal covering of the set of n cells (leaf location databases corresponding to the n cells) with the objective of minimizing num-crossing.

Then each component of the cover (a leaf location database set composed of d location databases, where d is explained below) forms a cluster. The following is the procedure to get the non-overlapping cover. In what follows, the number of elements in a set S is denoted as |S|, and d is the number of sub-location databases of an inner location database in the hierarchical tree. In Figure 1, d is 3.

PROCEDURE LDB-structuring
INPUT: Mij, S1, S2, ..., Sn
OUTPUT: A topology structure for the location database tree
setid = 1;
REPEAT UNTIL ∪i Si = ∅
{
    (1) Suppose that Mij is the maximal value in the set {Mab | ∃r,s, 1≤r,s≤n, (Cell a ∈ Sr ∧ Cell b ∈ Ss)}
    (2) S'setid = {Cell i, Cell j}
    (3) Delete Cell i/Cell j from any Sk (1≤k≤n) which includes Cell i/Cell j
    (4) WHILE |S'setid| < d DO
    {
        1 Compose the temporary set TS as follows: TS = ∅;
        2 For ∀Cell p in the system
              If (∃k, 1≤k≤n (Cell p ∈ Sk)) ∧ (Cell p ∉ S'setid) ∧ (∃Cell q ∈ S'setid (Mpq > 0))
                  TS = TS ∪ {Cell p}
        IF TS ≠ ∅
        {
            3 For ∀Cell m in TS, calculate total-num_m according to the following equation:
                  total-num_m = Σ { Mim | Cell i ∈ S'setid }
            4 Select the Cell x from TS with the maximal value of total-num_m defined as above.
        }
        ELSE
            Select a Cell x randomly from any Si (1≤i≤n)
        5 S'setid = S'setid ∪ {Cell x}
        6 For all the sets Si (1≤i≤n)
              Si = Si - {Cell x}
    }
    setid = setid + 1;
}

After the above procedure terminates, the leaf location databases corresponding to each set S'setid (setid = 1, 2, ..., n/d) form a cluster. The clusters of the leaf location databases are organized recursively in the same way to reduce the total location update time cost. Due to the limitation of space, the detailed procedure is omitted.

V. SIMULATION PERFORMANCE EVALUATION

As can be seen from the construction of the covering of the n location databases, location databases between which there are a larger number of boundary-crossings are clustered together to reduce the distance between them. This reduces the location update cost. To see to what extent the algorithm can reduce the location update cost, we first give an algorithm under which the location databases are connected with their randomly chosen neighboring location databases (we call this method Randomly Clustering: RC). Then we design simulation experiments and compare the performance of that algorithm with that of the algorithm proposed in this paper (Clustering based on Set Covering: CSC).

In the simulation experiment, there are 6561 (= 3^8) cells (corresponding to 6561 leaf location databases) in the system, and all the location databases are organized into a hierarchical tree. There are 100,000 mobile clients scattered randomly in the 6561 cells. In the location database tree, each inner location database has 3 sub-location databases and the depth of the location database tree is 8. In other words, the maximal distance between any two leaf location databases is 8. The number of boundary-crossings per time unit for all the mobile clients follows a normal distribution. We argue that even if, in a real application, the number of boundary-crossings for mobile clients does not follow a normal distribution, these experiments can show that the total location management cost can be reduced by structuring the location databases deliberately.

As can be seen from Equation (1), the location update cost grows linearly with the distance between the two leaf location databases responsible for the corresponding two cells. In the experiments, when comparing the location update costs, we only need to count the total distance for all the boundary-crossings.

The following table describes the key parameters in the experiments.

TABLE II
THE KEY PARAMETERS IN THE EXPERIMENTS

Parameter                               Value
Number of cells                         6561
Number of sub-LDBs for an inner LDB     3
Depth of the LDB hierarchical tree      8
Number of mobile clients                100,000

We design two sets of experiments. In one set of experiments, we assign a static number (= 4) to the mean value (for the normal distribution) of the number of boundary-crossings (NBC) per time unit for a mobile client and change the deviation (for the normal distribution) of the number of boundary-crossings. Then we compare the location update cost per time unit under the two location database tree organization methods, RC and CSC. The performance comparison is shown in Figure 3.

Figure 3. Location update comparison (NBC=4). [Plot: location update cost per time unit versus the deviation of the NBC, for RC and CSC.]
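The clustering step of Section IV and the distance-counting measure used in these experiments can be illustrated on the small mobility matrix of Table I. The following is a minimal one-level sketch, not the authors' implementation: it assumes d = 3, treats distances as 1 inside a cluster and 2 across clusters (a two-level tree as in Figure 1), omits the recursive clustering of higher levels, and replaces the procedure's random fallback with a deterministic lowest-index choice. The names `greedy_clusters` and `total_distance` are hypothetical.

```python
from itertools import combinations

# Table I mobility matrix (symmetric); index 0 stands for Cell 1.
M = [
    [0, 100, 0, 30, 25, 0, 0, 0, 0],
    [100, 0, 80, 40, 20, 30, 0, 0, 0],
    [0, 80, 0, 0, 10, 30, 0, 0, 0],
    [30, 40, 0, 0, 120, 0, 80, 25, 0],
    [25, 20, 10, 120, 0, 15, 160, 30, 50],
    [0, 30, 30, 0, 15, 0, 0, 100, 60],
    [0, 0, 0, 80, 160, 0, 0, 40, 0],
    [0, 0, 0, 25, 30, 100, 40, 0, 75],
    [0, 0, 0, 0, 50, 60, 0, 75, 0],
]


def greedy_clusters(m, d):
    """Group cells into clusters of size d, most-crossed pairs first."""
    remaining = set(range(len(m)))
    clusters = []
    while remaining:
        if len(remaining) >= 2:
            # Seed the cluster with the remaining pair that has the most crossings.
            pairs = combinations(sorted(remaining), 2)
            a, b = max(pairs, key=lambda p: m[p[0]][p[1]])
            cluster = [a, b]
        else:
            cluster = [min(remaining)]
        remaining -= set(cluster)
        while len(cluster) < d and remaining:
            # Add the cell with the most crossings into the current cluster.
            # Ties (including all-zero totals) go to the lowest cell index,
            # which is an assumption made here to keep the result deterministic.
            x = max(sorted(remaining), key=lambda c: sum(m[i][c] for i in cluster))
            cluster.append(x)
            remaining.discard(x)
        clusters.append(sorted(cluster))
    return clusters


def total_distance(m, clusters):
    """Sum of crossings weighted by distance: 1 within a cluster, 2 across."""
    group = {c: g for g, cl in enumerate(clusters) for c in cl}
    return sum(
        m[i][j] * (1 if group[i] == group[j] else 2)
        for i, j in combinations(range(len(m)), 2)
    )


default = [[0, 1, 2], [3, 4, 5], [6, 7, 8]]  # leaves kept in cell order
clustered = greedy_clusters(M, d=3)
print(clustered, total_distance(M, default), total_distance(M, clustered))
```

On Table I this sketch groups Cells {4, 5, 7}, {1, 2, 3}, and {6, 8, 9}, and the weighted distance drops from 1810 for the cell-order grouping to 1465.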



As can be seen from Figure 3, a larger deviation of the NBC means that there is more difference in the actual number of boundary-crossings per time unit for different mobile clients. In other words, there is more difference in the mobility patterns of different mobile clients. By using the CSC method, we can achieve a significant location update cost reduction. In contrast to the RC method, in which the total location update cost remains almost the same, in the CSC case the total location update cost decreases gradually. This is because even though the total number of boundary-crossings remains stable (because the mean value of NBC remains the same), the distances for the frequent boundary-crossings are reduced by the CSC method, which reduces the total location update cost.

In the other set of experiments, we change the mean value of the NBC for a mobile client, and the deviation of the NBC is set to half of the mean value. The performance comparison is shown in Figure 4. In both cases, we can see that the total location update cost increases with the mean value of NBC. But the location update cost increases more slowly with the CSC method than with the RC method. Furthermore, the location management cost with the CSC method is much lower than that with the RC method. This is because, as the NBC mean value increases, the NBC deviation also increases (it is half of the NBC mean value), which means greater difference among the mobility patterns of mobile clients. This gives the CSC method more room to organize the location databases, which means more location update cost reduction. The larger reduction appears in Figure 4 as a larger gap in location update cost between the RC method and the CSC method.

Figure 4. Location update cost comparison (Variable Mean Value). [Plot: location update cost per time unit versus the mean value of NBC, for RC and CSC.]

VI. CONCLUSION

Maintaining the real-time locations of mobile clients is essential to many new mobile computing systems. The traditional two-tier location database architecture is not scalable and is unsuitable for new applications in which there are a very large number of mobile clients. A hierarchical location database architecture has been proposed to distribute the location update and search workload among different location databases. However, if the location databases are not properly organized, the total location management cost can be very high. When organizing the location databases, we should take into account the mobility patterns of the mobile clients. This paper focuses on this problem with the objective of reducing the location management cost.

When organizing the location databases, we should consider not only the number of boundary-crossings between different cells but also the number of calls between them. Furthermore, a mobile client can change its mobility pattern after the location database tree has been constructed, so the performance of an ideal location database topological structure can deteriorate over time. A location database re-organization procedure is needed to accommodate the mobility pattern variation. We are currently concentrating on these research topics to achieve better location management performance.

REFERENCES

[1] Evaggelia Pitoura and George Samaras, "Locating Objects in Mobile Computing", IEEE Transactions on Knowledge and Data Engineering, vol. 13, no. 4, 2001.
[2] Sajal K. Das and Sanjoy K. Sen, "Adaptive Location Prediction Strategies Based on a Hierarchical Network Model in a Cellular Mobile Environment", The Computer Journal, vol. 42, no. 6, 1999.
[3] H. Xie, S. Tabbane and D. Goodman, "Dynamic Location Area Management and Performance Analysis", in Proceedings of the 43rd IEEE Vehicular Technology Conference, May 1993.
[4] D. Plassmann, "Location Management Strategies for Mobile Cellular Networks of 3rd Generation", in Proceedings of the 44th IEEE Vehicular Technology Conference, June 1994.
[5] Hüseyin Gökmen Gök and Özgür Ulusoy, "Transmission of Continuous Query Results in Mobile Computing Systems", Information Sciences, vol. 125, no. 1-4, pp. 37-63, 2000.
[6] M. Mouly and M.B. Pautet, The GSM System for Mobile Communication, Cell and Sys, 1992.
[7] E. Pitoura and I. Fudos, "An Efficient Hierarchical Scheme for Locating Highly Mobile Users", in Proceedings of the 6th ACM International Conference on Information and Knowledge Management (CIKM98), November 1998, pp. 218-225.
[8] V. Chvatal, "A Greedy Heuristic for the Set-Covering Problem", Mathematics of Operations Research, 1975, pp. 233-235.
