discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/220231728
CITATION READS
1 4
2 authors, including:
Heng Li
The Hong Kong Polytechnic University
351 PUBLICATIONS 8,328 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Heng Li on 10 July 2017.
HENG LI XINDONG WU
Article views: 13
Download by: [Hong Kong Polytechnic University] Date: 10 July 2017, At: 04:13
LEARNING DESIGN KNOWLEDGE AS
GENERALIZATION
HENG LI
Department of Building and Real Estate
Hong Kong Polytechnic University
XINDONG WU
Department of Software Development, Monash
University, Melbourne, Australia
Addre ss corre sponde nce to He ng Li, De partme nt of Building and Real Estate , Hong
Kong Polyte chnic Unive rsity, Kowloon, Hung Hom , H ong Kong. E-m ail: BSH engLi@
Polyu.e du.hlc
The authors are inde bte d to the two anonymous re viewe rs for the ir constructive
comm ents on the first version of this pape r.
this ability in knowle dge -base d de sign systems motivates the re search on
ge ne ralizing de sign knowle dge from de sign e xample s. Ge ne ralization
can be de scribed as a proce ss of taking into account a large num ber of
de sign e xample s, the n e xtracting and re taining prope rtie s and value s
that characte rize the se e xample s. The e xtracted prope rties and value s
are re pre sented in forms that can assist future de sign proje cts.
Ge ne ralizing conce ptual de scriptions s conce pts . from de sign e xam-
ple s is an inte rsectional re se arch area be twe e n machine le arning and
de sign re search. Its e mphasis on ge ne ralizing de sign-supporting, rathe r
than ge ne ric, de scriptions distinguishe s itse lf from othe r ge ne ralization
proble ms. The re are a num ber of applications that de m onstrated the
strength of using ge ne ralization to acquire de sign knowle dge s Arcisze w-
ski e t al., 1987; He rrmann & Be ckmann, 1994; Kamal e t al., 1989; Li,
1993, 1994; Mahe r & Li, 1994; Lu & Tche ng, 1991; Re ich, 1993;
W hite hall e t al., 1990 .. Although the ge ne ralization technique s and
re sults are we ll pre sented in the se applications, the the ore tical princi-
ple s upon which the y are built are rare ly e xplaine d. Lack of a the ore ti-
cal foundation make s re search in ge ne ralizing de sign knowle dge an
ill-de fine d task, conducte d in a ``blind-searching’’ manne r.
This pape r attem pts to ide ntify e sse ntial tasks for ge ne ralization
proble ms in de sign. It focuse s on the differe nces betwe e n ge ne ralization
proble ms in de sign and ge ne ralization proble ms in othe r proble m-
solving dom ains. The role s of the repre sentation and the de scription
language for de sign e xamples are ide ntified. Se ve ral ge ne ralization
strategie s are de scribed through an illustration of ge ne ralizing de sign
conce pts from de sign e xample s.
The third task is to de term ine or de vise a gene ralization ope rator
that will match the de sign e xample s to the knowle dge re pre sentation.
This require s colle cting e xisting gene ralization s le arning . me thods and
choosing suitable one s to adopt, or cre ating ne w me thods based on an
unde rstanding of the limitations of e xisting one s.
The following subsections furthe r address the se tasks in turn.
Spe cifically, the y ide ntify the role of knowle dge re pre sentation in ge ne r-
alization, discuss the importance of a de scription language for de sign
e xample s and the ir ge ne ralizations, and outline ge ne ral principles for
se le cting and de ve loping ge ne ralization te chnique s to ge ne rate the
targe t knowle dge re prese ntation from de sign e xample s. In the following
se ction, we will de monstrate the utility of the se three tasks through an
application of ge ne ralizing de sign concepts.
The comple xitie s of de sign imply that m any e xisting re pre sentation
sche me s do not address the nature of de sign. Re pre sentations in many
of the ge ne ralization programs are used to pre dict or re cognize an
e xample that has not be e n e ncounte re d. In de sign, the repre sentation is
use d to ge ne rate ne w de signs, not to re cognize or cate gorize e xamples
that have not be e n see n be fore. Many re pre sentation paradigms for
de sign knowle dge have bee n propose d. E xample s include de sign proto-
type s Ge ro, 1990 ., de pe nde nce ne tworks s Duffy & MacCallum, 1989 .,
the object-orie nte d approach s Sriram e t al., 1986, 1991 ., and e mpirical
ne tworks s Li, 1993 .. E ach re pre sentation paradigm offers differe nt
LEARNING DESIGN KNOWLEDGE AS GENERALIZATION 187
* The re is inde ed a re se arch branch in m achine learning taking care of knowle dge
re fine m ent. Som e of the most re ce nt re sults can be found in Ginsbe rg e t al. s 1988 . and
Craw & Sle e man s 1995 .. H owe ver, knowle dge re fine m ent improve s knowle dge , such as
dropping an unne ce ssary condition from a rule or incre asing the stre ngth of particula r
rule s, but doe s not change the knowle dge re pre se ntation sche me of a given syste m .
188 H. LI AND X. WU
othe rs are late nt or lost in de sign proce sses, such as the infe re ntial
proce dure s and he uristics used by the de signe r in making de cisions. To
de te rmine the ir e sse ntial prope rties and a language to de scribe the m
be come s anothe r challe nging issue . Traditionally, the training e xamples
in de sign have be e n handcrafted in an e ffort to strike a balance be twe e n
the following two e xtre me s:
be able to unde rstand and ge ne ralize knowle dge from de sign proce sse s
but also be able to fully utilize the information source s to ge ne ralize
re sults that are use ful in assisting future de sign proble m s. The varie ty of
information sources provides many pote ntial application areas for ap-
plying gene ralization strate gies. The re fore, in orde r to de ve lop a le arn-
ing system for de sign domains, an inte gration of se ve ral ge ne ralization
strategie s may be use d.
The aim of this section is not to e nume rate all ge ne ralization
strategie s e xhaustively but rathe r to give an illustration of how gene ral-
ization strate gies can be applie d to le arn a pre de term ine d de sign
knowle dge repre sentation. Spe cifically, the following subsections pre -
se nt how a de sign re prese ntation is de ve lope d and le arne d using an
inte gration of ge ne ralization strategie s.
Figure 1 shows the overall proce ss of ge ne ralizing e mpirical ne t-
works from de sign e xamples. E mpirical ne tworks are de ve lope d to
re prese nt de sign conce pts that support conce ptual stages of de sign. The
ge ne ralization system has three major le arning m odule s: CO NCE P-
TO R, E FD 1 , and DPF. E ach module use s a spe cific le arning strategy to
de al with part of the de sign information and produce s part of the
e mpirical knowle dge . The combination of the se module s e nables the
formation of e mpirical ne tworks. Spe cifically, CO NCE PTO R is a le arn-
ing module that groups similar de sign e xamples into conce ptual space s
that provide the basis for ge ne rating e mpirical ne tworks. This module
also me asure s de fault value s and value range s for de sign variable s.
value s of 1.2 and 39.7 m. Some attribute s, e ve n though the y can have
quantitative value s, such as cost, cannot be pre cise ly calculate d be cause
the crite ria used to me asure the ir values are changing. The value s of
the se attribute s are base d on a ve rbal asse ssme nt of the quantitative
e le me nts and the refore are re garde d as qualitative. For e xample , the
attribute cost is value d in te rms of low, m e dium , and high.
The fore going formalism for de scribing de sign e xample s is consis-
tent with the way de sign solutions are prese nte d and docume nted. In a
de sign spe cification, de sign re quire me nts are usually re corde d as part of
the docume nt, and structures are mainly de scribed as drawings. The
information on de sign history is usually not include d in the final
docume nts. Howe ve r, the formalism provides little aid in how to trans-
late de sign re quire me nts and structural de scriptions into fe atures and
the ir related value s. This re late s to the e le m e ntary-term proble m
s Saxena, 1989 ., which de als with finding e le m e ntary units of de scription,
or te rms, that are commonly acce pted and ``hardwire d’’ in a spe cific
domain. Conside rable re se arch e ffort is re quired to clarify whe the r
the re are e le me ntary term s in a spe cific de sign domain so that de sign
e xample s can be ``legitimate ly’’ de scribed by the se terms.
The algorithm proce sses all available e xample s, rathe r than charac-
teristic sample s only, and puts the m into differe nt conce pt space s. For
e ach concept space, the algorithm ide ntifies the attribute s associate d
with all the e xample s in the conce pt space .
The intere sting aspe ct of this algorithm is that it gene ralize s multi-
ple conce pt space s; this e nable s the system to circumve nt a lim itation
confronted by many othe r gene ralization systems, as m any curre nt
conce pt formation systems can ge ne ralize only one conce pt from e xam-
ple s. The ability to le arn multiple conce pts is important in de sign,
be cause de signs are ofte n distinguishe d by the diffe rence in the way
the y are de scribe d. This differe nce occurs on at le ast two le vels: the
name s of the attributes use d to de scribe the de sign and the value s that
the attribute s assum e for a give n de sign. Grouping de sign e xamples
according to the similarity of the ir attribute s provide s a technique for
ge ne ralizing differe nt de sign conce pts rathe r than being lim ited to
ge ne ralizing a single conce pt.
LEARNING DESIGN KNOWLEDGE AS GENERALIZATION 197
Two proce sses are use d to ge ne ralize de fault value s for de sign
variable s. The de fault value s for quantitative variable s are de te rmine d
by the we ighte d ave rage of value s of the variable s in a conce pt space .
The de fault values for qualitative variable s are de term ine d using the
freque ncie s in which ce rtain nominal value s occur in the de sign e xam-
ple s of a conce pt space , the m ost fre que nt one s becoming the de faults.
V alue range s for quantitative variable s are range s de fine d by the ir
maximum and minimum limits of the nume rical distribution. V alue
ranges for qualitative variable s are colle ctions of all values that appe ar
in the e xample s of a conce pt space .
1
F igu r e 5 . The le arning me chanism of EFD .
1
Ta b le 2 . Comparis on of re sults from EFD and EFD
3 6.9 ? G H y M S s 0 s C O s 93 % . a 4 7 .6 ? G H y M S s 0 s C O s 86 % .
s 0.45 m ( G H ( 1 .3 2 m ; 48 m ( M S ( 73 m .
4 7.5 ? G H y M S s 0 s C O s 98 % .
s 1.32 m ( G H ( 1 .9 7 m ; 73 m ( M S ( 85 m .
4 3.2 ? G H y M S s 0 s C O s 84 % .
s 1.97 m ( G H ( 2 .0 5 m ; 85 m ( M S ( 10 7 m .
G H y 1.38 ? G W s 0 s C O s 97 % . G H y 1 .2 5 ? G W s 0 s C O s 77 % .
s 0.45 m ( G H ( 1 .6 3 m ; 0.8 m ( G W ( 1 .4 3 m .
G H y 1.17 ? G W s 0 s C O s 85 % .
s 1.63 m ( G H ( 2 .0 5 m ; 1.4 3 m ( G W ( 2.37 m .
D W y 1 0 .3 2 ? G W s 0 s CO s 84 % . D W y 10 .3 2 ? G W s 0 s C O s 93 % .
s 8.3 m ( D W ( 2 8 .4 m ; 0 .8 m ( G W ( 2 .3 7 m .
P H y 0 .95 ? C L s 0 s CO s 93 % . P H y 1.23 ? C L s 0 s C O s 82 % .
s 4.2 m ( P H ( 6 .7 m ; 2 .1 m ( C L ( 5.3 m .
P H y 1 .21 ? C L s 0 s C O s 89 % .
s 6.7 m ( P H ( 1 3 .7 m ; 5 .3 m ( C L ( 1 2 .6 m .
P H y 1 .23 ? C L s 0 s C O s 91 % .
s 13 .7 m ( P H ( 1 8.5 m ; 12 .6 m ( CL ( 16 .4 m .
P H y 1 .23 ? C L s 0 s C O s 88 % .
s 18 .5 m ( P H ( 2 7.4 m ; 16 .4 m ( CL ( 28 .4 m .
D W y 3 .67 ? T L s 0 s C O s 79 % . D W y 3.67 ? T L s 0 s C O s 89 % .
s 8.3 m ( D W ( 2 8 .4 m ; 2 ( T L ( 6 .
M S y 5 9.7 ? G W s 0 s C O s 81 % .
s 48 m ( M S ( 68 m ; 0 .8 m ( G W ( 1 .0 4 m .
MS y 3.67 ? GW s 0 s CO s 86% .
s 68 m ( M S ( 84 m ; 1 .0 4 m ( G W ( 1 .6 3 m .
M S y 3 .6 7 ? G W s 0 s C O s 93 % .
s 84 m ( M S ( 93 m ; 1 .6 3 m ( G W ( 1 .9 4 m .
M S y 3 .6 7 ? G W s 0 s C O s 85 % .
s 93 m ( M S ( 10 7 m ; 1.94 m ( G W ( 2.37 m .
B L y 2.14 ? M S s 0 s C O s 83 % .
s 98 m ( B L ( 2 45 m ; 4 8 m ( M S ( 8 4 m .
B L y 3.61 ? M S s 0 s C O s 88 % .
s 24 5 m ( B L ( 48 1 m ; 84 m ( M S ( 10 7 m .
a
C O , cove rage ra te .
Extracting Design Patterns: W e have com puted the freque ncy distri-
bution Pi s i s 1, . . . , k ., whe re k is the numbe r of fe asible state s. The
distribution is nonne gative and attains the value 0 if the re is no link at
all among the value s of qualitative attribute s be ing conside re d. To
de te rmine de sign patterns, the links with the freque ncies that e xcee d a
pre de termine d thre shold are conside red as strong links, and value s of
qualitative attribute s associated with strong links are e xtracte d. The
e xtracte d attribute -value pairs form a de sign pattern.
Step 1: V ariable s, de fault values, and value range s gene ralize d from the
first m odule constitute the node s and structure of the e mpirical
ne twork.
Step 2: Run the se cond module to ge ne rate e mpirical formulas among
quantitative variable s.
Step 3: Run the third m odule to ge ne rate de sign patterns among
qualitative attribute s.
Inconsistency
It is ne ce ssary to note that an e sse ntial re quireme nt for ge ne ralizing
de sign knowle dge is that the represe ntation, the de scription language ,
and the ge ne ralization te chnique s be consiste nt. Inconsistency m ay arise
if e ithe r s 1 . the de scription language is insufficie nt to prese nt de sign
information for ge ne ralizing the targe t repre sentation or s 2 . the ge ne r-
alization te chnique s are unable to shape the de sign e xample s into form s
de fine d in the knowle dge repre sentation. In ge ne ral, the re are no
facilities to e lude the inconsistency or recove r it during the gene raliza-
tion. Although gene ralization technique s can e asily incorporate proba-
bilistic and statistical me thods and the se m e thods can be e rror re silient
and can improve ge ne ralizations over a pe riod of time , ge ne ralization
technique s do not provide any fe e dback on how to re vise the knowle dge
re prese ntation or the de scription language to im prove the use fulne ss of
the ge ne ralization re sults in de sign. The difficulty in avoiding inconsis-
tency makes the de velopme nt of ge ne ralization systems for de sign an
e xpe rime ntal proce ss.
In concluding, this pape r vie ws a ge ne ralization problem in de sign
as an induction proble m, which involve s the de te rmination of a target
knowle dge re prese ntation for gene ralization and a de scription language
for de sign e xample s, as we ll as the se le ction or de ve lopm e nt of gene ral-
ization technique s to gene rate the knowle dge repre se ntation from de -
sign e xample s. The ge ne ralization proce ss is large ly guide d by the target
re prese ntation. The repre sentation de line ates a ``filtere d’’ pe rce ption of
an inte llige nt obse rver, the de scription language repre senting a le ve l of
LEARNING DESIGN KNOWLEDGE AS GENERALIZATION 205
REFERENCES
Arciszewski, T., M. Mustafa, and W. Z iarko. 1987. A methodology of de sign
knowle dge acquisition for use in learning e xpert syste ms. In t. J. Ma n -
Ma ch in e Stu d ie s 27:23 ] 32.
Brachman, R. J. and H. J. Le ve sque , e ds. 1985. Read in gs in kn ow led ge represen ta -
tio n . Los Altos, CA: Morgan Kaufmann.
Coyne , R. D. 1992. Mode lling the eme rge nce of design de scription across
sche mes. E nv iro n . Plan n in g B 18:427 ] 458.
Coyne , R. D., M. A. Rose nman, A. D. Radford, M. Balachandran, and J. S.
Ge ro. 1990. Kn o w led ge-b ased d esign system s . Reading, MA: Addison-W e s-
le y.
Craw, S., and D. H. Slee man. 1995. Refine ment in response to validation. E xp ert
Syst. Appl. 8 s 3 .:343 ] 349.
Dasgupta, S. 1989. The structure of design proce ss. In Adv an ces in co m pu ters,
V ol. 28, ed. M. Yovits. New York: Acade mic Pre ss.
Duffy, A. H. B., and K. J. MacCallum. 1989. Computer re prese ntation of
num erical expe rtise for preliminary ship design. Ma r. Tech n o l. 26:289 ] 302.
Dym, C. L., R. P. Henche y, E. A. De lis, and H. Gonick. 1988. A knowle dge -base d
system for automated archite ctural code che cking. Co m pu t.-A id ed D esign ,
20 s 3. :137 ] 145.
Eastman, C. 1969. Cognitive proce sses and ill-structure d proble ms: A case study
for de sign. In IJCAI-69 , Washington, DC, pp. 669 ] 690.
Finge r, S., and J. R. Rinde rle . 1990. Transforming behavioral and physical
re presentations of me chanical de sign. Pro ceed in gs of th e First In tern a tio n al
206 H. LI AND X. WU