Hybridization
Integrated architectures for machine learning
have been shown to provide performance
improvements over single representation
architectures.
Integration, or hybridization, is achieved using a
spectrum of module or component architectures
ranging from those sharing independently
functioning components to architectures in which
different components are combined in inherently
inseparable ways.
In this presentation we briefly survey prototypical
integrated architectures
Combinations
The combination of knowledge based systems,
neural networks and evolutionary computation
forms the core of an emerging approach to
building hybrid intelligent systems capable of
reasoning and learning in an uncertain and
imprecise environment.
Current Progress
In recent years multiple module integrated
machine learning systems have been developed
to overcome the limitations inherent in single
component systems.
Integrations of neural networks (NN), fuzzy logic
(FL) and global optimization algorithms have
received considerable attention [Abr] but
increasing attention is being paid to integrations
with case based reasoning (CBR) and rule
induction (RI) [Mar, Pren].
Primary Components
The full spectrum of knowledge representation in
such systems is not confined to the primary
components.
For example, in CBR systems although much
knowledge resides in the case library significant
problem solving knowledge may reside in
secondary technologies such as in the similarity
metric used to retrieve problem solution pairs
from the case library, in the adaptation
mechanisms used to improve an approximate
solution and in the case library maintenance
mechanisms.
MultiComponents
Although it is possible to generalize about the
relative utilities of these component types based
on the primary knowledge representation
mechanisms these generalizations may no
longer remain valid in particular cases
depending on the characteristics of the
secondary mechanisms employed.
Table 1 attempts to gauge the relative utilities of
single components systems based on the
primary knowledge representation.
Degree of Integration
Besides differing in the types of component systems
employed, different integrated architectures have
emerged in a rather ad hoc way, Abraham [Abr].
Least integrated architectures consisting of independent
components communicating with each other on a side by
side basis.
More integration is shown in transformational or
hierarchial systems in which one technique may be used
for development and another for delivery or one
component may be used to optimize the performance of
another component.
More fully integrated architectures combine different
effects to produce a balanced overall computational
model.
Transformational,
hierarchial and integrated
Abraham categorizes such systems as
transformational, hierarchial and integrated. In a
transformational integrated system the system
may use one type of component to produce
another which is the functional system.
For example, a rule based system may be used
to set the initial conditions for a neural network
solution to a problem.
Thus, to create a modern intelligent system it
may be necessary to make a choice of
complementary techniques.
Transformational
Expert systems with neural networks
Knowledge from the ES is used to set the
initial conditions and training set of the NN
Hierarchial Hybrid
An ANN uses a GA to optimize its topology
and the output fed into an ES which
creates the desired output or explanation
Fused Architecture
Table 1 (Adapted from [Abr, Jac] and [Neg]). A comparison of the utility of
case based reasoning systems (CBR), rule induction systems (RI), neural
networks (NN) genetic algorithms (GA) and fuzzy systems (FS), with 1
representing low and 4 representing a high utility.
CBR
KB
NN
GA
FL
Know. rep.
Uncertainty
Approximation (noisy
incomplete data)
Adaptable
Learnable
Interpretable
Interpretability
Synaptic weights in trained neural networks are
not easy to interpret with particular difficulties if
interpretations are required.
Genetic algorithms model natural genetic
adaptation to changing environments and thus
are inherently adaptable and learn well
Not easily interpretable because although the
knowledge resides partly in the selection
mechanism it is in the most part deeply
embedded within a population of adapted genes.
Adaptability
Case based systems are adaptable
because changing the case library may be
sufficient to port a system to a related
area. If changes need to be made to the
similarity metric or the adaptation
mechanism or if the case structure needs
to be changed much more work may be
required.
Learnability
Fuzzy rule based systems offer more
option through which learnability may be
more easily achieved.
Fuzzy rules may be fine tuned by
adjusting the shapes of the fuzzy sets
according to user feedback [Abi]
Example
Neural Expert Systems
Rule: IF - THEN
Inference Engine
Explanation Facilities
User Interface
User
Approximate reasoning
In a rule-based expert system, the inference engine
compares the condition part of each rule with data
given in the database. When the IF part of the rule
matches the data in the database, the rule is fired and
its THEN part is executed. The precise matching is
required (inference engine cannot cope with noisy or
incomplete data).
Neural expert systems use a trained neural network in
place of the knowledge base. The input data does not
have to precisely match the data that was used in
network training. This ability is called approximate
reasoning.
Rule extraction
Neurons in the network are connected by links,
each of which has a numerical weight attached to it.
The weights in a trained neural network determine
the strength or importance of the associated neuron
inputs.
Algorithm
By attaching a corresponding question to each input
neuron, we can enable the system to prompt the user
for initial values of the input variables:
Neuron: Wings
Question: Does the object have wings?
Neuron: Tail
Question: Does the object have a tail?
Neuron: Beak
Question: Does the object have a beak?
Neuron: Feathers
Question: Does the object have feathers?
Neuron: Engine
Question: Does the object have an engine?
Score 1 for yes, -1 for no and 0 for unknown
Use a sign function as the activation and interpret 0 for no and 1 for yes.
x w > w
i
i=1
j =1
Rule: IF - THEN
Inference Engine
Explanation Facilities
User Interface
User