Adaptative Behaviour in Robots

Essay
Evolution of Adaptive Behaviour in Robots by Means of

Darwinian Selection
Dario Floreano1*, Laurent Keller2*
1 Laboratory of Intelligent Systems, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland, 2 Department of Ecology and Evolution, University of
Lausanne, Biophore, Lausanne, Switzerland
Ever since Cicero’s De Natura Deorum and body morphologies, and cooperate. We instead be obtained by using an evolutionary
ii.34., humans have been intrigued by present mostly—but not only—experimental process with mutations and selective repro-
the origin and mechanisms underlying results performed in our laboratory, which duction [13]. The development of computer
complexity in nature. Darwin suggested satisfy the following criteria. First, the algorithms inspired by the process of natural
that adaptation and complexity could experiments were at least partly carried out evolution followed shortly after [14–16], but
evolve by natural selection acting suc- with real robots, allowing us to present a the first experiments on the evolution of
cessively on numerous small, heritable video showing the behaviours of the evolved adaptive behaviours for autonomous robots
modifications. But is this enough? Here, robots. Second, the robot’s neural networks were done only in the early 1990s [17–19],
we describe selected studies of experi- had a simple architecture with no synaptic leading to the birth of the field of evolutionary
mental evolution with robots to illustrate plasticity, no ontogenetic development, and robotics [1,2].
how the process of natural selection can no detailed modelling of ion channels and The general idea of evolutionary robotics
lead to the evolution of complex traits spike transmission. Third, the genomes were (Figure 1 and Video S1) is to create a
such as adaptive behaviours. Just a few directly mapped into the neural network (i.e., population with different genomes, each
hundred generations of selection are no gene-to-gene interaction, time-dependent defining parameters of the control system of
sufficient to allow robots to evolve dynamics, or ontogenetic plasticity). By a robot or of its morphology. The genome
collision-free movement, homing, so- limiting our analysis to these studies we are is a sequence of characters whose transla-
phisticated predator versus prey strate- able to highlight the strength of the process of tion into a phenotype can assume various
gies, coadaptation of brains and bodies, Darwinian selection in comparable simple degrees of biological realism [20]. For
cooperation, and even altruism. In all systems exposed to different environmental example, an artificial genome can describe
cases this occurred via selection in robots conditions. There have been numerous other the strength of synaptic connections of an
controlled by a simple neural network, studies of experimental evolution performed artificial neural network that determines
which mutated randomly. with computer simulations of behavioural the behaviour of the robot. The input
Genes do not specify behaviours directly systems. Reviews of these studies can be neurons of the neural network are activated
but rather encode molecular products that found in [4–6]. Furthermore, artificial evo- by the robot’s sensors, and the output
lead to the development of brains and bodies lution has also been applied to disembodied neurons control the motors of the robot.
through which behaviour is expressed. An digital organisms living in computer ecosys- Within a population, each individual has a
important task is therefore to understand tems, such as Tierra [7] and Avida [8], to different genome describing a different
how adaptive behaviours can evolve by the address questions related to gene interactions neural network (i.e., different connections
mere process of natural selection acting on [9], evolution of complexity [10], and between neurons), thus resulting in specific
genes that do not directly code for behav- mutation rates [11,12]. individual responses to sensory-motor in-
iours. A spectacular demonstration of the teractions with the environment. These
power of natural selection comes from The Principle of Selection in behavioural differences affect the robot’s
experiments in the field of evolutionary Evolutionary Robotics fitness, which is defined, for example, by
robotics [1,2], where scientists have conduct- how fast and straight the robot moves or
ed experimental evolution with robots. The first proposal that Darwinian selection how frequently it collides with obstacles. At
Evolutionary robotics has also been advo- could generate efficient control systems can the beginning, robots have random values
cated as a method to automatically generate be attributed to Alan Turing in the 1950s. He for their genes, leading to completely
control systems that are comparatively suggested that intelligent machines capable of random behaviours. The process of Dar-
simpler or more efficient than those engi- adaptation and learning would be too difficult winian selection is then imitated by selec-
neered with other design methods because to conceive by a human designer and could tively choosing the genomes of robots with
the space of solutions explored by evolution
can be larger and less constrained than that
explored by conventional engineering meth- Citation: Floreano D, Keller L (2010) Evolution of Adaptive Behaviour in Robots by Means of Darwinian
Selection. PLoS Biol 8(1): e1000292. doi:10.1371/journal.pbio.1000292
ods [3]. In this essay we will examine key
Published January 26, 2010
experiments that illustrate how, for example,
robots whose genes are translated into simple Copyright: ß 2010 Floreano, Keller. This is an open-access article distributed under the terms of the Creative
Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium,
neural networks can evolve the ability to provided the original author and source are credited.
navigate, escape predators, coadapt brains
Funding: This work was partly funded by the Swiss National Science Foundation and by the Future and
Emerging Technologies Division of the European Commission. The funders had no role in study design, data
collection and analysis, decision to publish, or preparation of the manuscript.
Essays articulate a specific perspective on a topic of
broad interest to scientists. Competing Interests: The authors have declared that no competing interests exist.
* E-mail: Dario.Floreano@epfl.ch (DF); Laurent.Keller@unil.ch (LK)
PLoS Biology | www.plosbiology.org 1 January 2010 | Volume 8 | Issue 1 | e1000292

Figure 1. Major steps of Darwinian selection with robots. 1) The robots have a neural network with the strength of connections between
neurons determining their behaviour as a function of the information provided by the environment. 2) The fitness f of each robot (i.e., the
performance in the task assigned to them) is measured in the experimental setting using real robots or physics-based simulators. 3) The genomes of
robots with highest fitness are selected to form a new generation. 4) The selected genomes are paired to perform crossover and mutations. 5) The
new genomes are used to perform a new round of selection in the next generation.
doi:10.1371/journal.pbio.1000292.g001
highest fitness to produce a new generation number of individuals over many genera- looping maze (Figure 2, left) with a two-
of robots. In this process, genomes are tions. The evolved genomes can then be wheeled robot equipped with eight dis-
paired (to allow recombination) and ran- implemented in real robots, which have tance sensors (six on one side and two on
dom mutations (e.g., character substitution, been shown to display the same behaviour the other side of the robot). The sensors
insertion, deletion, or duplication) are as observed in simulations for the experi- were connected to eight input neurons that
applied with a given probability to the ments described in this article. were connected to two output neurons,
new genomes. This process of evolution can which each controlled the direction and
be repeated over many generations until a Collision-Free Navigation speed of rotation of one of the wheels
stable behavioural strategy is established. In (Text S1, section 1). The genome of the
some experiments this selective process has Darwinian selection has been used to robots consisted of a sequence of bits
been performed with real robots whereas in investigate whether small-wheeled robots encoding the connection weights between
other experiments physics-based simula- could evolve collision-free navigation, a input and output neurons. Mutations
tions [21] that included models of mass, behaviour that requires appropriate pro- allowed the strengths of connections
friction, gravity, accelerations, and colli- cessing of sensory information and coor- between neurons to change over genera-
sions have been used. Such simulations dinated activation of the motor system. tions. Experimental selection was conduct-
allow one to conduct selection with a large The experiments were conducted in a ed in three independent populations each
Figure 2. Collision-free navigation. A) A Khepera robot tested in a looping maze. B) Trajectory of one of the robots with an evolved neural
controller. The segments represent the axis between the two wheels plotted every 300 ms using an external tracking device.

consisting of 80 individuals [18]. The walls sufficiently in advance at high speed. lasted 150 sensory-motor cycles, a robot
performance of each robot was evaluated Thus, the robots evolved to move at had to return at least twice to the nest to
with a fitness function describing the intermediate speeds because of their limited be able to continue moving throughout the
ability of the robot to efficiently move in neural and sensory abilities. More general- whole experiment. In addition to the eight
the maze. ly, these experiments reveal that a process distance sensors used in the collision-free
Over the first few generations, the robots of selective reproduction with random experiments, robots also had a floor-colour
rapidly improved their ability to move mutations on genes that encode the wiring sensor, enabling them to determine wheth-
without collisions in the looping maze of neural networks can generate coordinat- er they were in the nest; two light sensors
and, within less than 100 generations, most ed navigation behaviour that takes into on their sides, allowing them to locate the
of them exhibited collision-free navigation account not only the environmental char- light tower over their nest (but not
(Figure 2, right, and Video S2). Although acteristics, but also the morphological and sufficient to tell precisely the distance);
the fitness function did not specify in what mechanical properties of the robots. and a sensor giving information on the
direction the robot should navigate (the battery level (Text S1, section 2).
robots were perfectly circular and the Homing Experimental selection was conducted
wheels could rotate in both directions), in a population of 100 individuals [25]. A
the best evolved individuals across all An evolutionary experiment with the robot’s fitness was proportional to the
replicates moved in the direction corre- same robots was conducted to investigate average rotational speed of the two wheels
sponding to the side with the highest whether they could also evolve the ability and distance from the walls (Text S1,
number of sensors. This was because to find their way home, a process that has section 2). After 200 generations of
individuals initially moving in the direction been suggested to require the development selection with real robots, the best indi-
with fewer sensors had higher probability of of internal representations of the environ- viduals performed wide explorations of the
colliding into corners and thus had lower ment [22–24]. To mimic a situation arena, returning to the nest only when
probability of being selected for reproduc- selecting for homing ability, robots were their batteries had approximately 10%
tion. Interestingly, the driving speed of the placed in a dark room with a small light residual energy (Video S3). They stayed in
best-evolved robots was approximately half tower located behind their nest, which the nest only for the time necessary to turn
of the maximum possible speed and did not consisted of a black patch on the floor in and exit. This was because being in the
increase even when the evolutionary ex- one of the corners of a square arena nest only permitted small fitness increase
periments were continued for another 100 (Figure 3, left). Robots initially had a fully as the robots’ distance to the walls was
generations. Additional experiments where charged (simulated) battery that dis- very small.
the speed was artificially increased revealed charged linearly over 50 sensory-motor The ability of robots to arrive to the nest
that fast-moving robots had high rates of cycles. When a robot passed over the black when their batteries reached a very low level
collisions because the 300-ms refresh rate of patch of the nest, its battery was instanta- was mediated by the evolution of a neuronal
the sensors did not allow them to detect neously recharged. As the experiment representation of the environment that
Figure 3. Evolution of homing. A) Experimental setup with a Khepera robot moving in the direction of the nest (recharging station), located in
front of the light tower. B) Trajectory of an evolved robot after 200 generations. The trajectory starts in the lower left corner and ends within the
recharging nest in the top left corner. Each point corresponds to the recording of the robot’s position using an external tracking device. The arena
and the recharging nest were plotted by manually positioning the robot along their contours.

enabled them to combine information on with a 36u field of view. Third, the prey simulations and three replicates of 25
their location and battery level to precisely had a black stick that could be visually generations were conducted with real
time the homing behaviour (Text S1, section perceived by the predator (Text S1, robots [30].
2). The process of neural representation was section 3). These differences allowed Both the simulation and real robot
reminiscent of ‘‘place cells’’ [26] and of predators to detect the prey at a distance experiments led to the generation of, and
‘‘head-oriented cells’’ [27] in the rat hippo- of up to 100 cm, whereas prey could only cycling through, a set of different pursuit
campus, suggesting that artificial organisms detect predators when they were less than and evasion strategies (Video S4). The
may evolve functionally similar internal 0.5 cm away, but the prey could outrun cycle observed in one of the simulation
representations and behavioural strategies the predator. replicates is illustrated in Figure 4. Dur-
as real organisms to solve tasks requiring the One prey and one predator were ing the first generations, most predator
simultaneous processing of different sources placed in pairs in square arenas with and prey robots displayed an uncoordi-
of information. the fitness of the predator being inversely nated behaviour, turning on the spot
proportional to the time it took to catch (Figure 4, box 1). After a few generations,
Predator–Prey Coevolution the prey developed fast motion in the
(i.e., touch) the prey while that of the
Experimental evolution with robots has prey was proportional to the time it environment whereas the predators visu-
also been used to study the coevolutionary managed to avoid being caught by the ally tracked them so as to intercept their
processes between a population of preda- predator. Each predator robot was indi- trajectories (Figure 4, box 2). Some
tor robots and a population of prey robots vidually tested during two minutes generations later, the predators became
[28,29]. Both the predator and prey robots against the best prey of each of the so efficient in catching the prey that they
(Figure 4, top) were equipped with eight previous five generations, and similarly lost the ability to detect and avoid walls
distance sensors (six on one side and two each prey against the five best predators. (this was due to weak selection pressure
on the other side). However, prey and Starting with populations of 80 predators for wall avoidance because the prey was
predator robots differed in three ways. and 80 prey, each independently tested almost always caught before the predator
First, the maximum speed of the prey was in one-to-one tournaments, ten indepen- would hit a wall) (Figure 4, box 3).
twice that of predator. Second, the dent replicates of 100 generations were Subsequently, the prey evolved a new
predator had an additional vision system carried out in physics-based computer strategy that consisted of waiting for the
predator and moving backward when it
approached (Figure 4, box 4), thus
avoiding being caught. However, this
evasion strategy was not perfect because
the prey could not detect the predator
when approached from the sides without
sensors. A few generations later, the prey
displayed a variation of an earlier strategy
consisting of coasting the walls at maxi-
mum speed. At this point, the predators
evolved a ‘‘spider’’ strategy consisting of
backing against one of the walls and
waiting for the fast-moving prey, whose
sensors could not detect the predator
sufficiently, early to avoid it because its
body reflected less infrared light than the
white walls (Figure 4, box 5). After some
more generations, the prey displayed a
novel variation of the wait-and-avoid
strategy where it quickly rotated in place,
which reduced the probability of being
approached from the sides without sen-
sors. As soon as it detected the predator, it
moved backward while facing it with the
side having the highest number of sensors
(Figure 4, box 6). Overall, these experi-
ments revealed that a large variety of
sophisticated behavioural strategies could
evolve, but none of them were stable over
time because of the coevolutionary dy-
namics. A similar pattern seems to occur
in natural systems where each party in a
coevolutionary relationship exerts selec-
Figure 4. Coevolution of predator and prey robots. A) The predator robot (right) facing the tive pressure on the other, thereby
prey robot (left). B) Six examples of pursuit and evasion strategies that evolved over the 100
affecting each other’s evolution and
generations of selection in one of the replicates (see main text for description). The position of
the prey at the end of the trial is indicated by the empty disk and that of the predator by the leading to a constant evolution of strate-
black disk (the lines in the disks correspond to the frontal directions). gies and counterstrategies between par-
doi:10.1371/journal.pbio.1000292.g004 ties [31,32].

Joint Evolution of Brains and between brain and body morphologies can robots was measured in populations con-
Body Morphologies produce various types of adaptive behav- taining 100 groups of 10 robots each.
iour and morphologies. In one experimental condition, the
Experimental evolution has also been arena contained only large tokens, and
used to coevolve artificial brains and the only way for robots to increase their
Evolution of Cooperation and
morphologies of simulated robots. In a fitness was to cooperate in pushing them
pioneering study, Karl Sims used a Altruism
[37]. Accordingly, robots readily evolved
competitive scenario where the fitness of Experimental evolution was also used to the ability to cooperatively push large
two opponent robots was proportional to investigate whether robots could evolve tokens towards the white wall in all 20
their ability to gain control of a cube cooperative and altruistic behaviour and, evolutionary replicates that were conduct-
placed in the centre of an arena [33]. The if so, under what conditions. Cooperation ed. However, when the arena contained
evolutionary experiments were carried out is defined as an act increasing both the both large and small tokens, the behaviour
solely in physics-based simulations. The direct fitness of the individual giving help of robots was influenced by the group kin
genome of each robot consisted of two and the fitness of the individual receiving structure. In groups of unrelated robots
chromosomes, one encoding the topology help; by contrast, altruism reduces the (i.e., robots whose genomes where not
of a neural network and the other direct fitness of the individual performing more similar within than between groups),
encoding the shape of a body composed the helping act [35,36]. The experimental robots invariably specialised in pushing the
of rigid blocks linked by controllable setup consisted of a foraging situation in a small objects, which was the most efficient
articulations. This led to the coevolution square arena containing ten sugar cube- strategy to maximise their own individual
of different types of robots capable of sized wheeled robots, small tokens that a fitness them (i.e., large tokens provided an
moving towards the cube and preventing single robot could push, and large tokens equal direct payoff as a small token but
access to its opponent. For example, some requiring at least two robots to be pushed were more difficult to successfully push).
robots consisted of a cubic block with two (Figure 6). The robots had five infrared By contrast, the presence of related robots
articulated, arm-like structures, which distance sensors, four of them sensing within groups allowed the evolution of
were used for moving on the ground and objects within a 3-cm range and a fifth, altruism. When groups were formed of
holding the cube. Other robots were which was placed higher, having a 6-cm ‘‘clonal’’ robots all having the same
composed of only two articulated worm- genome, individuals primarily pushed the
range. These sensors allowed robots to
like segments where one segment was so large tokens even though it was costly, in
locate the tokens and distinguish them
large and heavy that, once placed over the terms of individual fitness, for the robots
from robots. Robots were also equipped
cube, it prevented the opponent from
with two vision sensors to perceive the pushing (Video S6).
displacing it.
colours of the walls (Text S1, section 5). Similar results were obtained in exper-
The idea of fully evolvable robot
Their fitness was proportional to the iments where groups of light-emitting,
hardware was taken on by Lipson and
number of tokens successfully pushed foraging robots could communicate the
Pollack [34], who applied Darwinian
within a 4-cm zone along a white wall position of a food source at a cost to
selection to simulated electromechanical
(the three other walls of the arena were themselves because of the resulting in-
systems. As in Sims’ experiments, the
black). A large token successfully pushed creased competition near food. In these
genome of the evolving individuals speci-
along the white wall increased the fitness experiments, robots again readily evolved
fied the morphology of the robot body and
of all robots within a group (10 robots per costly communication when they were
of the neural network. The robot bodies
group) by 1 fitness unit, while a small genetically related, but altruistic commu-
consisted of simple building blocks, such as
token successfully pushed increased the nication never evolved in groups of
bars of variable lengths, joints, and linear
fitness (also by 1 unit) of only the robot unrelated robots when selection operated
actuators. Bars could be coupled with
that pushed it. The fitness of individual at the individual level [38,39].
linear actuators that changed their length
and were connected together through ball
joints to form arbitrary truss structures
with the possibility of both rigid and
articulated substructures. The movements
of the linear actuators were controlled by
the activations of neurons, whose connec-
tions to other neurons and to the linear
actuators were specified along with the
body components in the evolving genomes
(Text S1, section 4). The fitness of a robot
was proportional to the distance it moved
over a flat surface. After 300 generations
of selection with physics-based simula-
tions, the individuals with highest fitness
were fabricated robotically using rapid
manufacturing technology (plastic extru-
sion 3-D printing) and tested in the real
world (Video S5). An example of such a
robot capable of fast locomotion is shown Figure 5. Example of an evolved ‘‘creature’’ created by autonomous design and
in Figure 5. Taken as a whole, these fabrication process. (Image: Hod Lipson).
experiments revealed how the coevolution doi:10.1371/journal.pbio.1000292.g005

Figure 6. Evolution of cooperative foraging. A) Foraging arena containing ten Alice micro-robots (black squares with arrows) and small and
large tokens that robots had to push towards the dashed area near the white wall (the other three walls were painted black). B) Experiment with real
robots.
These experiments are interesting in So far, evolutionary robotic experiments that vary along relevant dimensions (e.g.,
two ways. First, they demonstrate that the have been conducted mostly by computer lighting conditions or ground texture).
same general rules apply for experimental scientists and engineers (e.g., [17,46–56]). Another consists of incorporating noise in
evolution of robots and real organisms. Their primary interest has been to exploit features of the simulation model (e.g.,
Theory predicts that altruism, defined as the power of artificial evolution to auto- elasticity of joints or the physical interac-
an act of helping that decreases the direct matically generate novel or better control tions that occur during collisions) that may
fitness of the individual performing it, systems and body shapes for specific not faithfully reflect the real world. A third
should only evolve among related individ- problems. For example, the method of consists of coevolving the robot and the
uals, and this is also what has been found evolutionary robotics described in the key parameters of the simulation model
in a wide range of organisms, ranging context of cooperative behaviour has been and periodically testing the evolved con-
from bacteria to social insects and social successfully used to generate the control trol system with real robots to improve the
vertebrates (e.g., [40–45]). Second, it systems of a swarm of micro aerial vehicles estimate of the fitness of the robot and
demonstrates that cooperation and altru- that must locate rescuers and spread so as simulator [53]. Finally, a solution that may
ism can evolve even in organisms with to establish a radio communication net- also be relevant from a biological perspec-
simple cognitive abilities (in both the token work based uniquely on signal strength of tive consists of adding ontogenetic plastic-
pushing and communication experiments, the rescuer mobile phones and of the robot ity to the evolving individuals so that they
robots had neural network controllers emitters, a problem for which existing can adapt to environmental modifications
consisting of less than 15 neurons). engineering solutions require the use of arising during their lifetime [60].
absolute geo-localisation information pro- It is only very recently that biologists
Conclusions vided by GPS signals [57]. and cognitive scientists have become
These examples of experimental evolu- A major issue in evolutionary robotics is interested in evolutionary robotics, realis-
tion with robots verify the power of that agents may use idiosyncratic features ing that it provides a powerful means to
evolution by mutation, recombination, and of the environment in which they are study how phenotypes can be shaped by
natural selection. In all cases, robots initially selected to increase performance, hence natural selection and address questions
exhibited completely uncoordinated behav- leading to a major fitness drop in new that are difficult to address with real
iour because their genomes had random environments where these features are organisms. Current topics of biologically
values. However, a few hundreds of gener- lacking. A similar problem arises when motivated research in evolutionary robot-
ations of random mutations and selective the evolutionary process takes place in ics include the role of ontogenetic devel-
reproduction were sufficient to promote the simulations failing to capture relevant opment (e.g., [61]), the principles of neural
evolution of efficient behaviours in a wide physical aspects of the environment. In control of highly dynamic and elastic body
range of environmental conditions. The this case, the evolved individuals do not morphologies such as passive robotic
ability of robots to orientate, escape pred- operate well in the real world [58,59]. walkers (e.g., [62,63]), the functional role
ators, and even cooperate is particularly Computer scientists and engineers have of morphology in coevolving bodies and
remarkable given that they had deliberately come up with various solutions to this brains [64], the role of active perception as
simple genotypes directly mapped into the problem (for a recent review, see [4]). One a mean to structure and simplify sensory
connection weights of neural networks consists of measuring the fitness of evolv- information in behaving organisms
comprising only a few dozen neurons. ing individuals in several environments [65,66], and the effects of synaptic plas-

ticity [60,67,68] and neuromodulation Video S1 Method for evolving the Found at: doi:10.1371/journal.pbio.
[69] on organisms evolving in rapidly neural network of a robot. Valid gene 1000292.s004 (9.23 MB MPG)
changing and partially unpredictable en- sequences are extracted (magnifying lens)
Video S4 Coevolved predator and
vironments (i.e., under situations where from a binary string representing the genome
prey robots engaged in a tourna-
individuals benefit to change behaviour of the robot. Those genes are translated into
ment. After locating and moving to-
over time). In particular, the incorporation neurons of different type (colour) according to wards the prey, the predator cannot reach
of adaptive mechanisms during ontogeny the genetic specifications, such as sensory, it because the prey can perceive it with
mediated by phenotypic plasticity and motor, excitatory, or inhibitory neurons. The the rear distance sensors and moves
learning (e.g., [70]) provides promising corresponding neural network is connected to faster.
avenues for the study of processes operat- the sensors and motors of the robot and the Found at: doi:10.1371/journal.pbio.
ing at different spatial and temporal scales. resulting behaviour of the robot is measured 1000292.s005 (1.89 MB MOV)
In comparison to theoretical and numer- according to the fitness function. The
ical models of biological phenomena, the genomes of the individuals that had the worst Video S5 Coevolution of body and
embodiment and behavioural features of performance are discarded from the popula- brain in a robotic machine. Please
robot models can result in stronger testing tion (symbolically thrown in a dustbin) switch the audio on to listen to the com-
of hypotheses and in higher predictive power whereas the genomes of the best individuals mentary. Video courtesy of Hod Lipson, also
[71–73]. The use of real robot features are are paired and crossed over with small available from http://www.mae.cornell.edu/
particularly useful in an evolutionary per- random mutations to generate new offspring Lipson/.
spective where behaviour and ensuing com- (the process of selective reproduction is Found at: doi:10.1371/journal.pbio.
plex physical interactions can significantly symbolically shown to occur in a ‘‘mother 1000292.s006 (4.97 MB MOV)
affect the interaction with the environment robot’’). After several generations of selective Video S6 Evolution of altruistic
and performance. Therefore, evolutionary reproductions with mutations, robots display cooperation in a team of clonal
robotics also offers new opportunities to better or novel behaviours. ‘‘Alice’’ robots. In the initial genera-
address issues such as sexual selection,
Found at: doi:10.1371/journal.pbio. tion, the robots can hardly perform
division of labour, speciation, and, in general,
1000292.s002 (7.82 MB MOV) coordinated navigation. After 240 genera-
the open-ended evolution of diversity and
Video S2 Evolution of collision-free tions of Darwinian selection, most robots
complexity in behavioural systems. Interdis-
navigation. In the initial generations, search for large food tokens and cooperate
ciplinary collaborations among engineers,
robots can hardly avoid walls (one robot to push them towards the region of the
evolutionary biologists, neuroscientists, and
arena under the white wall.
molecular biologists should prove fruitful to even approaches objects). After 50 gener-
investigate important issues on the principles ations, robots can navigate around the Found at: doi:10.1371/journal.pbio.
that mediate the evolution of adaptive 1000292.s007 (10.21 MB MPG)
looping maze without hitting the walls.
behaviour of organisms that cannot be Found at: doi:10.1371/journal.pbio.
readily studied with standard methods. 1000292.s003 (8.15 MB MOV) Acknowledgments
Video S3 Evolved ‘‘Khepera’’ robot We thank Kevin Foster, Tad Kawecki, Hod
Supporting Information performing exploration and homing
Lipson, Sara Mitri, Karen Parker, Rolf Pfeifer,
Francis Ratnieks, Ken Ross, and Steffen
Text S1 Supplementary methods. for battery recharge. The robot enters Wischmann for useful comments on the man-
Found at: doi:10.1371/journal.pbio. the recharging area approximately 2 uscript, and Daniel Marbach for help with
1000292.s001 (6.50 MB DOC) seconds before full battery discharge. figures.
References
1. Cliff D, Husbands P, Harvey I (1993) Explora- 9. Lenski RE, Ofria C, Collier TC, Adami C (1999) intelligence. Ann ArborMI: University of Michi-
tions in evolutionary robotics. Adapt Behav 2: Genome complexity, robustness, and genetic gan Press.
73–110. interactions in digital organisms. Nature 400: 17. Lewis MA, Fagg AH, Solidum A (1992) Genetic
2. Nolfi S, Floreano D (2000) Evolutionary robotics: 661–664. programming approach to the construction of a
the biology, intelligence, and technology of 10. Lenski RE, Ofria C, Pennock RT, Adami C neural network for control of a walking robot.
self-organizing machines. CambridgeMA: MIT (2003) The evolutionary origin of complex Proceedings of the IEEE International Confer-
Press. features. Nature 423: 139–144. ence on Robotics and Automation (ICRA ’92). pp
3. Miller JF, Job D, Vassilev VK (2000) Principles in 11. Wilke CO, Wang J, Ofria C, Lenski RE, Adami C 2618–2623.
the evolutionary design of digital circuits – part I. (2001) Evolution of digital organisms at high 18. Floreano D, Mondada F (1994) Automatic crea-
Genetic Programming and Evolvable Machines mutation rate leads to survival of the flattest. tion of an autonomous agent: genetic evolution of a
1: 7–35. Nature 412: 331–333. neural network driven robot. In: Cliff D,
4. Floreano D, Husbands P, Nolfi S (2008) Evolu- 12. Clune J, Misevic D, Ofria C, Lenski RE, Husbands P, Meyer JA, Wilson S, eds. Proceed-
tionary Robotics. In: Siciliano B, Khatib O, eds Elena SF, et al. (2008) Natural selection fails to ings of third International Conference on Simula-
(2008) Springer handbook of robotics. Berlin: optimize mutation rates for long-term adaptation tion of Adaptive Behavior: From Animals to
Springer Verlag. pp 1423–1451. on rugged fitness landscapes. PLoS Computa- Animats 3. CambridgeMA: MIT Press. pp
5. Floreano D, Mattiussi C (2008) Bio-inspired tional Biology 4: e1000187. doi:10.1371/journal. 421–430.
artificial intelligence: theories, methods, and pcbi.1000187. 19. Harvey I, Husbands P, Cliff D (1994) Seeing the
technologies. CambridgeMA: MIT Press. 13. Turing AM (1950) Computing machinery and light: artificial evolution, real vision. In: Cliff D,
6. Harvey I, Di Paolo E, Wood R, Quinn M, intelligence. Mind 49: 433–460. Husbands P, Meyer JA, Wilson S, eds. Proceed-
Tuci E (2005) Evolutionary robotics: a new 14. Rechenberg I (1965) Cybernetic solution path of ings of the third International Conference on
scientific tool for studying cognition. Artif Life an experimental problem. Royal Air Force Simulation of Adaptive Behavior: From Animals to
11: 79–98. Establishment. 1122 p. Animats 3. CambridgeMA: MIT Press. pp
7. Ray TS (1992) An approach to the synthesis of 15. Fogel LJ, Owens AJ, Walsh MJ (1966) Artificial 392–401.
life. In: Langton CG, Taylor C, Farmer DJ, intelligence through simulated evolution. New 20. Floreano D, Dürr P, Mattiussi C (2008) Neuroe-
Rasmussen S, eds. Proceedings of the Second York, NY: John Wiley. volution: from architectures to learning. Evol
Workshop on Artificial Life. pp 371–408. 16. Holland JH (1975) Adaptation in natural and Intelligence 1: 47–62.
8. Adami C (1998) Introduction to artificial life. artificial systems: an introductory analysis with 21. Featherstone R (2000) Robot dynamics: equations
New York: Springer Verlag. applications to biology, control, and artificial and algorithms. Proceedings of IEEE Interna-

tional Conference on Robotics and Automation workers: A test of kin selection theory. Science 58. Brooks RA (1992) Artificial life and real robots.
(ICRA ’00). pp 826–834. 274: 993–995. In: Varela FJ, Bourgine P, eds. Toward a Practice
22. Schmajuk NA, Blair HT (1993) Place learning 42. Pfennig DW, Collins JP (1993) Kinship affects of Autonomous Systems: Proceedings of the First
and the dynamics of spatial navigation: A neural morphogenesis in cannibalistic salamanders. Na- European Conference on Artificial Life. Cam-
network approach. Adapt Behav 1: 353–385. ture 362: 836–838. bridgeMA: MIT Press. pp 3–10.
23. Burgess N, Donnett JG, Jeffery KJ, O’Keefe J 43. Langer P, Hogendoorn K, Keller L (2004) Tug- 59. Jakobi N, Husbands P, Harvey I (1995) Noise and
(1997) Robotic and neuronal simulation of of-war over reproduction in a social bee. Nature the reality gap: The use of simulation in
hippocampal navigation. Philos Trans R Soc 428: 844–847. evolutionary robotics. In: Moran F, Moreno A,
Lond B Biol Sci 352: 1535–1543. 44. West SA, Murray MG, Machado CA, Griffin AS, Merelo J, Chacon P, eds. Advances in Artificial
24. Healy S, ed (1998) Spatial representations in Herre EA (2001) Testing Hamilton’s rule with Life: Proceedings of the Third European Confer-
animals. Oxford: Oxford University Press. competition between relatives. Nature 409: ence on Artificial Life. London: Springer-Verlag.
25. Floreano D, Mondada F (1996) Evolution of 510–513. pp 704–720.
homing navigation in a real mobile robot. IEEE 45. West SA, Diggle SP, Buckling A, Gardner A, 60. Urzelai J, Floreano D (2001) Evolution of
Trans Syst Man Cybern B Cybern 26: 396–407. Griffin AS (2007) The social lives of microbes. adaptive synapses: robots with fast adaptive
26. O’Keefe J, Nadel L (1978) The hippocampus as a Annu Rev Ecol Evol Syst 38: 53–77. behavior in new environments. Evol Comput 9:
cognitive map. Oxford: Clarendon Press. 46. Gallagher JC, Beer RD, Espenschied KS, 495–524.
27. Taube JS, Muller RU, Ranck JBJ (1990) Head- Quinn RD (1996) Application of evolved loco-
61. Bongard J, Pfeifer R (2003) Evolving complete
direction cells recorded from the postsubiculum in motion controllers to a hexapod robot. Rob
agents using artificial ontogeny. In: Hara F,
freely moving rats. I. Description and quantitative Auton Syst 19: 95–103.
Pfeifer R, eds. Morpho-functional Machines: The
analysis. J Neurosci 10: 420–435. 47. Gomi T, Ide K (1998) Emergence of gaits of a
New Species: Designing Embodied Intelligence.
28. Floreano D, Nolfi S (1997) God save the red legged robot by collaboration through evolution. In:
queen! Competition in co-evolutionary robotics. Sugisaka M, ed. Proceedings of the International Berlin: Springer-Verlag. pp 237–258.
In: Koza JR, Deb K, Dorigo M, Fogel DB, Symposium on Artificial Life and Robotics. Berlin: 62. Reil T, Husbands P (2002) Evolution of central
Garzon M, et al. (1997) Genetic Programming Springer Verlag. pattern generators for bipedal walking in a real-
1997: Proceedings of the Second Annual Confer- 48. Gruau F, Quatramaran K (1996) Cellular encoding time physics environment. IEEE Trans Evol
ence. San FranciscoCA: Morgan Kaufmann. pp for interactive evolutionary robotics. In: Husbands P, Comput 6: 159–168.
398–406. Harvey I, eds. Fourth European Conference on 63. Vaughan ED, Di Paolo E, Harvey IR (2004) The
29. Nolfi S, Floreano D (1998) Co-evolving predator Artificial Life. CambridgeMA: MIT Press. pp evolution of control and adaptation in a 3D
and prey robots: do ‘‘arm races’’ arise in artificial 368–377. powered passive dynamic walker. In: Pollack J,
evolution. Artif Life 4: 311–335. 49. Kodjabachian J, Meyer J-A (1998) Evolution and Bedau M, Husbands P, Ikegami T, Watson R, eds.
30. Floreano D, Nolfi S, Mondada F (1998) Com- development of neural controllers for locomotion, Proceedings of the Ninth International Conference
petitive co-evolutionary robotics: from theory to gradient-following, and obstacle-avoidance in on the Simulation and Synthesis of Living Systems
practice. In: Pfeifer R, ed. Proceedings of the fifth artificial insects. IEEE Trans Neural Netw 9: MIT Press. pp 139–145.
International Conference on Simulation of Adap- 796–812. 64. Pfeifer R, Bongard JC (2006) How the body
tive Behavior: From Animals to Animats 5. 50. Hornby GS, Lipson H, Pollack JB (2003) shapes the way we think. CambridgeMA: MIT
CambridgeMA: MIT Press. pp 515–524. Generative representations for the automated Press.
31. Van Valen L (1973) A new evolutionary law. Evol design of modular physical robots. IEEE Trans 65. Bianco R, Nolfi S (2004) Evolving the neural
Theory 1: 1–30. Rob Autom 19: 709–713. controller for a robotic arm able to grasp objects
32. Bell G (1982) The masterpiece of nature: the 51. Quinn M, Smith L, Mayley G, Husbands P on the basis of tactile sensors. Adapt Behav 12:
evolution and genetics of sexuality. Berkeley: (2003) Evolving controllers for a homogeneous 37–45.
University of California Press. system of physical robots: structured cooperation 66. Suzuki M, Floreano D, Di Paolo E (2005) The
33. Sims K (1994) Evolving 3D morphology and with minimal sensors. Philos Trans Phys Sci Eng contribution of active body movement to visual
behavior by competition. In: Brooks R, Maes P, 361: 2321–2344. development in evolutionary robots. Neural Netw
eds. Artificial Life IV. Cambridge, MA: MIT 52. Dorigo M, Trianni V, Sahin E, Groß R, 18: 656–665.
Press Bradford Books. pp 28–39. Labella TH, et al. (2004) Evolving self-organizing 67. Di Paolo EA (2003) Evolving spike-timing-depen-
34. Lipson H, Pollack JB (2000) Automatic design behaviors for a swarm-bot. Auton Robots 17: dent plasticity for single-trial learning in robots.
and manufacture of robotic lifeforms. Nature 406: 223–245. Philos Trans Phys Sci Eng 361: 2299–2319.
974–978. 53. Bongard J, Zykov V, Lipson H (2006) Resilient 68. Niv Y, Joel D, Meilijson I, Ruppin E (2002)
35. Hamilton WD (1964) The genetical evolution of machines through continuous self-modeling. Sci- Evolution of reinforcement learning in uncertain
social behaviour. J Theor Biol 7: 1–52. ence 314: 1118–1121. environments: A simple explanation for complex
36. Lehmann L, Keller L (2006) The evolution of 54. Trianni V, Nolfi S, Dorigo M (2006) Cooperative
foraging behaviors. Adapt Behav 10: 5–24.
cooperation and altruism: A general framework hole avoidance in a swarm-bot. Rob Auton Syst
69. Philippides A, Husbands P, Smith T, O’Shea M
and a classification of models. J Evol Biol 19: 54: 97–103.
(2005) Flexible couplings: diffusing neuromodula-
1365–1379. 55. Tellez R, Angulo C, Pardo D (2006) Evolving the
tors and adaptive robotics. Artif Life 11: 139–160.
37. Waibel M, Keller L, Floreano D (2009) Genetic walking behaviour of a 12 DOF quadruped using
team composition and level of selection in the a distributed neural architecture. In: Ijspeert AJ, 70. Parisi D, Nolfi S (2001) Developmental in neural
evolution of cooperation. IEEE Trans Evol Masuzawa T, Kusumoto S, eds. Proceedings of networks. In: Mukesh JP, Honavar V,
Comput 13: 648–660. the International Workshop on Biologically Balakrishan K, eds. Advances in Evolutionary Syn-
38. Floreano D, Mitri S, Magnenat S, Keller L (2007) Inspired Approaches to Advanced Information thesis of Neural Networks. Cambridge, MA: MIT
Evolutionary conditions for the emergence of Technology. Berlin: Springer-Verlag. pp 5–19. Press. pp 215–246.
communication in robots. Curr Biol 17: 514–519. 56. Watson RA, Ficici SG, Pollack JB (2002) 71. Ijspeert AJ, Crespi A, Ryczko D, Cabelguen J-M
39. Mitri S, Floreano D, Keller L (2009) The Embodied evolution: distributing an evolutionary (2007) From swimming to walking with a
evolution of information suppression in commu- algorithm in a population of robots. Robotics and salamander robot driven by a spinal cord model.
nicating robots with conflicting interests. PNAS Autonomous Systems 39: 1–18. Science 315: 1416–1420.
106: 15786–15790. 57. Hauert S, Zufferey J-C, Floreano D (2009) 72. Pfeifer R, Lungarella M, Iida F (2007) Self-
40. Sherman PW (1977) Nepotism and the evolution Reverse-engineering of artificially evolved con- organization, embodiment, and biologically in-
of alarm calls. Science 197: 1246–1253. trollers for swarms of robots. Proceedings of the spired robotics. Science 318: 1088–1093.
41. Sundström L, Chapuisat M, Keller L (1996) IEEE Congress on Evolutionary Computation. 73. Webb B (2002) Robots in invertebrate neurosci-
Conditional manipulation of sex ratios by ant pp 55–61. ence. Nature 417: 359–363.

Adaptative Behaviour in Robots

Diunggah oleh

Informasi Dokumen

Deskripsi Asli:

Hak Cipta

Format Tersedia

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Hak Cipta:

Format Tersedia

Adaptative Behaviour in Robots

Diunggah oleh

Hak Cipta:

Format Tersedia

Essay

Evolution of Adaptive Behaviour in Robots by Means of

PLoS Biology | www.plosbiology.org 1 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 2 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 3 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 4 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 5 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 6 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 7 January 2010 | Volume 8 | Issue 1 | e1000292

PLoS Biology | www.plosbiology.org 8 January 2010 | Volume 8 | Issue 1 | e1000292

Anda mungkin juga menyukai