Complex models of innovation on rugged technology landscapes Endogenous evolutionary models of economic growth, and comparison with neoclassical rational expectations models with learning Applied econometrics of complex systems:
Long memory Skewed, fat-tailed distributions (e.g. Pareto vs. Lognormal) Spatio-temporal clustering Graph-theoretic properties of technological trajectories
Percolating Complexity: Generating the Complex Patterns of the Innovation Process from a Simple Probabilistic Lattice, with some Empirical Illustrations
Gerald Silverberg MERIT, Maastricht University based on joint work with Bart Verspagen ECIS, Eindhoven University of Technology
6. There appears to be a certain arbitrariness in the path actually chosen, which could be the result of small events (path dependence or neutral theory?) and cultural and institutional biases (social construction of technology?).
7. Incremental improvements tend to follow upon radical innovations according to rather regular laws (learning curves).
Raw data and four fitted regression models for supersample data. Pure Poisson models are rejected against negative binomial (overdispersed) models
1760
1780
1800
1820
raw data
1840
1860
fit nb
1880
1900
fit Poisson
1920
1940
1960
1980
10
Citations + 1
10
Source: Scherer, Harhoff and Kukies, 2000, Uncertainty and the size distribution of rewards from innovation, JEE, 10: 175-200.
Hill Estimator: Consider n observations of a random variable Xi, and denote by X[i] the order statistics X[1]X[2] X[n]. Then the Hill estimator is defined as follows:
H ( k , n)
k 1 / k (ln X [i ] i 1
ln X [ k 1] ).
1.20
3.00
1.00
2.50
0.80
2.00
alpha
0.60
alpha
1 10 k 100
1.50
0.40
1.00
0.20
0.50
0.00
Hill estimator applied to Harvard University patent portfolio data used in Scherer (1998) (left), and to Trajtenbergs (1990) patent citation data (right).
Percolation diagram in technology-performance space. Lattice sites are filled at random. A site is viable when it connects to the baseline.
Probability of a random site being on the infinite cluster P as a function of the percolation probability q
pc
Convergence, divergence and shortcuts, and two methods of defining a technology's competitiveness.
New innovations are generated with probability p in a region d units above and below the technological frontier.
Near disjoint regions represent inventions, far off discoveries science, and clusters that can never be connected to the baseline science fictions.
A cluster of simultaneous invention occurs when a disjoint island of invention is suddenly joined to the frontier by a single 'cornerstone' innovation.
Deadlocking Statistics
10
10 9 8 7 6 5 4 3 2 1 0
4
9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 search radius
0.673
10 13 16
Number of runs, which deadlock out of batches of ten for different values of the search radius. q=0.593.
Number of deadlocked runs out of ten as a joint function of the search radius and the percolation probability q.
19
sea
rch radi u
0.698
The mean height of the BPF attained after 5000 periods as a function of the search radius and percolation probability q.
350 300 250 200 mean height 150 100 50 0 0.733 0.703 0.673 0.643 0.613 0.583 0.553 q
10
13
16
0.523
search radius
19
Size distribution of innovations (left) and rank-order distribution (right, double-log scale), q=0.603, m=10.
200 180 160
100
number of cases
120 100 80 60 40 20 0
innovation size
1 7 13 19 25 31 37 43 49 55 61 67 73 79 85 91 97
140
10
innovation size
Hill estimator of Pareto for innovation distribution generated with q=0.645 and m=5 plotted on a double-log scale for values of k up to 90% of number of observations
LD Plot q=0.6
LD plot for innovation distribution generated with q=0.60 and m=5 (original data and aggregated data in blocks of 10, 100 and 200 observations).
LD plot for innovation distribution generated with q=0.645 and m=5 (original data and aggregated data in blocks of 10, 100 and 200 observations).
LD plot for innovation distribution generated with q=0.695 and m=5 (original data and aggregated data in blocks of 10, 100 and 200 observations).
Temporal clustering: Innovation count time series for a threshold of 2 (left) and 10 (right) for a run with search radius 5.
12 5
10 4
8 3
2 4
1 2
Arrival rate and overdispersion index for search radius m and threshold theta for radical innovations
0.25
Arrival Rate
100 50 0 76 54 3 21 m 3 5 20
20 3 2
98
theta
Overdispersion
0.2
150
Space-time clustering of cylinder sizes of Cornish steam engines (Nuvolari and Verspagen 2003)
Conclusions
We can maximize our ignorance of technology space by percolating it with an exogenous percolation probability q We can impose the cumulativeness condition by requiring viable technologies to trace a path back to the baseline We can impose blindness and localness of the R&D search process by using m-neighborhoods of the BPF with uniform prob of testing sites Nevertheless we obtain, instead of a completely random innovation process, spatial and temporal clustering of innovations Innovation size distributions are highly skewed and possibly fat tailed. Near the critical q they appear to be Pareto, for higher q probably lognormal
Outlook
Endogenize R&D effort and targeting using agent-based model Endogenize q along lines of Bak-Sneppen model