Reconfigurable Computing
1 Introduction
2 Drawbacks of FPL
The area overhead of FPL is largely inherent to its flexibility. The counter-argument
is that FPL is re-usable, in the context of reconfiguration. The same piece of silicon,
re-used repeatedly for different circuit implementations, can justify the area penalty it
implies for a single implementation. Re-usability through reconfiguration is the only
justification for the silicon overhead of FPL implementations, to the extent that the
FPL resources are used to implement a number of different digital circuits throughout
the device’s lifetime. For high-volume electronics, however, hardware
reconfigurability is not an issue when: (1) The algorithms a computing device must
run during its operating lifetime are known at design-time. In this case, ASICs are
more cost-effective (see the discussion in [3]); (2) Standard programmable
architectures can fulfil the performance requirements. These architectures can be
customised before fabrication [6], which is claimed to be viable even for low-volume
production [7]. The stability of the hardware platform makes application
programming tools much easier to develop for standard programmable platforms.
Reconfigurable Computing platforms are way behind programmable architectures
(and their well-developed compiler technology counter-part) in terms of programming
friendliness.
The closer an RC platform is to the standard programmable architectures, the
greater are the possibilities of adapting standard compiler technology to make
application programming less of a problem. Tightly-coupled platforms are suggested
as good candidates (Section 3) in this aspect.
The less a device is targeted at a specific application, the more unknown-at-design-
time algorithms it must run in operation, and the better it can benefit from hardware
reconfigurability. See Figure 1. However, the performance deficiency aspect of FPL
(Section 2) must also be taken into account.
Once the usefulness of reconfigurability has been verified, there are yet other issues to
look at. As the enabling technology of the RC paradigm, FPL is a promising solution
for a wide range of problems. A solution, however, that comes in different flavours.
The choice of the right flavour for the right problem is not always obvious.
Mapping a multiply-rich application segment onto a general-purpose FPL
architecture is like using a hammer to tighten a screw. If the target applications are
known to be biased towards a certain kind of computation, a suitable FPGA
architecture can be chosen that performs best (and with the least silicon overhead) for
that particular kind of computation. For instance, “island-style” FPGAs [4], like the
Xilinx XC4000 family, have arbitrary long-distance communication lines suitable for
complex, irregular random logic. In contrast, fine-grained “cellular-style” FPGAs [4],
like the Atmel’s AT6K family, are better suited for highly local, pipelined circuits
such as systolic arrays. Hauck [5] discusses those issues thoroughly.
Architectural optimisations that improve FPL performance for regular DSP
arithmetic have been developed more extensively in the Academia, in the form of
coarse-grained FPGAs [9][10] (or “chunky functional units”, as in [1]). Hard-wired
computing cores as ALUs or multipliers are embedded into the framework of a
reconfigurable interconnect matrix. This allows for a boost in performance and a
reduction in the area overhead for the target applications. The loss in flexibility, in
turn, renders chunky units inefficient for irregular bit-wise computations. Another
limitation is that reduction of order is no longer possible.
Generally speaking, the FPL architecture can be fine-tuned towards a specific set
of applications (a domain) by varying the degree of flexibility of the interconnect and
the logic blocks, and by specific performance-enhancing features. This fine-tuning, in
turn, usually renders the FPL inefficient for other application domains.
Returning to our point regarding reconfigurable general-purpose processors
(Section 4), it is likely that any such device would be required to run as much DSP-
like computing kernels as anything else, due to its broad application nature. General-
purpose computing, however, is the extreme of a spectrum that goes all the way down
to ASIC devices in the opposite extreme. A fundamental dilemma in RC then
becomes clear in Figure 1. In our view, the essential design challenge is to find an
application domain wide enough to justify hardware reconfigurability, while specific
enough to allow for proper fine-tuning of the FPL resources. Different domains may
require different FPL flavours and different integration methods (see Section 3).
7 Wrap-Up
13 Conclusions
Reconfigurable Computing (RC) implies a number of trade-offs in terms of: (1) the
demand for hardware reconfigurability, (2) the choice of the correct FPL architecture
for the kind of computations at hand, (3) the silicon overhead of a “computing in
space” approach, and (4) special requirements related to the target applications. These
trade-offs spawn a 4-dimensional think-model that can help evaluate the cost-
effectiveness of RC approaches. The RC paradigm is not a cost-effective solution
where the trade-offs do not lead to a commercial edge.
References
n o W. H. Mangione et al. “Seeking Solutions in Configurable Computing”, Computer,
pq 30(12), pp. 38-43, December 1997.
G. Brebner. “Field-Programmable Logic: Catalyst for New Computing Paradigms”, Proc.
rts Of Field-Programmable Logic and Applications, pp. 49-58, Estonia, 1998.
R. Wilson. “Large PLDs face big uncertainties”, EE Times, Issue 1050, March 1st, 1999.
u v http://www.techweb.com/se/directlink.cgi?EET19990301S0002
wtx S. Trimberger. “Field-Programmable Gate Array Technology”, Kluwer, MA, 1994.
S. Hauck. “The Roles of FPGA’s in Reprogrammable Systems”, Proc. of the IEEE,
ytz Volume 86, Issue 4, April 1998.
D. Bursky. “Tool Suite Enables Designers to Craft Customized Embedded Processors”,
{t| Electronic Design, pp. 33-38, February 8, 1999.
Wolfe. “HP lays foundation for embedded’s future”, EDTN Network, March 1st, 1999.
}t~ http://www.edtn.com/story/tech/OEG19990226S0010-R
B. Kastrup et al. “ConCISe: A Compiler-Driven CPLD-Based Instruction Set
Accelerator”, Proc. IEEE Symp. on Field-Programmable Custom Computing Machines,
t Napa Valley, April 1999.
R. W. Hartenstein et al. “Using the KressArray for Configurable Computing”, Proc. of
t SPIE, Vol. 3526, Boston, MA, November 2-3, 1998.
C. Ebeling et al. “RaPiD – Reconfigurable Pipelined Datapath”, Proc. Of Field-
t
lNNN
Programmable Logic and Applications, 1996.
et al. “A Survey of Reconfigurable Computing Architectures”, Proc. Of Field-
Programmable Logic and Applications, pp. 376-385, Estonia, 1998.