CHAMELEON CHIP
PRESENTED BY DHANYA.I.G
FINAL YEAR DIPLOMA IN ELECTRONICS ENGINEERING 2011 2012
CERTIFICATE
This is to certify that the bonafide report of the seminar conducted on 28/01/2012 in CHAMELEON CHIP done by DHANYA.I.G (89040655) student of final year Diploma in Electronics Engineering, during the academic year 2011 2012 in partial fulfillment of the requirement for the award of diploma in Electronics Engineering conducted by the directorate of Technical Education Govt. of Kerala
Guided by:
SUHARA .K. M
Head of section Department of Electronics Engineering Govt. Womens Polytechnic TVM
External examiner
Internal examiner
AKNOWLEDGEMENT
I wish to express my sincere thanks to Suhara, Head of the department of Electronics Engineering, Govt. Polytechnic Thiruvananthapuram for the encouragement & supervision at all stages of my seminar. I am also greatful to all faculties of the department of Electronics Engineering for being a constant source of inspiration & allowing me to utilize the facilities of the department. Thanks are also to my friends who have extended their help & timely support during the course of my seminar.
DHANYA.I.G
ABSTRACT
Today, designing a chip crosses too many architectural boundaries. Nobody has figured out a way to get a chip to meet all the criteria for the ultimate consumer device. But we might be getting closer. Now a new kind of chip adapts to any programming task by effectively erasing its hardware design and regenerating new hardware design that is perfectly suited to run the software at hand. These chips are referred to as reconfigurable processors. These new chips are able to rewire themselves on the fly to create the exact hardware needed to run a piece of software at the utmost speed. This new chip is called CHAMELEON CHIP.
CONTENTS
1. INTRODUCTION 2. MULTIFUNTION IMPLEMENTATION 3. THE GENERAL ARCHITECTURE OF RECONFIGURABLE PROSESSOR 4. ARCHITECTURE 5. RECONFIGURABLE PROCESSING FABRIC 6. PROGRAMMEBLE I/O 7. EMBEDDED PROCESSOR SYSTEM 8. TECHNOLOGIES USED IN CHIP 9. DESIGN PROCESS 10. COMPARISON WITH OTHER TECHNOLOGIES 11. ADVANTAGES 12. DISADVANTAGES 13. APPLICATIONS 14. CONCLUSION 15. REFERENCES:
INTRODUCTION A reconfigurable processor is a microprocessor with erasable hardware that can rewire itself dynamically. This allows the chip to adapt effectively to the programming tasks demanded by the particular software they are interfacing with at any given time. Ideally, the reconfigurable processor can transform itself from a video chip to a central processing unit (CPU) to a graphics chip, for example, all optimized to allow applications to run at the highest possible speed. These chips are like providing a "chip on demand." In practical terms, this ability can translate to immense flexibility in terms of device functions. For example, a single device could serve as both a camera and a tape recorder (among numerous other possibilities): you would simply download the desired software and the processor would reconfigure itself to optimize performance for that function. According to a recent Red Herring magazine article, that type of device versatility may be available by 2003. Reconfigurable processor chip usually contains several parallel processing computational units known as functional blocks. These functional blocks are connected in all the possible way. While
6
reconfiguring the chip, the connections inside the functional blocks and the connections in between the functional blocks are changing.
That means when particular software is loaded the present hardware design is erased and a new hardware design is generated by making a particular number of connections active while making others idle. This will define the optimum hardware configuration for that particular software. The key to the design is the small size of each processing element. The smallest segments of the chip can be defined with just 50 bits of software code, so the entire chip can be reprogrammed with just 50,000 bits of software description. It takes just 20 microseconds to reconfigure the entire processing array. Reconfigurable processors are currently available from Chameleon Systems, Billions of Operations (BOPS), and PACT (Parallel Array Computing Technology). Among those only Chameleon is providing a design environment, which allows customers to convert their algorithms to hardware configuration by themselves.
MULTIFUNCTION IMPLEMENTATION
In a conventional ASIC or FPGA, multiple algorithms are implemented as separate hardware modules. Four algorithms would divide the chip into four functional areas. With Reconfigurable Technology, the four algorithms are loaded into the entire reconfigurable Fabric one at a time. First, the entire Fabric is dedicated to algorithm 1; during this processing time, algorithm 2 is loaded into the background place. In a single clock cycle, the entire Fabric is swapped to algorithm 2; during this processing time, algorithm 3 is loaded into the background plane. The entire reconfigurable fabric is dedicated to just one algorithm at a time. The result: much higher performance, lower cost and lower power consumption
MULTIFUNCTION IMPLEMENTATION
The structure of Reconfigurable chip is designed in some developmental tool .The various possible connections between functional blocks are encoded to bits known as Configuration bits. Resulting configuration stream is downloaded into configuration memory through configuration inputs. Thus, a new reconfigurable machine is established.
ARCHITECTURE
The Chip incorporates three core architectural technologies:
1) A Complete 32 bit Embedded Processor system It provides all of the basic building blocks for a complete system: a 32-bit ARC processor, 32-bit interface, and 64-bit high-performance memory controller. These fully integrated and fully verified modules simplify design, debug and verification. 2) A high-performance 32-bit Reconfigurable Processing Fabric (RPF) The RPF has 108 parallel computation units, providing tremendous computational power. This is where the "heavy lifting" (Rec Roadrunner Bus links these system modules. This 128-bit, split-transaction bus provides 2GByte/sec on-chip bandwidth amongst the subsystems in the Embedded Processor System and the RPF. 3) Instantaneous reconfigurability These core technologies combine to eliminate the performance/flexibility compromise, exploit platform-based design and enable you to implement your own algorithms to differentiate your product
10
ARCH
32 BIT PCI BUS PCI CONTROLLER RECONFIGURABLE PROCESSING FABRIC (RPF)
The Fabric (RPF or Fabric) provides unmatched algorithmic computation power to Chameleon Chip. It consists of 84,32-bit Data path Units and 24, 16x24-bit Multipliers. Operating at 125Mhz, they provide up to 3,000 16-bit Million Multiply-Accumulates Per Second and 24,000 16-bit Million Operations Per Second. The fabric is divided into Slices, the basic unit of reconfiguration. The CS2112 includes four Slices, each of which can be independently reconfigured. Each Slice consists of three Tiles. The Tile is built with 32-bit Data path Units, 16x24-bit Single-
128 B
CONFIGURATION
11
Cycle Multipliers, Local Store Memories, and Control Logic Units. The Dynamic Interconnect connects the modules within the fabric.
12
The DPU includes a 32-bit real-time Barrel Shifter for shifting operations. The DPU also includes two 32-bit AND/OR Mask operators. At the heart of the DPU is the 32-bit Operator, which directly implements all C and Verilog operators. The Operator supports number calculation shifting and bit-field masking data operation modes. signed/unsigned
DAT
The Tile includes two 16x24-bit single-cycle multipliers. With a total of 24 multipliers, the CS2112 delivers 3,000 Million Multiply-Accumulates per Second.
ROUTING M UX
13
Dynamic Interconnect
The Fabric provides 100% routability Connecting embedded processor system with the RPF results in Dynamic Interconnect: routes can be changed one a clock-byclock basis for flexible and optimal dataflow.
PROGRAMMABLE I/O
RCP includes banks of Programmable I/O (PIO) pins which provide tremendous bandwidth. Each PIO bank of 40 PIO pins delivers 0.5 GBytes/sec I/O bandwidth.
14
The Embedded Processor Systems provides all of the basic building blocks for a complete system. These fully integrated and fully verified modules simplify design, debug and verification. This integrated system platform consists of:
32-bit ARC Processor The Processor delivers 120 MIPS at 125 MHz and it employs 64 general-purpose 32-bit registers and a 32-bit address space. It includes a 4 Kbytes instruction cache and a 4 Kbytes data memory. 32-bit PCI ControllerInterface to PCI bus
DMA SubsystemIt supports 16 DMA Channels, transferring data between the modules in the Embedded Processor System and to/from the Local Store Memories.
Configuration Subsystem
The Configuration Subsystem includes the Configuration Controller and the transferring configuration data from off-chip memory through the 64-bit Controller to the Background Configuration Plane. This transfer can take place during full-speed operation of the Fabric, loading a new configuration while the prior configuration is running on the Fabric two
15
1.
eCONFIGURABLE TECHNOLOGY
eConfigurable Technology is used for instantaneous reconfiguration. This
technology reconfigures fabric in one clock cycle and increases voice/data/video channels per chip. As mentioned earlier, each Slice can be configured independently. Loading the Background Plane from external memory requires just 3 sec per Slice; this operation does not interfere with active processing on the Fabric. Swapping the Background Plane into the Active Plane requires just one clock cycle. with eConfigurable Technology; the four algorithms are loaded into the entire reconfigurable processing Fabric one at a time. 2. C~SIDE Development Tools Without the necessary software tools, no one but the inventors has been able to port software to the processors. As a result customers had to give their algorithms to developers. With this software, Chameleon Systems are providing the ability for the customers to do the programming themselves thus keeping the secrecy of their algorithms. The Chameleon Systems Integrated Development Environment (C~SIDE) is a complete toolkit for designing, debugging and verifying RCP designs. C~Side uses a combined C language and Verilog (Verilog HDL is a hardware description language used to design and document electronic systems) flow to map algorithms into the chip's reconfigurable processing fabric (RPF).
16
C~SIDE includes an optimized GNU C compiler for the ARC Processor and an optimized Verilog To Bits (V2B) synthesizer for the Reconfigurable Processing Fabric., an interactive floor planner, an instruction-set simulator and a unified debug environment for the ARC core and the RPF.
3. eBIOS
eBIOS provides a interface between the Embedded Processor System and the Fabric. eBIOS provides resource allocation, configuration management and DMA services. The eBIOS calls are automatically generated at compile time, but can be edited for precise control of any function.
DESIGN PROCESS
Design process consists of converting a C/C++ program to a hardware configuration. One end of design is a C/C++ program and the other side is processing hardware. So a mapping is needed between them. But C is not a hardware description language (HDL). To specify a hardware configuration a HDL is needed. For that purpose Chameleon Systems uses a HDL called Verilog.When a hardware description in verilog is obtained it can be converted to configuration bits using VerilogToBits (V2B) synthesizer. Configuration bits actually specify hardware configuration. Now a mapping between C/C++ program and verilog is needed. For that an assembler is provided by Chameleon Systems. When an assembly language like description of C/C++ program is given to this assembler it will generate Verilog descriptions. Now C/C++ algorithm is mapped to a hardware configuration
17
DESIGN PROCESS
C/C++ PROGRAM
ASSEMBLER VERILOG V2B
18
However, system architects continue to struggle with the requirement that communication systems deliver both performance and flexibility. Enter the reconfigurable processor, an entirely new category of semiconductor solution that serves as a system-level platform for a broad range of applications. The RCP fills the void between fast but inflexible ASICs, and flexible but slow and costly DSPs and FPGAs. Table1 shows the comparison of RCP with other technologies in terms of Flexibility, cost, performance and time to- market factors.
COMPARISO TECH
RCP FLEXIB HIGH
19
ADVANTAGES
Early and fast design
Design cycle time and cost actually increase due to the fact that FPGAs are bitoriented arrays that incur large silicon overhead when used to process wide data streams. DSP processing speed is typically limited by an internal bus that provides the interconnect for multiple execution units. Converting a prototype to an ASIC solution for cost reduction and then manufacturing the ASIC is a lengthy and costly process. Prototyping using RCPs and associated tools enables a fast all-software design. Reducing power. RCPs achieve better speed/power characteristics than DSPs and FPGAs. Reducing development cost. RCPs substantially reduce development cycles and costs normally associated with ASIC design Reducing manufacturing cost. Measured by chip count or silicon area, the manufacturing cost advantage of an RCP over DSP- or FPGA-based solutions with equivalent data Increasing bandwidth. Every feature of the RCP more fundamental processing power, higher internal and external I/O speeds, closer interaction between the on-chip RISC processor and the
20
reconfigurable data stream logic, and the algorithmic flexibility to adapt to predetermined conditions enables the development of higher system bandwidth cost effectively.
DISADVANTAGES
Inertia might be the worst problem facing reconfigurable computing. Engineers are slow to change, and they're comfortable designing things the old way, which offered them a spectrum of programmable or hard-wired options. Several startups in reconfigurable computing have chosen the next-generation wireless market as the key battleground. Besides QuickSilver and Chameleon, Morphics Technology in Campbell, California, is also targeting the wireless market. They should expand from there. Controlling the development time and costs in an RCP design requires a comprehensive set of tools a design environment with a graceful flow from systems design to executable files that run the embedded microprocessor and configure the fabric. Hardware and software debugging and verification tools are also necessary. Ultimately, the complete RCP design process should merge seamlessly with the equipment manufacturers other design tools. At present, there is a "learning curve" for designers unfamiliar with reconfigurable logic. Because designer has to study Chameleons assembly like design entry language. Researches are going on to help designers enter their design through such tools as Matlab or SPW. That will let users draw data-flow diagrams in lieu of writing code.
21
22
APPLICATIONS
1. Wireless Base stations
The reconfigurable technology mainly focuses on base stations and their unpredictable combination of voice and data traffic.Base-station infrastructure will have to be adaptive enough to accommodate those requirements. With a fixed processor the channels must be able to support both simple voice calls and high-bandwidth data connections, which means many voice calls do not use up all the bandwidth that is assigned to them. With a reconfigurable processor, each channel can be allotted the exact amount of bandwidth it requires.
23
CS2112, provides very high bandwidth. Hence they can be effectively used in local switching stations.
24
25
CONCLUSION
One day, someone will make a chip that does everything for the ultimate consumer device. The chip will be smart enough to be the brains of a cell phone that can transmit or receive calls anywhere in the world. If the reception is poor, the phone will automatically adjust so that the quality improves. At the same time, the device will also serve as a handheld organizer and a player for music, videos, or games. Today, designing such a chip crosses too many architectural boundaries. Nobody has figured out a way to get a chip to meet all the criteria for the ultimate consumer device. But we might be getting closer. Now a new kind of chip adapts to any programming task by effectively erasing its hardware design and regenerating new hardware design that is perfectly suited to run the software at hand. These chips are referred to as reconfigurable processors. These new chips are able to rewire themselves on the fly to create the exact hardware needed to run a piece of software at the utmost speed. If these adaptable chips can reach cost-performance parity with hard-wired chips, so will the gadgets of the information age.
26
REFERENCES
27