Anda di halaman 1dari 215

OFDM Basics Tutorial

- OFDM - Orthogonal Frequency Division Multiplex, the modulation concept being used for many wireless and radio communications radio applications from DAB, DVB, Wi-Fi and Mobile Video.

OFDM tutorial includes: OFDM basics tutorial OFDM synchronization OFDMA / OFDM CDMA comparison Orthogonal Frequency Division Multiplex or OFDM is a modulation format that is finding increasing levels of use in today's radio communications scene. OFDM has been adopted in the Wi-Fi arena where the 802.11a standard uses it to provide data rates up to 54 Mbps in the 5 GHz ISM (Industrial, Scientific and Medical) band. In addition to this the recently ratified 802.11g standard has it in the 2.4 GHz ISM band. In addition to this, it is being used for WiMAX and is also the format of choice for the next generation cellular radio communications systems including 3G LTE and UMB. If this was not enough it is also being used for digital terrestrial television transmissions as well as DAB digital radio. A new form of broadcasting called Digital Radio Mondiale for the long medium and short wave bands is being launched and this has also adopted COFDM. Then for the future it is being proposed as the modulation technique for fourth generation cell phone systems that are in their early stages of development and OFDM is also being used for many of the proposed mobile phone video systems. OFDM, orthogonal frequency division multiplex is a rather different format for modulation to that used for more traditional forms of transmission. It utilises many carriers together to provide many advantages over simpler modulation formats.

OFDM concept
An OFDM signal consists of a number of closely spaced modulated carriers. When modulation of any form - voice, data, etc. is applied to a carrier, then sidebands spread out either side. It is necessary for a receiver to be able to receive the whole signal to be able to successfully demodulate the data. As a result when signals are transmitted close to one another they must be spaced so that the receiver can separate them using a filter and there must be a guard band between them. This is not the case with OFDM. Although the sidebands from each carrier overlap, they can still be received without the interference that might be expected because they

are orthogonal to each another. This is achieved by having the carrier spacing equal to the reciprocal of the symbol period.

Traditional view of receiving signals carrying modulation To see how OFDM works, it is necessary to look at the receiver. This acts as a bank of demodulators, translating each carrier down to DC. The resulting signal is integrated over the symbol period to regenerate the data from that carrier. The same demodulator also demodulates the other carriers. As the carrier spacing equal to the reciprocal of the symbol period means that they will have a whole number of cycles in the symbol period and their contribution will sum to zero - in other words there is no interference contribution.

OFDM Spectrum One requirement of the OFDM transmitting and receiving systems is that they must be linear. Any non-linearity will cause interference between the carriers as a result of inter-modulation distortion. This will introduce unwanted signals that would cause interference and impair the orthogonality of the transmission. In terms of the equipment to be used the high peak to average ratio of multi-carrier systems such as OFDM requires the RF final amplifier on the output of the transmitter to be able to handle the peaks whilst the average power is much lower and this leads to inefficiency. In some systems the peaks are limited. Although this introduces distortion that results in a higher level of data errors, the system can rely on the error correction to remove them.

Data on OFDM
The data to be transmitted on an OFDM signal is spread across the carriers of the signal, each carrier taking part of the payload. This reduces the data rate taken by each carrier. The lower data rate has the advantage that interference from reflections is much less critical. This is achieved by adding a guard band time or guard interval into the system. This ensures that the data is only

sampled when the signal is stable and no new delayed signals arrive that would alter the timing and phase of the signal.

Guard Interval The distribution of the data across a large number of carriers in the OFDM signal has some further advantages. Nulls caused by multi-path effects or interference on a given frequency only affect a small number of the carriers, the remaining ones being received correctly. By using error-coding techniques, which does mean adding further data to the transmitted signal, it enables many or all of the corrupted data to be reconstructed within the receiver. This can be done because the error correction code is transmitted in a different part of the signal.

OFDM variants
There are several other variants of OFDM for which the initials are seen in the technical literature. These follow the basic format for OFDM, but have additional attributes or variations:

COFDM: Coded Orthogonal frequency division multiplex. A form of OFDM where error correction coding is incorporated into the signal. Flash OFDM: This is a variant of OFDM that was developed by Flarion and it is a fast hopped form of OFDM. It uses multiple tones and fast hopping to spread signals over a given spectrum band. OFDMA: Orthogonal frequency division multiple access. A scheme used to provide a multiple access capability for applications such as cellular telecommunications when using OFDM technologies. VOFDM: Vector OFDM. This form of OFDM uses the concept of MIMO technology. It is being developed by CISCO Systems. MIMO stands for Multiple Input Multiple output and it uses multiple antennas to transmit and receive the signals so that multi-path effects can be utilised to enhance the signal reception and improve the transmission speeds that can be supported.

WOFDM: Wideband OFDM. The concept of this form of OFDM is that it uses a degree of spacing between the channels that is large enough that any frequency errors between transmitter and receiver do not affect the performance. It is particularly applicable to WiFi systems.

Each of these forms of OFDM utilise the same basic concept of using close spaced orthogonal carriers each carrying low data rate signals. During the demodulation phase the data is then combined to provide the complete signal. OFDM and COFDM have gained a significant presence in the wireless market place. The combination of high data capacity, high spectral efficiency, and its resilience to interference as a result of multi-path effects means that it is ideal for the high data applications that are becoming a common factor in today's communications scene.

Orthogonal Frequency Division Multiplexing for Wireless Networks


Standard IEEE 802.11a
UNIVERSITY OF CALIFORNIA SANTA BARBARA By ANBAL LUIS INTINI Graduate Student Electrical and Computer Engineering Department aintini@engineering.ucsb.edu December, 2000
Autonomous Transportation Agents for On-Scene Networked Incident Management

Contents

1. Introduction _______________________________________________________________ 1 2. The Standard IEEE 802.11a _________________________________________________ 5 3. Carrier Sense Multiple Access/Collision Avoidance __________________________ 10 4. Propagation Characteristics of mobile radio channels ______________________ 11 4.1. Attenuation____________________________________________________________ 12 4.2. Multipath Effects______________________________________________________12 4.2.1. Rayleigh fading __________________________________________________________ 12 4.2.2. Frequency Selective Fading __________________________________________________ 13 4.3. Delay Spread __________________________________________________________ 14 4.4. Doppler Shift __________________________________________________________ 15 5. OFDM ____________________________________________________________________ 16 5.1. General Structure______________________________________________________ 16 5.2. Implementation________________________________________________________ 18 5.2.1. Guard time and cyclic extension _______________________________________________ 20 5.2.2. Windowing_____________________________________________________________ 20 5.3. Forward Error Correction Coding ______________________________________ 22 5.3.1. Block Codes ____________________________________________________________ 22 5.4. Interleaving ___________________________________________________________ 24 6. Synchronization __________________________________________________________ 26 6.1. Synchronization Using Special Training Symbols_________________________ 29 7. Detection _________________________________________________________________ 31 7.1. Coherent Detection ____________________________________________________ 32 7.2. Two-Dimensional Channel Estimators ___________________________________ 33

8. Simulation________________________________________________________________ 36 Conclusion_________________________________________________________________ _ 38 Acknowledgements_______________________________________________________ ___ 39 References_________________________________________________________________ _ 39

Figures
Figure 1. Concept of OFDM signal: orthogonal multicarrier technique..........................................2 Figure 2. Spectra of (a) an OFDM subchannel and (b) and OFDM signal. .....................................3 Figure 3. 5-Ghz frequency spectrum allocation ................................................................................7 Figure 4. IEEE 802.11a frame format for 5 GHz. .............................................................................8 Figure 5. IEEE 802.11a channel scheme.........................................................................................10 Figure 6. Typical Rayleigh fading while the Mobile Unit is moving (900 MHz) [11] .....................13 Figure 7. Multipath Delay Spread ...................................................................................................15 Figure 8 OFDM modulator..............................................................................................................17 Figure 9. Block diagram of an OFDM transceiver..........................................................................19 Figure 10. OFDM frame with cyclic extension and windowing for (a) single ................................21 Figure 11. Convolutional encoder (k = 7) .......................................................................................22 Figure 12. BER of the (16,11,4) RM code and rate 2/3 memory 2 and 3 ........................................24 Figure 13. Convolutional interleaver ..............................................................................................26 Figure 14. Synchronization using the cyclic prefix..........................................................................26 Figure 15. Received phasor, showing effect of noise on the received phase angle .........................28 Figure 16. OFDM training structure ...............................................................................................30 Figure 17. Block diagram of an OFDM receiver with coherent detection......................................32 Figure 18. Pilot Sequence Generator ..............................................................................................34

Figure 19. Pilot Frequency Allocation ............................................................................................34 Figure 20. Simulink OFDM Model ..................................................................................................36 Figure 21. OFDM Signal Spectrum .................................................................................................37 Figure 22. Received constellation for a given amout of noise.........................................................38

Tables
Table 1. Typical attenuation in a radio channel ..............................................................................12 Table 2. Cumulative distribution for Rayleigh distribution [11].....................................................13 Table 3. Typical Delay Spread [11].................................................................................................15 Table 4. Physical layer parameters. ................................................................................................19 Table 5. Physical layer parameters. ................................................................................................20
December, 2000 Page 1 - 40

ORTHOGONAL FREQUENCY DIVISION MULTIPLEXING FOR WIRELESS NETWORKS


Abstract. Orthogonal frequency division multiplexing (OFDM) is a
special case of multicarrier transmission, where a single datastream is transmitted over a number of lower rate subcarriers. In July 1998, the IEEE standardization group decided to select OFDM as the basis for their new 5-GHz standard, targeting a range of data stream from 6 up to 54 Mbps. This new standard is the first one to use OFDM in packet-based communications, while the use of OFDM until now was limited to continuous transmission systems. In this project, transmitter and receiver were simulated according to the parameters established by the standard, to evaluate the performance and different possibilities in the implementation. Also, some considerations about forward error correction coding, synchronization and channel estimation are given oriented to improve the system performance.

1. Introduction
OFDM is of great interest by researchers and research laboratories all over the world. It has already been accepted for the new wireless local area network standards IEEE 802.11a, High Performance LAN type 2 (HIPERLAN/2) and Mobile Multimedia Access Communication (MMAC) Systems. Also, it is expected to be used for wireless broadband multimedia communications. Data rate is really what broadband is about. The new standard specify bit rates of up to 54 Mbps. Such high rate imposes large bandwidth, thus pushing carriers for values higher than UHF band. For instance, IEEE802.11a has frequencies allocated in the 5- and 17- GHz bands. This project is oriented to the application of OFDM to the standard IEEE 802.11a, following the parameters established for that case. OFDM can be seen as either a modulation technique or a multiplexing technique. One of the main reasons to use OFDM is to increase the robustness against frequency selective fading or narrowband interference. In a single carrier system, a single fade or interferer can cause the entire link to fail, but in a multicarrier system, only a small percentage of the subcarriers will be affected. Error correction coding can then be used to correct for the few erroneous subcarriers. The concept of using parallel data transmission and frequency division multiplexing was published in the mid-1960s [1, 2]. Some early
OFDM for Wireless Networks December, 2000 Page 2 - 40

development is traced back to the 1950s [3]. A U.S. patent was filed and issued in January 1970 [4]. In a classical parallel data system, the total signal frequency band is divided into N nonoverlapping frequency subchannels. Each subchannel is modulated with a separate symbol and then the N subchannels are frequency-multiplexed. It seems good to avoid spectral overlap of channels to eliminate interchannel interference. However, this leads to inefficient use of the available spectrum. To cope with the inefficiency, the ideas proposed from the mid-1960s were to use parallel data and FDM with overlapping subchannels, in which, each carrying a signaling rate b is spaced b apart in frequency to avoid the use of high-speed equalization and to combat impulsive noise and multipath distortion, as well as to fully use the available bandwidth.
Figure 1. Concept of OFDM signal: orthogonal multicarrier technique versus conventional multicarrier technique

Figure 1 illustrates the difference between the conventional nonoverlapping multicarrier technique and the overlapping multicarrier modulation technique. As shown in Figure 1, by using the overlapping multicarrier modulation technique, we save almost 50%

of bandwidth. To realize the overlapping multicarrier technique, however we need to


OFDM for Wireless Networks December, 2000 Page 3 - 40

reduce crosstalk between subcarriers, which means that we want orthogonality between the different modulated carriers. The word orthogonal indicates that there is a precise mathematical relationship between the frequencies of the carriers in the system. In a normal frequency-division multiplex system, many carriers are spaced apart in such a way that the signals can be received using conventional filters and demodulators. In such receivers, guard bands are introduced between the different carriers and in the frequency domain, which results in a lowering of spectrum efficiency. It is possible, however, to arrange the carriers in an OFDM signal so that the sidebands of the individual carriers overlap and the signals are still received without adjacent carrier interference. To do this, the carriers must be mathematically orthogonal. The receiver acts as a bank of demodulators, translating each carrier down to DC, with the resulting signal integrated over a symbol period to recover the raw data. If the other carriers all beat down the frequencies that, in the time domain, have a whole number of cycles in the symbol period T, then the integration process results in zero contribution from all these other carriers. Thus, the carriers are linearly independent (i.e., orthogonal) if the carrier spacing is a multiple of 1/T. (a) (b)
Figure 2. Spectra of (a) an OFDM subchannel and (b) and OFDM signal.

Much of the research focuses on the high efficient multicarrier transmission scheme based on "orthogonal frequency" carriers. In 1971, Weinstein and Ebert applied the discrete Fourier transform (DFT) to parallel data transmission systems as part of the modulation and demodulation process. Figure 2(a) shows the spectrum of the individual data of the subchannel. The OFDM signal, multiplexed in the individual spectra with a
OFDM for Wireless Networks December, 2000 Page 4 - 40

frequency spacing b equal to the transmission speed of each subcarrier, is shown in Figure 1(b). Figure 2 shows that at the center frequency of each subcarrier, there is no crosstalk from other channels. Therefore, if we use DFT at the receiver and calculate correlation values with the center of frequency of each subcarrier, we recover the transmitted data with no crosstalk. In addition, using the DFT-based multicarrier technique, frequency-division multiplex is achieved not by bandpass filtering but by baseband processing. Moreover, to eliminate the banks of subcarrier oscillators and coherent demodulators required by frequency-division multiplex, completely digital implementations could be built around special-purpose hardware performing the fast Fourier transform (FFT), which

is an efficient implementation of the DFT . Recent advances in very-large-scale integration (VLSI) technology make high-speed, large-size FFT chips commercially affordable. Using this method, both transmitter and receiver are implemented using efficient FFT techniques that reduce the number of operations from N2 in DFT down to N log N. In the 1980s, OFDM was studied for high-speed modems, digital mobile communications, and high-density recording. One of the systems realized the OFDM techniques for multiplexed QAM using DFT, and by using pilot tone, stabilizing carrier and clock frequency control and implementing trellis coding are also implemented. Moreover, various-speed modems were developed for telephone networks. In the 1990s, OFDM was exploited for wideband data communications over mobile radio FM channels, high-bit-rate digital subscriber lines (HDSL; 1.6 Mbps), asymmetric digital subscriber lines (ADSL; up to 6 Mbps), very-high-speed digital subscriber lines (VDSL; 100 Mbps), digital audio broadcasting (DAB), and high-definition television (HDTV) terrestrial broadcasting. The OFDM transmission scheme has the following key advantages: __Makes efficient use of the spectrum by allowing overlap __By dividing the channel into narrowband flat fading subchannels, OFDM is more resistant to frequency selective fading than single carrier systems are. __Eliminates ISI and IFI through use of a cyclic prefix. __Using adequate channel coding and interleaving one can recover symbols lost due to the frequency selectivity of the channel. __Channel equalization becomes simpler than by using adaptive equalization techniques with single carrier systems. __It is possible to use maximum likelihood decoding with reasonable complexity, as discussed in OFDM is computationally efficient by using FFT techniques to implement the modulation and demodulation functions.
OFDM for Wireless Networks December, 2000 Page 5 - 40

__In conjunction with differential modulation there is no need to implement a channel estimator. __Is less sensitive to sample timing offsets than single carrier systems are. __Provides good protection against cochannel interference and impulsive parasitic noise. In terms of drawbacks OFDM has the following characteristics: __The OFDM signal has a noise like amplitude with a very large dynamic range, therefore it requires RF power amplifiers with a high peak to average power ratio. __It is more sensitive to carrier frequency offset and drift than single carrier systems are due to leakage of the DFT.

2. The Standard IEEE 802.11a


The IEEE 802.11 specification is a wireless LAN (WLAN) standard that defines a set

of requirements for the physical layer (PHY) and a medium access control (MAC) layer. For high data rates, the standard provides two PHYs - IEEE 802.11b for 2.4-GHz operation and IEEE 802.11a for 5-GHz operation. The IEEE 802.11a standard is designed to serve applications that require data rates higher than 11 Mbps in the 5-GHz frequency band. The wireless medium on which the 802.11 WLANs operate is different from wired media in many ways. One of those differences is the presence of interference in unlicensed frequency bands, which can impact communications between WLAN NICs. Interference on the wireless medium can result in packet loss, which causes the network to suffer in terms of throughput performance. Current 2.4-GHz 802.11b radios handle interference well because they support a feature in the MAC layer known as fragmentation. In fragmentation, data frames are broken into smaller frames in an attempt to increase the probability of delivering packets without errors induced by the interferer. When a frame is fragmented, the sequence control field in the MAC header indicates placement of the individual fragments and whether the current fragment is the last in the sequence. When frames are fragmented into request-to-send (RTS), clear-to-send (CTS), and acknowledge (ACK), control frames are used to manage the data transmission. Therefore, using fragmentation, designers can avoid interference problems in their WLAN designs. But interference is not the only problem for todays WLAN designers. Security issues are also a major concern. To solve potential security problems, the IEEE has
OFDM for Wireless Networks December, 2000 Page 6 - 40

incorporated a MAC-level privacy mechanism within the 802.11 specification, which protects the content of data frames going over a wireless medium from eavesdroppers. The mechanism, dubbed wired equivalent privacy (WEP), is an encryption engine that takes the contents of the entire data frame and passes them through an encryption algorithm. The encrypted data frames are transmitted with the WEP bit set in the frame control field of the MAC header. The received encrypted data frames are decrypted using the same encryption algorithm employed by the sending unit. The encryption algorithm used by 802.11 is RC4. RC4 is a symmetric stream cipher developed by RSA Data Security, Inc. that supports variable key lengths up to 256 bits. The standard specifies a 40-bit key, but many 802.11-compliant products shipping today support key lengths of up to 128 bits.

In order for a mobile station to communicate with other mobile wireless NICs in a given service area, it must first locate those wireless NICs or access points. To enable communication between the mobile station and the NIC, active and passive scanning techniques are supported in the MAC. Passive scanning involves listening for traffic only on an 802.11 network. Passive scanning allows a mobile wireless NIC to find an IEEE 802.11 network while minimizing DC power consumption. In this mode, the wireless NIC listens for special frames called beacons and probe responses, while extracting information about the particular frequency channel. Although passive scanning expends minimal power, the cost is the time spent listening for a frame on a channel that is idle or may never occur. Active scanning, on the other hand, requires the scanning wireless NIC to transmit and receive responses from 802.11 wireless NICs and access points. Active scanning allows the mobile wireless NIC to interact with another wireless NIC or access point. The 802.11 standard does not specify a method for scanning. However, many WLAN OEMs support both methods and variants to differentiate their products in the market. When developing WLAN systems, choosing the right modulation and frequency band should be a priority in RF design, especially when designing IEEE 802.11a radios. For the past decade, WLAN systems have been designed to operate in the unlicensed 2.4GHz frequency band. The 2.4-GHz band provides 83 MHz of total contiguous bandwidth, spanning from 2.4 to 2.483 GHz. Moving to the 5-GHz band offers over three times the operating bandwidth over the available spectrum in the 2.4-GHz band. The 5-GHz band is also less susceptible to interference, unlike the 2.4-GHz unlicensed band, which shares spectrum with other wireless appliances such as Bluetooth devices.
OFDM for Wireless Networks December, 2000 Page 7 - 40

There are, however, a few things to consider when switching to 5 GHz. The first is that the frequency allocation isnt contiguous across the band, and the transmit power levels are restricted depending on which block of frequency is occupied. Secondly, in order to achieve the same effective range as covered in the 2.4-GHz band, the transmit power of a 5-GHz system must be slightly increased. As a designer of 5-GHz radios, these issues must be carefully considered in product development. In the US, 300 MHz of bandwidth is allocated in the 5-GHz band to WLANs under the rules of the Unlicensed-National Information Infrastructure (U-NII). The bandwidth is fragmented into two blocks that are noncontiguous across the 5-GHz band. In Europe, only Hiperlan WLANs are allowed to operate in the 5-GHz frequency band. A total of 455 MHz of spectrum is allocated for Hiperlan radios. The frequency spectrum allocations for each of the geographic regions are shown in Figure 3.
Figure 3. 5-Ghz frequency spectrum allocation

Its important to point out that although the PHY specifications for IEEE 802.11a are similar to the Hiperlan2, radios compliant with the 802.11a specification are not allowed to operate in the 5-GHz band according to ETSI rules. Efforts are under way by IEEE 802 and ESTI together with the ITU-R to harmonize a global allocation of 5-GHz spectrum for WLANs. Global harmonization could occur by late 2001. When the IEEE 802.11 began evaluating proposals for 802.11a, the working group adopted a joint proposal from NTT and Lucent that recommended orthogonal frequency
OFDM for Wireless Networks December, 2000 Page 8 - 40

division multiplexing (OFDM) as the baseline technology for 5-GHz WLAN systems. OFDM was chosen because of its superior performance in combating fading multipath. This battle is extremely important, particularly in applications that transmit streaming video. During the development of the 802.11a specification, ESTI was charging ahead with a 5-GHz WLAN project called Hiperlan2. They too adopted OFDM. For the most part, the PHY for 802.11a is similar to Hiperlan2. The differences between the two standards are minimal and reside in the method by which convolution encoding is used to generate the OFDM symbols and data rates. But it has been said that by making the convolution encoder a programmable feature in the baseband processor, the same silicon can be used to support both standards. This is an extremely attractive feature for those who want to develop products for both standards. Unfortunately, the MAC layers are very different. In the U-NII band, eight carriers are spaced across 200 MHz in the lower spectrum (5.150 - 5.350 GHz) and four carriers are spaced across 100 MHz in the upper spectrum (5.725 - 5.825 GHz). The channels are spaced 20 MHz apart, which allows for high bit rates per channel. The channel scheme used for 5 GHz is illustrated in figure 4.
Figure 4. IEEE 802.11a frame format for 5 GHz.

As figure 2 illustrates, there are 52 subcarriers per channel in the 5-GHz band. Of these channels, only 48 carry actual data. The remaining four subcarriers are used as pilot tones, which assist in phase tracking for coherent demodulation. The duration of the guard interval is equal to 800 ns, which provides excellent performance on channels with delay spread of up to 250 ns. To efficiently use the spectrum provided in the 5-GHz range, designers of IEEE 802.11a systems use OFDM techniques. OFDM is a unique form of multicarrier modulation. The basic concept is to transmit high data rate information into
OFDM for Wireless Networks December, 2000 Page 9 - 40

several interleaved, parallel bit streams and let each of these bit streams modulate a separate sub carrier. In this way, the channel spectrum is passed into a number of independent, non-selective frequency sub-channels for transmission between wireless NICs and access points. The OFDM modulation technique is generated through the use of complex signal

processing approaches such as fast Fourier transforms (FFTs) and inverse FFTs in the transmitter and receiver sections of the radio. One of the benefits of OFDM is its strength in fighting the adverse effects of multipath propagation with respect to intersymbol interference in a channel. OFDM is also spectrally efficient because the channels are overlapped and contiguous. OFDM is well tested and has been adopted by a number of standards bodies for several applications, including a wired global standard for asymmetric digital subscriber line (ADSL) and for digital audio broadcasting (DAB) in the European market. To complement OFDM, the IEEE 802.11a specification also offers support for a variety of other modulation and coding alternatives. For example, the standard allows engineers to combine BPSK, QPSK, and 16-QAM modulations with convolution encoding (R = 1/2 and constraint length seven) to generate data rates of 6, 12, and 24 Mbps. All other combinations of encoding rates, including R = 2/3 and R = 3/4 combined with 64QAM, are used to generate rates up to 54 Mbps, which are optional in the standard. During development of the 802.11a standard, the IEEE 802.11 working group carefully optimized the PHY for traffic transmitting multimedia content such as streaming video. The packet date frame defined in the 802.11a specification consists of the PHY header, PHY layer convergence protocol (PLCP) and the payload (PSDU). This is similar to the structure used in the IEEE 802.11b specification. The first field of the PLCP header is called the preamble. The preamble consists of 12 symbols, which are used to synchronize the receiver. The second field is the signal field. The signal field is used to indicate the rate at which the OFDM symbols of the PSDU payload are transmitted. The PLCP header is always BPSK modulated and convolution encoded at R = 1/2. The PSDU packet payload is modulated and transmitted at the rate indicated in the signal field. This rate is variable from 6 up to 54 Mbps. The structure of the packet data frame is illustrated in figure 5.
OFDM for Wireless Networks December, 2000 Page 10 - 40

Figure 5. IEEE 802.11a channel scheme

3. Carrier Sense Multiple Access/Collision Avoidance


Since different physical transmission layers are supported by IEEE 802.11, the wireless MAC protocol should be transparency to physical layers, which include Direct Sequence Spread Spectrum (DSSS), Frequency Hopping Spread Spectrum (FHSS) diffused infrared and recently OFDM. Since spectrum is a scare resource above all different physical layers, the throughput and packet delay performance is one of the most critical considerations in the design of a wireless MAC protocol. The basic protocol level in the 802.11 MAC protocol is the Distributed Coordination Function (DCF), which supports asynchronous communication between multiple users [5]. The DCF allows sharing medium between similar and dissimilar

systems through the use of the CSMA/CA and a random back off delay algorithm. The CSMA/CA is similar to the Carrier Sense Multiple Access with Collision Detection (CSMA/CD) used in a Ethernet. As the Ethernet, the CSMA/CA uses carrier-sense mechanism to determine whether other terminals are using the medium. If a channel is sensed idle, the packet transmission is started immediately in both cases. However, if the channel is sensed busy, the CSMA/CA and the CSMA/CD operate differently to resolve the contention. In the case of the CSMA/CD, when a terminal senses a busy channel, it waits until the channel goes idle and then it transmits a packet with probability one. When two or more terminals are waiting to transmit, a collision is absolutely occurred because each terminal will transmit immediately at the end of channel busy period. While a terminal, operates in the CSMA/CA protocol, senses the busy channel, it waits until the channel goes idle and waits for delay period, which is called backoff delay. In the CSMA/CA, the collision probability between multiple terminals under above situation is reduced since a random backoff arrangement is used to resolve medium contention conflicts. The Collision Detection (CD) function detects collisions in the CSMA/CD, but the CD function is not viable in wireless LANs because the dynamic range of signals in the medium is very large. Thus, packet transmission errors are increased in wireless
OFDM for Wireless Networks December, 2000 Page 11 - 40

communication environments. The IEEE 802.11 MAC protocol supports coexisting asynchronous and time-bounded services using different priority levels with different Inter Frame Space (IFS) delay controls. Three kinds of IFS are used to support three backoff priorities such as a Short IFS (SIFS), a Point coordination function IFS (PIFS) and Distributed Coordination function IFS (DIFS); SIFS is the shortest IFS and is used for all immediate response actions which include ac-knowledgement (ACK) packet transmissions, Clear To Send (CTS) packet transmissions and contention-free response packet transmissions. PIFS is a middle length IFS and is used for terminal polling in time-bounded services. DIFS is the longest IFS and is used as a minimum delay for asynchronous transmission in the contention period. A random backoff algorithm of the IEEE 802.11 MAC is similar to that of Ethernet.The slot time is the sum of transmitter turn-on time, medium propagation delay and medium busy detect response time. In wireless communication environments, packet transmission suffers from .hidden terminal., so IEEE 802.11 MAC protocol provides three alternative ways of packet transmission flow control [5]. First, actual data packet is only used for packet transmission which is called Basic CSMA/CA. Second, immediate positive acknowledgements are employed to confirm the successful reception of each packet. We call this scheme Stop-and-Wait (SW) CSMA/CA. The last is 4-Way Handshake (4-WH)

CSMA/CA which use Request To Send (RTS) and Clear To Send (CTS) packets prior to the transmission of the actual data packet. The packet transmission flow of three kinds of CSMA/CA is summarized as follow 1. Basic CSMA/CA: Data . Data - . 2. SW CSMA/CA: Data - ACK - Data - ACK - . 3. 4WH CSMA/CA: RTS - CTS - Data - ACK -.

4. Propagation Characteristics of mobile radio channels


In an ideal radio channel, the received signal would consist of only a single direct path signal, which would be a perfect reconstruction of the transmitted signal. However in a real channel, the signal is modified during transmission in the channel. The received signal consists of a combination of attenuated, reflected, refracted, and diffracted replicas of the transmitted signal. On top of all this, the channel adds noise to the signal and can cause a shift in the carrier frequency if the transmitter, or receiver is moving (Doppler effect). Understanding of these effects on the signal is important because the performance of a radio system is dependent on the radio channel characteristics.
OFDM for Wireless Networks December, 2000 Page 12 - 40

4.1. Attenuation Attenuation is the drop in the signal power when transmitting from one point to another. It can be caused by the transmission path length, obstructions in the signal path, and multipath effects. Any objects, which obstruct the line of sight signal from the transmitter to the receiver, can cause attenuation. Shadowing of the signal can occur whenever there is an obstruction between the transmitter and receiver. It is generally caused by buildings and hills, and is the most important environmental attenuation factor. Shadowing is most severe in heavily built up areas, due to the shadowing from buildings. However, hills can cause a large problem due to the large shadow they produce. Radio signals diffract off the boundaries of obstructions, thus preventing total shadowing of the signals behind hills and buildings. However, the amount of diffraction is dependent on the radio frequency used, with low frequencies diffracting more then high frequency signals. Thus high frequency signals, especially, Ultra High Frequencies (UHF), and microwave signals require line of sight for adequate signal strength. To over come the problem of shadowing, transmitters are usually elevated as high as possible to minimise the number of obstructions. Typical amounts of variation in attenuation due to shadowing are shown in table 1.
Description Typical Attenuation due to Shadowing Heavily built-up urban center 20dB variation from street to street Sub-urban area (fewer large buildings) 10dB greater signal power then built-up urban center Open rural area 20dB greater signal power then sub-urban areas

Terrain irregularities and tree foliage 3-12dB signal power variation

Table 1. Typical attenuation in a radio channel

Shadowed areas tend to be large, resulting in the rate of change of the signal power being slow. For this reason, it is termed slow-fading, or log-normal shadowing. 4.2. Multipath Effects 4.2.1. Rayleigh fading In a radio link, the RF signal from the transmitter may be reflected from objects such as hills, buildings, or vehicles. This gives rise to multiple transmission paths at the receiver. The relative phase of multiple reflected signals can cause constructive or destructive interference at the receiver. This is experienced over very short distances (typically at half wavelength distances), thus is given the term fast fading. These variations can vary from
OFDM for Wireless Networks December, 2000 Page 13 - 40

10-30dB over a short distance. Figure 6 shows the level of attenuation that can occur due to the fading.
Figure 6. Typical Rayleigh fading while the Mobile Unit is moving (900 MHz)[11]

The Rayleigh distribution is commonly used to describe the statistical time varying nature of the received signal power. It describes the probability of the signal level being received due to fading. Table 2 shows the probability of the signal level for the Rayleigh distribution. Signal Level (dB about median) % Probability of Signal Level being less than the value given 10 99 0 50 -10 5 -20 0.5 -30 0.05
Table 2. Cumulative distribution for Rayleigh distribution [11]

4.2.2. Frequency Selective Fading In any radio transmission, the channel spectral response is not flat. It has dips or fades in the response due to reflections causing cancellation of certain frequencies at the receiver. Reflections off near-by objects (e.g. ground, buildings, trees, etc) can lead to multipath signals of similar signal power as the direct signal. This can result in deep nulls in the received signal power due to destructive interference.
OFDM for Wireless Networks December, 2000 Page 14 - 40

For narrow bandwidth transmissions if the null in the frequency response occurs at the transmission frequency then the entire signal can be lost. This can be partly overcome

in two ways. By transmitting a wide bandwidth signal or spread spectrum as CDMA, any dips in the spectrum only result in a small loss of signal power, rather than a complete loss. Another method is to split the transmission up into many small bandwidth carriers, as is done in a COFDM/OFDM transmission. The original signal is spread over a wide bandwidth thus, any nulls in the spectrum are unlikely to occur at all of the carrier frequencies. This will result in only some of the carriers being lost, rather then the entire signal. The information in the lost carriers can be recovered provided enough forward error corrections is sent. 4.3. Delay Spread The received radio signal from a transmitter consists of typically a direct signal, plus reflections of object such as buildings, mountings, and other structures. The reflected signals arrive at a later time than the direct signal because of the extra path length, giving rise to a slightly different arrival time of the transmitted pulse, thus spreading the received energy. Delay spread is the time spread between the arrival of the first and last multipath signal seen by the receiver. In a digital system, the delay spread can lead to inter-symbol interference. This is due to the delayed multipath signal overlapping with the following symbols. This can cause significant errors in high bit rate systems, especially when using time division multiplexing (TDMA). Figure 7 shows the effect of inter-symbol interference due to delay spread on the received signal. As the transmitted bit rate is increased the amount of intersymbol interference also increases. The effect starts to become very significant when the delay spread is greater then ~50% of the bit time.
OFDM for Wireless Networks December, 2000 Page 15 - 40

Figure 7. Multipath Delay Spread

Table 3 shows the typical delay spread that can occur in various environments. The maximum delay spread in an outdoor environment is approximately 20sec, thus significant intersymbol interference can occur at bit rates as low as 25kbps. Environment or cause Delay Spread Maximum Path Length Difference Indoor (room) 40nsec 200nsec 12m - 60 m Outdoor 1usec - 20usec 300m - 6km
Table 3. Typical Delay Spread [11]

Inter-symbol interference can be minimized in several ways. One method is to reduce the symbol rate by reducing the data rate for each channel (i.e. split the bandwidth into more channels using frequency division multiplexing). Another is to use a coding

scheme, which is tolerant to intersymbol interference such as CDMA. 4.4. Doppler Shift When a wave source and a receiver are moving relative to one another the frequency of the received signal will not be the same as the source. When they are moving toward each other the frequency of the received signal is higher then the source, and when they are approaching each other the frequency decreases. This is called the Doppler effect. An
OFDM for Wireless Networks December, 2000 Page 16 - 40

example of this is the change of pitch in a car.s horn as it approaches then passes by. This effect becomes important when developing mobile radio systems. The amount the frequency changes due to the Doppler effect depends on the relative motion between the source and receiver and on the speed of propagation of the wave. The Doppler shift in frequency can be written: [10]
c f f v 0 }

where f is the change in frequency of the source seen at the receiver, fo is the frequency of the source, v is the speed difference between the source and transmitter, and c is the speed of light. For example: Let fo= 5.9 GHz, and v = 80km/hr = 22.22m/s (50 miles/hr aprox) then the Doppler shift will be:
Hz xms f x Hertz m s 437 3 10 / 5.9 10 22.22 / 8
9 0

==

This shift of 437Hz in the carrier will generally not effect the transmission. However, Doppler shift can cause significant problems if the transmission technique is sensitive to carrier frequency offsets or the relative speed is higher, which is the case for OFDM. If we consider now a link between to cars moving in opposite directions, each one with a speed of 80 km/hr, the Doppler shift will be double.

5. OFDM
5.1. General Structure The basic principle of OFDM is to split a high-rate datastream into a number of lower rate streams that are transmitted simultaneously over a number of subcarriers. The relative amount of dispersion in time caused by multipath delay spread is decreased because the symbol duration increases for lower rate parallel subcarriers. The other problem to solve is the intersymbol interference, which is eliminated almost completely by

introducing a guard time in every OFDM symbol. This means that in the guard time, the OFDM symbol is cyclically extended to avoid intercarrier interference. An OFDM signal is a sum of subcarriers that are individually modulated by using phase shift keying (PSK) or quadrature amplitude modulation (QAM). The symbol can be written as:
OFDM for Wireless Networks December, 2000 Page 17 - 40

s t t t and t t T tttttT T stdjfi


SS SS N iN iNcs
S S S

=<>+ + __ __ _ __ __ _
= +

=_ + ( ) 0, ( ) Re exp( 2 ( 0.5)( )) ,
1 2 2 /2

where : NS is the number of subcarriers T is the symbol duration fc is the carrier frequency The equivalent complex baseband notation is given by:
s t t t and t t T tttttT T stdji
SS SS N iN iNs
S S S

=<>+
= +

=_ + ( ) 0, ( ) exp( 2 ( )) ,
1 2 2 /2

(1) In this case, the real and imaginary parts correspond to the in-phase and quadrature parts of the OFDM signal. They have to be multiplied by a cosine and sine of the desired frequency to produce the final OFDM signal. Figure 8 shows the block diagram for the OFDM modulator.
Figure 8 OFDM modulator

The complex baseband OFDM signal defined the equation (1) is the inverse Fourier transform of Ns QAM input symbols. The time discrete case is the inverse discrete Fourier transform. In practice, this transform can be implemented very efficiently by the inverse fast Fourier transform (IFFT). The IFFT drastically reduces the amount of calculations by exploiting the regularity of the operations in the IDFT.
Serial to Parallel QAM data exp(-j NS(t-ts)/T) exp(-j (NS-2)(t-ts)/T) OFDM signal . . . OFDM for Wireless Networks December, 2000 Page 18 - 40

5.2. Implementation In practice, the OFDM signal for the standard IEEE 802.11a is generated as follows: In the transmitter, binary input data is encoded by a rate convolutional encoder. The rate can be increased to 2/3 and . After interleaving, the binary values are converted to QAM values. Four pilot values are added each 48 data values, resulting in a total of 52 QAM values per OFDM symbol. The symbol is modulated onto 52 subcarriers by applying the Inverse Fast Fourier Transform (IFFT). The output is converted to serial and a cyclic extension is added to make the system robust to multipath propagation. Windowing is applied after to get a narrower output spectrum. Using an IQ modulator, the signal is converted to analog, which is upconverted to the 5 GHz band, amplified, and transmitted through the antenna.

Basically, the receiver performs the reverse operations of the transmitter, with additional training tasks. In the first step, the receiver has to estimate frequency offset and symbol timing, using special training symbols in the preamble. After removing the cyclic extension, the signal can be applied to a Fast Fourier Transform to recover the 52 QAM values of all subcarriers. The training symbols and the pilot subcarriers are used to correct for the channel response as well as remaining phase drift. The QAM values are then demapped into binary values, and finally a Viterbi decoder decodes the information bits. Figure 9 shows the block diagram of an OFDM modem, including the transmitter and the receiver. The IFFT modulates a block of input QAM values onto a number of subcarriers. In the receiver, the subcarriers are demodulated by the FFT, which is the reverse operation of the IFFT. These two operations are almost identical. In fact, the IFFT can be made using an FFT by conjugating input and output of the FFT and dividing the output by the FFT size. This makes it possible to use the same hardware for both the transmitter and the receiver. Of course, this saving in complexity is only possible when the modem does not have to transmit and receive simultaneously, which is the case for the standard.
OFDM for Wireless Networks December, 2000 Page 19 - 40

Figure 9. Block diagram of an OFDM transceiver

In case of the standard IEEE 802.11a, the parameters for the physical layer (e.g. for OFDM) are as follows:
Data rate (Mbits/s) Modulation Coding rate (R) Coded bits per subcarrier (NBPSC) Coded bits per OFDM symbol (NCBPS) Data bits per OFDM symbol (NDBPS) 6 BPSK 1/2 1 48 24 9 BPSK 3/4 1 48 36 12 QPSK 1/2 2 96 48 18 QPSK 3/4 2 96 72 24 16-QAM 1/2 4 192 96 36 16-QAM 3/4 4 192 144 48 64-QAM 2/3 6 288 192 54 64-QAM 3/4 6 288 216

Table 4. Physical layer parameters.


FEC Encoder Interleaving QAM Mapping Pilot Insertion Serial to Parallel Parallel to Serial Add cyclic extension and windowing

IQ modulator Binary input data HPA FEC Decoder Deinterleaving Channel QAM demapping Correction Parallel to Binary Serial output data Serial to Parallel Remove cyclic extension Timing and frequency synchronization IQ Detector LNA AFC Clock Recovery AGC Amp IFFT (TX) Frequency corrected signal Symbol timing FFT(RX)

OFDM for Wireless Networks December, 2000 Page 20 - 40

Parameter Value NSD : Number of data subcarriers 48 NSP : Number of pilot subcarriers 4 NS : Number of subcarriers, total 52 (NSD + NSP) F : Subcarrier frequency spacing 0.3125 MHz (=20 MHz/64) TFFT : IFFT/FFT period 3.2 s (1/F) TPREAMBLE : PLCP preamble duration 16 s (TSHORT + TLONG) TSIGNAL : Duration of the SIGNAL BPSK-OFDM symbol 4.0 s (TGI + TFFT) TGI : GI duration 0.8 s (TFFT /4) TGI2 : Training symbol GI duration 1.6 s (TFFT /2) TSYM : Symbol interval 4 s (TGI + TFFT) TSHORT : Short training sequence duration 8 s (10 TFFT /4) TLONG : Long training sequence duration 8 s (TGI2 + 2TFFT) Table 5. Physical layer parameters.

5.2.1. Guard time and cyclic extension One of the most important problems in for wireless communications is the multipath delay spread. OFDM deals with it very efficiently. The parallel transmission implies that the input datastream is divided in NS subcarriers and the symbol duration is made NS times smaller, which also reduces the relative multipath delay spread, relative to the symbol time, by the same factor. The intersymbolic interference is almost completely eliminated by introducing a guard time for a each OFDM symbol. The guard time is chosen larger than the expected delay spread such that multipath components from one symbol cannot interfere with the next symbol. This guard time could be no signal at all but the problem of intercarrier interference (ICI) would arise. Then, the OFDM symbol is cyclically extended in the guard

time. Using this method, the delay replicas of the OFDM symbol always have an integer number of cycles within the FFT interval, as long as the delay is smaller than the guard time. Multipath signals with delays smaller than the guard time cannot cause ICI. If multipath delay exceeds the guard time by a small fraction of the FFT interval (for example 3%), the subcarriers are not orthogonal anymore, but the interference is still small enough to get a reasonable constellation. Considering that the multipath delay exceeds the guard time by 10% of the FFT interval, the constellation is seriously affected and an unacceptable error rate is obtained. 5.2.2. Windowing Essentially, an OFDM signal consists of a number of unfiltered QAM subcarriers. This means that the out-of-band spectrum decreases rather slowly, following a sinc
OFDM for Wireless Networks December, 2000 Page 21 - 40

function. For larger number of subcarriers, the spectrum goes down rapidly in the beginning, which is caused by the fact that the sidelobes are closer together. To make the spectrum decrease faster, windowing is applied to the OFDM signal. The standard doesn.t specify the kind of window to be used but an example is included using the following function [5]:
(2) ) 22 0.5 ( 2 sin ) 22 1( ) 22 0.5 ( 2 sin ()
2 2

____ _ ____ _ _ +< _ _ _ _ __

__ _ _ __ < << _ _ _ _ __ __ _ _ __ + =
TR TR TR TR TR TR TR TR T

for T T t T T T tT for T t T T for T t T T t wt

considering that TTR is the transition time between two consecutive periods of FFT, as it can be seen in Figure .
Figure 10. OFDM frame with cyclic extension and windowing for (a) single reception or (b) two receptions of the FFT period

Figure 10 also illustrates the possibility of extending the windowing function over more than one period, TFFT , and additionally shows smoothed transitions by application of

a windowing function, as exemplified in Equation (2). In particular, window functions that extend over multiple periods of the FFT are utilized in the definition of the preamble.
OFDM for Wireless Networks December, 2000 Page 22 - 40

Several other conventional windows were simulated including raised cosine, Hann, Hamming, Blackman and Kaiser. The best performance was obtained for the Blackman window, considering the resulting stopband attenuation and the transition bandwidth. 5.3. Forward Error Correction Coding According to the standard, data must be encoded with a convolutional encoder of coding rate R = 1/2, 2/3, or 3/4, corresponding to the desired data rate. The convolutional encoder uses the industry-standard generator polynomials, g0 = 1338 and g1 = 1718 of rate R = 1/2, as shown in Figure 11.
Figure 11. Convolutional encoder (k = 7)

The bit denoted as .A. shall be output from the encoder before the bit denoted as .B.. Higher rates are derived from it by employing .puncturing.. Puncturing is a procedure for omitting some of the encoded bits in the transmitter reducing the number of transmitted bits and increasing the coding rate, and inserting a dummy .zero. metric into the convolutional decoder on the receive side in place of the omitted bits. Decoding by the Viterbi algorithm is recommended. Even thought the convolutional code is established by the standard, block codes may be more desirable in certain applications, such as packet voice or data communications, in which the length of the packet may be made equal to a multiple of the code length. In these applications, block codes are more attractive than convolutional codes, since the memory of the convolutional codes must be brought to a known state to terminate the trellis, which lowers the effective code rate. 5.3.1. Block Codes
Tb Tb Tb Tb Tb Tb Tb Output data A Output data B Input data

OFDM for Wireless Networks December, 2000 Page 23 - 40

A block code encodes a block of k input symbols into n coded symbols, with n being larger than k. The purpose of adding the redundant n-k symbols is to increase the minimum Hamming distance, which is the minimum number of different symbols between any pair of code words. For a minimum Hamming distance of dmin, the code can correct t errors where t is given by:
__ _ __ _ 2

1 min t floor d (3)

The minimum Hamming distance is upperbound by the number of redundant symbols n-k as: 1 min d n k + (4) For binary codes, only repetition codes and single-parity check codes reach this upperbound. A class of nonbinary codes that doesn.t reach the above bound are the ReedSolomon codes. Because of their good distance properties and the availability of efficient coding and decoding algorithms, Reed-Solomon codes are the most popularly used block codes. Reed-Solomon codes are defined for block symbols with m bits per symbol, where the code length n is related to m by:
n = 2m 1

The number of input symbols k is related to m and the required minimum Hamming distance dmin as:
min

k = 2m d

There appears to be little flexibility in the available code lengths. However, a ReedSolomon code can easily be shortened to any arbitrary length by leaving a number of input bits zero and deleting the same amount of output bits. It is also possible to extend the code length to a power of 2 by adding an extra parity symbol. Using equations (3) and (4), a Reed-Solomon code can correct up to floor ((n-k)/2) erroneous symbols. Each symbol contains m bits, so a maximum amount of m floor((nk)/ 2) erroneous bits may be corrected. If the Reed-Solomon code is designed to correct up to two symbol errors containing 8 bits per symbol, it cannot correct an arbitrary combination of three bit errors, as these errors can occur in three different symbols. This characteristic makes Reed-Solomon codes particularly useful for correcting bursty channels. The OFDM link in presence of fading multipath is a very good application for this code. Another block codes is the Reed-Muller code [6]. Using the Viterbi algorithm with soft decision it results a very attractive implementation. Using the results from [7], the performance of this code can be evaluated and compared with that of convolutional codes. Also, compared with other block codes, Reed.
OFDM for Wireless Networks December, 2000 Page 24 - 40

Muller codes are preferable where soft decision decoding is required since they have a simple structure, resulting in computationally manageable decoding. Figure compares the performance of the (16,11,4) RM code with rate 2/3, memory 2 and 3 punctured convolutional codes [8]. This figure shows that the performance curve of the (16,11,4) RM code falls midway between that of the two convolutional codes.
Figure 12. BER of the (16,11,4) RM code and rate 2/3 memory 2 and 3

5.4. Interleaving Because of the frequency fading of typical radio channels, the OFDM subcarriers

generally have different amplitudes. Deep fades in the spectrum may cause groups of subcarriers to be less reliable than others, thereby causing bit errors to occur in bursts rather than being randomly scattered. Interleaving is applied to randomize the occurrence of bit errors prior to decoding. At the transmitter, the coded bits are permuted in a certain way, which makes sure that adjacent bits are separated by several bits after interleaving. According to the standard, all data bits must be interleaved by a block interleaver with a block size corresponding to the number of bits in a single OFDM symbol, NCBPS. The interleaver is defined by a two-step permutation. The first permutation ensures that adjacent coded bits are mapped onto nonadjacent subcarriers. The second ensures that adjacent coded bits are mapped alternately onto less and more significant bits of the constellation and, thereby, long runs of low reliability (LSB) bits are avoided.
OFDM for Wireless Networks December, 2000 Page 25 - 40

Considering k the index of the coded bit before the first permutation, i the index after the first and before the second permutation and j the index after the second permutation, just prior to modulation mapping, the first permutation is defined by the rule: i = (NCBPS/16) (k mod 16) + floor(k/16) k = 0,1,.,NCBPS.1 The function floor (.) denotes the largest integer not exceeding the parameter. The second permutation is defined by the rule: j = s floor(i/s) + (i + NCBPS.floor(16 i/NCBPS)) mod s i=0,1,. NCBPS.1 The value of s is determined by the number of coded bits per subcarrier, N BPSC, according to: s = max(NBPSC /2,1) The deinterleaver, which performs the inverse relation, is also defined by two permutations. Here, j denotes the index of the original received bit before the first permutation, I the index after the first and before the second permutation, and k the index after the second permutation, just prior to delivering the coded bits to the convolutional (Viterbi) decoder. The first permutation is defined by the rule: i = s floor(j/s) + (j + floor(16 j/NCBPS)) mod s j = 0,1,. NCBPS.1 where s = max(NBPSC /2,1) The second permutation is defined by the rule: k = 16 i .(NCBPS.1)floor(16 i/NCBPS) i = 0,1,. NCBPS.1 Instead of a block interleaver, it is possible to use a convolutional interleaver. Figure 13 shows a convolutional interleaver. The interleaver cyclically writes each input symbol or bit into one of the K shift registers that introduce a delay of 0 to k-1 symbol durations.
OFDM for Wireless Networks December, 2000 Page 26 - 40

Figure 13. Convolutional interleaver

6. Synchronization
Before an OFDM receiver can demodulate the subcarriers, it has to perform at least two synchronization tasks. The first one is to find out where the symbol boundaries are and

what the optimal timing instants are to minimize the effects of intercarrier interference (ICI) and intersymbol interference (ISI). The second task is to estimate and correct the carrier frequcncy offset of the received signal to avoid the ICI. Also, for coherent receivers, the carrier phase has to be synchronized. Further, a coherent QAM receiver needs to detect the amplitudes and phases of all subcarriers to define the decision boundaries for the QAM cosntellation of each subcarrier. Usually, the OFDM received signal has a frequency offset, which immediately results in ICI. This means that the subcarries are not perfectly orthogonal, producing phase noise. Using the cyclic prefix an considering that the first TG seconds of each symbol is identical to the last part, this property can be exploited for both timing and frequency synchronization, using a system showed in figure 14.
Figure 14. Synchronization using the cyclic prefix
T TT TT TT T

Input data Output data T Delay Conjugation

_
TG

dt Maximum

correlation Phase of maximum Frequency offset Timming OFDM signal

OFDM for Wireless Networks December, 2000 Page 27 - 40

=_
TG

xtrtrtTd
0

()()()

This device correlates a TG long part of the signal with a part that is T seconds delayed. The output can be written as [12]: The correlation function produce several peaks, corresponding to the different symbols and the peak amplitudes show a significant variation. The reason for this is that although the average power for a T-seconds interval of each OFDM symbol is constant, the power in the guard time can substantially vary from this average power level. Another effect is the level of the undesired correlation sidelobes between the main correlation peaks. These sidelobes reflect the correlation between two pieces of the OFDM signal that

belong partly or totally to two different OFDM symbols. Because different OFDM symbols contain independent data values, the correlation output is a random variable, which may reach a value that is larger than the desired correlation peak. The standard deviation of the random correlation magnitude is related to the number of independent samples over which the correlation is performed. The larger the number of independent samples, the smaller the standard deviation is. In the extreme case where the correlation is performed over only one sample, the output magnitude is proportional to the signal power, and there is no distinct correlation peak in this case. In the other extreme case where the correlation is performed over a very large number of samples, the ratio of sidelobestopeak amplitude will go to zero. Because the number of independent samples is proportional to the number of subcarriers, the cyclic extension correlation technique is only effective when a large number of subcarriers are used, preferably more than 100. An exception to this is the case where instead of random data symbols, specially designed training symbols are used. In this case, the integration can be done over the entire symbol duration instead of the guard time only. The level of undesired correlation sidelobes can be minimized by a proper selection of the training symbols. The phase of the correlation output is equal to the phase drift between samples that are T seconds apart. Hence, the frequency offset can simply be found as the correlation phase divided by 2 T. This method works up to a maximum absolute frequency offset of half the subcarrier spacing. To increase this maximum range, shorter symbols can be used, or special training symbols with different PN sequences on odd and even subcarrier frequencies to identify a frequency offset of an integer number of subcarrier spacings [13]. The noise performance of the frequency offset estimator is now determined for an input signal r(t) that consists of an OFDM signal s(t) with power P and additive Gaussian noise n(t) with a one-sided noise power spectral density of N0 within the bandwidth of the OFDM signal: r(t) = s(t) + n(t)
OFDM for Wireless Networks December, 2000 Page 28 - 40

The frequency offset estimator multiplies the signal by a delayed and conjugated version of the input to produce an intermediate signal y(t) given by [13] y(t) = r(t)r*(t - T) = |s(t)|2 exp(j) + n(t)s*(t - T)+ n*(t - T)s(t) + n(t)n*(t - T) (5) The first term in the right-hand side of the equation 5 is the desired output component with a phase equal to the phase drift over a T-second interval and a power equal to the squared signal power. The next two terms are products of the signal and the

Gaussian noise. Because the signal and noise are uncorrelated, and because noise samples separated by T seconds are uncorrelated, the power of the two terms is equal to twice the product of signal power and noise power. Finally, the power of the last term is equal to the squared noise power. If the input SNR is much larger than one, the power of the squared noise component becomes negligible compared with the power of the other two noise terms. The frequency offset is estimated by averaging y(t) over an interval equal to the guard time TG and then estimating the phase of y(t). Because the desired output component of ( ) is a constant vector, averaging reduces the noise that is added to this vector. Assuming that the squared noise component may be neglected, the output SNR is approximated as [13]. Figure show the effect of noise on the received phase angle. If we let the amplitude of the transmitted signal be 1, and the length of the noise vector be A with angle f, then the received phase error is err.
Figure 15. Received phasor, showing effect of noise on the received phase angle
00 2

2 / 2N PT PN T SNR P G
0 G

=
OFDM for Wireless Networks December, 2000 Page 29 - 40

Using trigonometry, Since, Therefore, The signal to ratio determines the relative amplitude of the received signal and the noise level. Since the signal is scaled to an amplitude of 1, the amplitude of the noise is: The SNR is base on the amplitudes of the signals thus must be scaled correctly when converting it to dB. Substituting this in, 6.1. Synchronization Using Special Training Symbols The synchronization technique based on the cyclic extension is particularly suited to tracking or to blind synchronization in a circuit-switched connection, where no special training signals are available. For packet transmission, however, there is a drawback because an accurate synchronization needs an averaging over a large (>10) number of OFDM symbols to attain a distinct correlation peak and a reasonable SNR. For high-rate packet transmission, the synchronization time needs to be as short as possible, preferably a few OFDM symbols only. To achieve this, special OFDM t symbols can be used for which

the data content is known to the receiver. In this, the entire received training signal can be used to achieve synchronization, whereas the cyclic extension method only uses a fraction of each symbol.
OFDM for Wireless Networks December, 2000 Page 30 - 40

13 / 6 13 / 6

According to [5], the physical layer convergence procedure (PLCP) preamble field is used for synchronization. It consists of 10 short symbols and 2 long symbols, shown in figure 16.
Figure 16. OFDM training structure

Figure 16 shows the OFDM training structure (PLCP preamble), where t1 to t10 denote short training symbols and T1 and T2 denote long training symbols. The PLCP preamble is followed by the SIGNAL field and DATA. The total training length is 16 s. The dashed boundaries in the figure denote repetitions due to the periodicity of the inverse Fourier transform. A short OFDM training symbol consists of 12 subcarriers, which are modulated by the elements of the sequence S, given by S.26, 26 = {0, 0, 1+j, 0, 0, 0, .1.j, 0, 0, 0, 1+j, 0, 0, 0, .1.j, 0, 0, 0, .1.j, 0, 0, 0, 1+j, 0, 0, 0, 0,0, 0, 0, .1.j, 0, 0, 0, .1.j, 0, 0, 0, 1+j, 0, 0, 0, 1+j, 0, 0, 0, 1+j, 0, 0, 0, 1+j, 0,0} The multiplication by a factor of is in order to normalize the average power of the resulting OFDM symbol, which utilizes 12 out of 52 subcarriers. The fact that only spectral lines of S.26:26 with indices that are a multiple of 4 have nonzero amplitude results in a periodicity of TFFT/4 = 0.8 s. The interval TSHORT is equal to ten 0.8 s periods (i.e., 8 s). A long OFDM training symbol consists of 53 subcarriers plus a zero value at dc, which are modulated by the elements of the sequence L, given by L.26, 26 = {1, 1, .1, .1, 1, 1, .1, 1, .1, 1, 1, 1, 1, 1, 1, .1, .1, 1, 1, .1, 1, .1, 1, 1, 1, 1, 0,1, .1, .1, 1, 1, .1, 1, .1, 1, .1, .1, .1, .1, .1, 1, 1, .1, .1, 1, .1, 1, .1, 1, 1, 1, 1} This information is used, also, to estimate the channel and to improve the system performance.
OFDM for Wireless Networks December, 2000 Page 31 - 40

The receiver has to perform two operations: tracking and acquisition. During the tracking mode, only small frequency fluctuations have to be corrected. But during the acquisition mode the frequency offset can take large values. This is the most challenging task to be managed by the synchronizer structure. During the tracking mode, it can be assume that the remaining frequency offset is substantially smaller than F/2, with F representing the subcarrier frequency spacing (for

the standard, F=0.3125MHz [5]). Considering just one subchannel, then this frequency synchronization problem is similar to that in case of single carrier problem. Therefore, frequency estimation algorithms derived from maximum likelihood theory [14] can be used in this case. The underlying principle of these frequency algorithms is that the frequency estimation problem can be reduced to a phase estimation problem by considering the phase shift between two subsequent subchannel samples. The influence of the modulation is removed in this case by the multiplication with the conjugate complex value of the transmitted symbols. The known symbols are taken from the training sequence and the pilot signals. In several papers, like [14], it is recommended to transmit the synchronization sequences in different channels instead of doing it over a single subchannel. They should be spread uniformly over the whole frequency domain. That is how the standard implement it. Equations (6) and (7) in [15] describe generalized estimators for this cases. In order to avoid a large decoder performance degradation due to the crosstalk, it may be necessary to correct the frequency offset prior to the demodulation even during the tracking mode. It is theoretically possible to correct a small offset on the subchannel level. In practice, this will depend on the maximum magnitude of the frequency offset which the tracking unit has to cope with. The acquisition process should be performed fast. The special acquisition training preamble should be avoided to increase the transmission efficiency. Using the same training synchronization symbols from the pilot signals, the operation involved in acquiring an initial frequency offset estimate coincides with the search operation for the training symbols transmitted on the channel. The acquisition rule is based on the fact that the magnitude of the correlation function reaches a maximum, if the estimation coincides with the received frequency. The acquisition time is directly proportional to the frequency range to be scanned, depending of the expected Doppler value in the channel.

7. Detection
In an OFDM link, the data bits are modulated on the subcarriers by some form of phase shift keying (PSK) or quadrature amplitude modulation (QAM). To estimate the bits
OFDM for Wireless Networks December, 2000 Page 32 - 40

at the receiver, knowledge is required about the reference phase and amplitude of the constellation on each subcarrier. In general, the constellation of each subcarrier shows a random phase shift and amplitude change, caused by carrier frequency offset, timing offset, and frequency selective fading. To cope with these unknown phase and amplitude

variations, two different approaches exist. The first one is coherent detection, which uses estimates of the reference amplitudes and phases to determine the best possible decision boundaries for the constellation of each subcarrier. The main issue with coherent detection is how to find the reference values without introducing too much training overhead. The second approach is differential detection, which does not use absolute reference values, but only looks at the phase and/or amplitude differences between two QAM values. Differential detection can be done both in the time domain and in the frequency domain. In the first case, each subcarrier is compared with the subcarrier of the previous OFDM symbol. In the case of differential detection in the frequency domain, each subcarrier is compared with the adjacent subcarrier within the same OFDM symbol. Differential detection implies differential encoding, which is not the case for the standard IEEE 802.11a, so this last case will not be treated. 7.1. Coherent Detection Figure 17 shows a block diagram of a coherent OFDM receiver. After downconversion and analog-to-digital conversion, the fast Fourier transform (FFT) is used to demodulate the N subcarriers of the OFDM signal. For each symbol, the FFT output contains N QAM values. However, these values contain random phase shifts and amplitude variations caused by the channel response, local oscillator drift, and timing offset. It is the task of the channel estimation block to learn the reference phases and amplitudes for all subcarriers, such that the QAM symbols can be converted to binary soft decisions.
Figure 17. Block diagram of an OFDM receiver with coherent detection.
RF Receiver ADC FFT Channel Estimation Coherent Detection Deinterleaving Decoding Binary Output Data

OFDM for Wireless Networks December, 2000 Page 33 - 40

7.2. Two-Dimensional Channel Estimators In general, radio channels are fading both in time and in frequency. Hence, a channel estimator has to estimate time-varying amplitudes and phases of all subcarriers. One way to do this is to use a two-dimensional channel estimator that estimates the reference values based on a few known pilot values. In this case, the signal has 4 subcarriers containing known pilot values to allow the estimation. To be able to interpolate the channel estimates both in time and frequency from the available pilots, the pilot spacing has to fulfill the Nyquist sampling theorem, which states that the sampling interval must be smaller than the inverse of the double-sided bandwidth

of the sampled signal. For the case of OFDM, this means that there exist both a minimum subcarrier spacing and a minimum symbol spacing between pilots. By choosing the pilot spacing much smaller than these minimum requirements, a good channel estimation can be made with a relatively easy algorithm. The more pilots are used, however, the smaller the effective SNR, becomes that is available for data symbols. Hence, the pilot density is a tradeoff between channel estimation performance and SNR loss. To determine the minimum pilot spacing in time and frequency, we need to find the bandwidth of the channel variation in time and frequency, These bandwidths are equal to the Doppler spread Bd in the time domain and the maximum delay spread max in the frequency domain [16]. Hence, the requirements for the pilot spacings in time and frequency st and sf are st < 1/Bd sf < 1/max These simple equations can be used to estimate the theoretical limits of the standard. In terms of Doppler effect, and considering that the pilot signals are transmitted continuously, the pilot spacing in time is not a limitation. The spacing in frequency imposes a limit to the delay spread. The pilots signal are the subcarriers number .21, -7, 7 and 21. This means that the spacing between pilots is 4.375MHz (210.3125MHz). In these conditions, the maximum delay spread max to be allowed is 228 nsec. In the literature, there are several algorithms to estimate the OFDM channel the one explained in [12]. This algorithm can be used with the parameters indicated by the standard. According to it, the pilots must be BPSK modulated by a pseudo binary sequence to prevent the generation of spectral lines.The contribution of the pilot subcarriers for the nth OFDM symbol is produced by Fourier transform of sequence P, given by:
OFDM for Wireless Networks December, 2000 Page 34 - 40

P .26, 26 = {0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0,0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, .1, 0, 0, 0, 0, 0} The polarity of the pilot subcarriers is controlled by the sequence, p n , which is a cyclic extension of the 127 elements sequence and is given by: p 0..126 = {1,1,1,1, -1, -1, -1,1, -1, -1, -1, -1, 1,1, -1,1, -1, -1,1,1, -1,1,1, -1, 1,1,1,1, 1,1, -1,1,1,1, -1,1, 1, -1, -1,1, 1,1, -1,1, -1, -1, -1,1, -1,1,-1,-1, 1,-1,-1,1, 1,1,1,1, -1,-1,1,1,-1,-1,1,-1, 1,-1,1,1, -1,-1,-1,1, 1,-1,-1,-1, -1,1,-1,-1, 1,-1,1,1, 1,1,-1,1, -1,1,-1,1,-1,-1,-1,-1, -1,1,-1,1, 1,-1,1,-1, 1,1,1,-1, -1,1,-1,-1, -1,1,1,1, -1,-1,-1,-1, -1,-1,-1} The sequence, pn can be generated by the scrambler defined by figure 18 when the .all ones. initial state is used, and by replacing all .1.s. with .1 and all .0.s. with 1. Each sequence element is used for one OFDM symbol.
Figure 18. Pilot Sequence Generator

The subcarrier frequency allocation is shown in figure 19.

Figure 19. Pilot Frequency Allocation

The channel estimation techniques were designed to estimate a channel that varied both in time and frequency. These techniques are especially suitable for continuous transmission systems such as Digital Audio Broadcasting or Digital Video Broadcasting. Even thought the standard has previsions about it, according to [12] they are not very suited for packet-type communications for two reasons. First, in many packet transmission
X7 X6 X5 X4 X3 X2 X1 Output Sequence

OFDM for Wireless Networks December, 2000 Page 35 - 40

systems, such as wireless LAN, the packet length is short enough to assume a constant channel during the length of the packet. This means there is no need to estimate time fading, which greatly simplifies the channel estimation problem. Second, using pilots scattered over several OFDM data symbols introduces a delay of several symbols before the first channel estimates can be calculated. Such a delay is undesirable in packet transmission like in an IEEE 802.11 wireless LAN, which requires an acknowledgment to be sent after each packet transmission. Any delay in the reception of a packet will also delay the acknowledgment and hence decrease the effective throughput of the system. An additional disadvantage is the fact that the receiver needs to buffer several OFDM symbols, thereby requiring extra hardware. However, for applications with fading and strong Doppler effect, like the signal between a car in movement and an access point in a highway, the pilot signals can be useful to improve the BER. The mobile channel introduces multipath distortion of the signaling waveforms. Both the amplitude and phase are corrupted and the channel characteristics changes because of movements of the mobile. In order to perform coherent detection, reliable channel estimates are required. The problem is to decide where and how often to insert pilot symbols. The spacing between the pilot symbols shall be chosen small enough to enable reliable channel estimates but large enough not to increase the overhead too much. In a multicarrier system there exist a unique opportunity to determine various parts of the channel impulse response, as opposed to a single-carrier system. It is no use to make efforts to determine already known parts. The impact of pilot signals was analyzed by Cavers in [17] who made an exhaustive theoretical analysis for single-carrier systems. He pointed out that it was appropriate that 14% of the sent symbols were pilot symbols to be able to handle large Doppler values (fdTS = 0.05). This means that the standard pilot scheme could be redundant. Also, Tufvesson and Maseng [18] analyzed several pilot patterns specifically for OFDM systems, arriving to the conclusion that the ability to estimate the channel reliably when it changes due to e.g. vehicle movements is highly dependent on the pilot pattern used. By rearranging the pilot pattern it is, in some cases,

possible to handle 10 times higher Doppler frequency and to reduce the number of needed pilot symbols the same amount, still retaining the same bit error rate. Alternatively the new pilot pattern could be used to reduce the bit error rate up to a factor 5, even more in a low noise environment. For a given propagation environment, e.g. a base station site, it is possible to precalculate a suitable pilot pattern. This is an alternative to improve the standard performance in terms of throughput.
OFDM for Wireless Networks December, 2000 Page 36 - 40

8. Simulation
The OFDM system was modeled using Matlab to allow various parameters of the system to be varied and tested, including those established by the standard. The aim of doing the simulations was to measure the performance of OFDM under different channel conditions, and to allow for different OFDM configurations to be tested.
Figure 20. Simulink OFDM Model OFDM Modulation and Demodulation
Run "Parameters.m" in the Workspace before simulation The following routines must be included in the path: zeropad.m zerounpad.m cyclic.m remcyc.m Run "constellation" after the simulation in the workspace to watch the received constellation

Transmitter Channel Receiver


MATLAB Function zeropad blackman Window Function Viterbi Decoder Viterbi Decoder Time Scope1 Time Scope Serial to parallel1 Serial to parallel MATLAB Function Remove zeros MATLAB Function Remove cyclic Extension1 Random int Random Data Generator Pulse Pulse Parallel to serial MATLAB Function PIlot Insertion A-QASK Mapping Block interleave Interleave1 Block interleave Interleave
Out1 In1

IQ Modulator Subsystem
Out1 In1

IQ Demodulator Subsystem IFFT IFFT FFT

FFT 0 Errors
In1 In2 Out1

Error counter Subsystem


In1 Out1

Delay Subsystem Convolutional Encoder Convolutional Encoder simout1 Constellation To Workspace Re(u) Complex to Real Im(u) Complex to Imag Complex Rayl Fading Complex Rayleigh fading Commutator2 C1 1 C1 B-FFT Buffered FFT Frame Scope Complex Rician Noise Additive Rician Noise MATLAB Function Add cyclic Extension AWGN AWGN channel A-QASK A-map QASK demod baseband

OFDM for Wireless Networks December, 2000 Page 37 - 40

Using MatLab 5.3 and Simulink the OFDM transceiver was simulated. The model can be seen in figure 20 The simulation includes all the stages for transceiver and receiver, according to the standard. Because of the Matlab sampling time, the transmission was implemented in baseband to avoid long periods of simulation. Considering Gaussian and Rician noise and Rayleigh fading effect, a good approximation to the real performance can be observed, over all in the degradation of the BER. Also the channel effect can be observed in the output constellation before the Viterbi decoder using some routines in MatLab. In figure 21, the OFDM signal spectrum can be seen corresponding to a random binary input sequence. According to the standard, the 0 (DC) input must be set to zero in the IFFT implementation. This can be clearly observed at frequency zero on the graph.
-8 -6 -4 -2 0 2 4 6 8 -10 -5 0 5 10 15 20 25 30 35

Frequency (MHz) M a g n it u d e , d B

Figure 21. OFDM Signal Spectrum


OFDM for Wireless Networks December, 2000 Page 38 - 40

Using the 64QAM modulation scheme, the signal is observed after the QAM demodulator with the distortion due to the noise. The effect of the channel over the constellation can be seen in the figure 22.
Figure 22. Received constellation for a given amout of noise

Conclusion
The IEEE802.11 Standard Committee has developed the standard for wireless networks with a 5GHz PHY layer and OFDM modulation. But, the industry has not offered any interface yet. Many companies are still researching and developing, over all for the receiver, which is the key part of the system. The standard does not give rules about it and its implementation is up to the designer. According to the simulations, OFDM appears to be a good modulation technique for high performance wireless telecommunications. Some factors were not tested here like peak power clipping, start time error and the effect of frequency stability errors. The codification used for the system could be improved, just using block codes instead of convolutional codes, but this fact would be out of the standard. Also, the pilot signal distribution could be modified to reduce the redundancy.
OFDM for Wireless Networks December, 2000 Page 39 - 40

The use of channel estimation is a very interesting function to be added to the receiver to make the system more resistant to fading and Doppler effects, over all, if it is going to be used aboard of cars in a highway. Wireless LAN is a very important application for OFDM and the development of the standard promises to have not only a big market but also application in many different environments.

Acknowledgements
I would like to thank my supervisor Dr Ronald Iltis for the suggestions, ideas and advise. I would also like to thank Mr Ramez Gerges and Prof Stephen Pope for their support, and talks to keep me on track. Finally, I would like to thank my family for the patience during the writing of this project.

References

[1] Chang R. W., Synthesis of Band Limited Orthogonal Signals for Multichannel Data Transmission, Bell Syst. Tech. J., Vol. 45, pp. 1775-1796, Dec. 1996. [2] Salzberg, B. R., Performance of an efficient parallel data transmission system,

IEEE Trans. Comm., Vol. COM- 15, pp. 805 - 813, Dec. 1967. [3] Mosier, R. R., and Clabaugh, R.G., A Bandwidth Efficient Binary Transmission System, IEEE Trans., Vol. 76, pp. 723 - 728, Jan. 1958. [4] Orthogonal Frequency Division Multiplexing, U.S. Patent No. 3, 488,4555, filed November 14, 1966, issued Jan. 6, 1970. [5] Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specification, IEEE Standard, Supplement to Standard 802 Part 11: Wireless LAN, New York, NY, 1999. [6] F.J. MacWilliams and N.J.A. Sloane, The Theory of Error Correcting Codes, New York: North-Holland, 1977.
OFDM for Wireless Networks December, 2000 Page 40 - 40

[7] Firmanto, W. T. and Gulliver.T. A. , Code Combining of ReedMuller Codes in An Indoor Wireless Environment, Wireless Personal Communications 6: 359 371, 1997, Kluwer Academic Publishers, Netherlands. [8] Haccoun D. and Begin G., High-rate punctured convolutional codes for Viterbi and sequential decoding, IEEE Trans. Commun., Vol. 37, pp. 11131125, 1989. [9] Beach M., Propagation and System Aspects, University of Bristol, Future Communication Systems course, April 1994. [10] Tipler P., Physics for Scientists and Engineers, 3rd Edition, Worth Publishers, pp. 464-468, 1991. [11] Rappaport, T.S., Wireless Communications Principles and Practice, IEEE Press, New York, Prentice Hall, pp. 169-177, 1996. [12] Van Nee, R., Prasad R., OFDM for wireless Multimedia Communications, Artech House, Boston, pp 80-81, 2000. [13] Schmidl, T. M., and Cox, D.C., Robust Frequency and timing Synchronization on OFDM, IEEE Trans. on Comm., Vol. 45, No. 12, pp. 1613-1621, Dec. 1997.

[14] Classen, F, Meyr, H, Sehier, P, Maximum likelihood open loop carrier synchronizer for digital radio, Proceeding ICC'93, pages 493-497, 1993 [15] Classen, F, Meyr, H, Frequency synchronization algorithms for OFDM systems suitable for communications over frequency selective fading channels, Proceedings of IEEE Vehicular Technology Conference (VTC), IEEE, 1994. pp.1655-9. [16] Proakis, J. G., Digital Communications, Prentice Hall, 3rd edition, 1995. [17] Cavers, J. K., An Analysis of Pilot Symbol Assisted Modulation for Rayleigh Fading Channels, IEEE Trans. Vehic. Tech, vol 40, no 4, pp 686-693, nov 1991. [18] Tufvesson, F, Maseng, T, Pilot assisted channel estimation for OFDM in mobile cellular systems, IEEE 47th Vehicular Technology Conference Technology in Motion, Phoenix, AZ, USA, 4-7 May 1997, vol.3 pp.1639-43,1997.

Peak Power Reduction in Orthogonal Frequency Division Multiplexing Transmitters


BY

Gavin Hill Thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy
Victoria University of Technology School of Communications & Informatics
March 2011

Abstract

Orthogonal Frequency Division Multiplexing (OFDM) is a digital transmission method developed to meet the increasing demand for higher data rates in communications which can be used in both wired and wireless environments. This thesis describes the issue of the Peak to Average Power Ratio (PAPR) in OFDM which is a major drawback, and presents new and variations to existing algorithms to reduce it. Initially the theoretical principles behind OFDM are discussed elaborating on the advantages and disadvantages of OFDM. This is followed by analysis of the PAPR in OFDM where it is shown through theoretical analysis and simulation that the occurrence of large peaks in OFDM is actually quite rare. The effect on system performance in terms of the Bit Error Rate (BER) and Power Spectral Density (PSD) is simulated for an OFDM transceiver with a saturated High Power Amplifier. This is followed by a study of published PAPR reduction methods The first contribution is a low complexity variation of Partial Transmit Sequences (PTS). In PTS several alternate transmit signals are seeded from the same source, each alternate transmit signal has a reversible and different phase rotation performed on the data. The transmit signal with the lowest PAPR is chosen for transmission. In novel variations, called Cyclic Shifted Sequences (CSS) and Time Inversion (TI), different shifts of the data are performed which avoid the need for complex multiplications. In certain cases a whole IFFT operation can be removed with a negligible effect on performance when CSS is combined with PTS. Furthermore it is shown that the peak regrowth of TI and CSS after pulse shaping filtering is considerably less than for PTS. Next, new clipping techniques are presented which reduce substantially the complexity of clipping algorithms by using novel methods to calculate the magnitude, avoiding the use of multiplications. One method, called Sector clipping uses a rule base to clip the signal, dividing the clipping region into a series of sectors. When the rule base is expanded to include more sectors the performance is shown to approach ii more complex existing clipping methods. This algorithm is implemented in silicon in a 3 metal layer 0.5 process. Another clipping scheme called Vector Subtraction is a variation of another low complexity magnitude estimate method which further reduces complexity by alleviating the need for a scaling operation. The performance of the new methods was ascertained through simulation of a whole OFDM transceiver chain and shown to have relative BERs. Finally Vector Subtraction was implemented in a previously proposed clip and filter algorithm where its low latency and accuracy proved it to be suitable for the algorithm. iii

Acknowledgements
First of all I wish to thank Professor Mike Faulkner for his guidance, patience, advice, and time thorough my many years of research. I owe the completion of this thesis to his efforts. I also wish to thank Dr Reza Berangi for his mathematical and simulation knowledge and his willingness to share it. Special thanks goes to the post graduate studies committee for the numerous second chances to complete my research and to Shirley

Herewyn for her omnipotent knowledge on dealing with university bureaucracy. I would also like to acknowledge my research colleagues for the discussions on everything from communications theory to ancient history. In particular I wish to thank Abdi Waheed Mohammed, Tuan Nguyen, Andrew Mancuso, Trung Nguyen, Olivia Hu (who never really saw the significance of being Dr Who, even after I brought in pictures of Tom Baker and the Tardis) and Melvyn Pereira for their company and generous natures. Finally I wish to thank my parents and my brother for their support, both financially and emotionally, and my best friends Dan, Spencer, and Mark for the good times and social outlet. Gavin Hill 7 February 2006 v

Contents
Abstract .......................................................................................................................... i Acknowledgements ..................................................................................................... iii Contents ........................................................................................................................ v List of Tables .............................................................................................................. vii List of Figures ............................................................................................................ viii Acronyms and Symbols ............................................................................................ xvi Chapter 1 Introduction ................................................................................................ 1 1.1 Structure of thesis .............................................................................................. 2 1.1.1 Contributions ................................................................................................. 4 Chapter 2 Orthogonal Frequency Division Multiplexing ........................................ 6 2.1 History of multicarrier networks .................................................................... 6 2.2 Multicarrier principle ........................................................................................ 8 2.3 OFDM implementation of multicarrier modulation ...................................... 8 2.3.1 Use of Fourier Transform for modulation and demodulation ..................... 11 2.3.2 Orthogonality in OFDM ............................................................................. 13 2.4 OFDM transmission over time varying channels ......................................... 14 2.4.1 Multipath propagation ................................................................................. 15 2.4.3 Frequency selective fading ......................................................................... 17 2.4.4 Equalization ................................................................................................ 18 2.5 Limitations in OFDM ...................................................................................... 18 2.5.1 Synchronization .......................................................................................... 19
2.5.1.1 Timing errors ................................................................................................................ 19 2.5.1.2 Carrier phase noise ....................................................................................................... 20 2.5.1.3 Frequency errors .......................................................................................................... 20

2.5.2 Non linearities ............................................................................................. 22 2.6 Applications of OFDM .................................................................................... 22 2.6.1 COFDM ...................................................................................................... 22 2.6.2 Digital Audio Broadcasting ........................................................................ 23 2.6.3 Digital Video Broadcasting ......................................................................... 24 2.6.4 HiperLan2/802.11a ..................................................................................... 24 2.6.5 ADSL .......................................................................................................... 25

2.6.6 MIMO OFDM ............................................................................................. 26 2.7 Conclusion ........................................................................................................ 26 Chapter 3 Peak to Average Power in OFDM .......................................................... 28 3.1 Peak to Average Power Ratio ......................................................................... 29 3.2 Statistical distribution of OFDM samples ..................................................... 31 3.3 Oversampling discrete OFDM symbols to find true (continuous) peaks.... 35 3.4 Effect of Non Linearity on OFDM ................................................................. 45 3.4.1 Description of memoryless Non Linearity .................................................. 45 3.4.2 Impact on Power Spectral Density .............................................................. 50 3.4.3 Impact on Bit Error Rate ............................................................................. 51 3.5 Conclusion ........................................................................................................ 53 Chapter 4 Peak to Average Power Solutions -Distortionless Techniques ............ 55 4.1 Coding techniques ............................................................................................ 56 4.1.1 Block Codes ................................................................................................ 57 4.1.2 Bounds on PAPR ........................................................................................ 61 vi 4.1.3 Cyclic Codes ............................................................................................... 61 4.1.4 Shapiro-Rudin codes ................................................................................... 62 4.1.5 Golay complementary codes ....................................................................... 63 4.1.6 Reed-Muller Codes ..................................................................................... 63 4.2 Multiple Signal Representation ...................................................................... 66 4.2.1 Partial Transmit Signals .............................................................................. 66 4.2.2 Oversampling PTS ...................................................................................... 72 4.2.3 Selective Mapping ...................................................................................... 77 4.3 Tone Reservation/Injection ............................................................................. 83 4.3.1 Tone Reservation ........................................................................................ 83 4.3.2 Tone Injection ............................................................................................. 87 4.4 Conclusion ........................................................................................................ 89 Chapter 5 Cyclic Shifted Sequences and Time Inversion of PTS .......................... 92 5.1 PTS subblock creation ..................................................................................... 92 5.2 New techniques for PTS subblock creation ................................................... 93 5.2.1 Cyclic Shifted Sequences ............................................................................ 93 5.2.2 PTS with CSS ............................................................................................. 96 5.2.3 Time Inversion ............................................................................................ 98 5.3 Filtering new techniques .................................................................................. 99 5.4 Oversampling new techniques ...................................................................... 104 5.5 Complexity evaluation ................................................................................... 108 5.6 Conclusion ...................................................................................................... 109 Chapter 6 Peak to Average Power Solutions -Distorted Techniques .................. 111 6.1 Clipping in the Baseband .............................................................................. 112 6.1.1 Quantisation and Clipping ........................................................................ 121 6.2 Amplifier non linearities ................................................................................ 124 6.3 Windowing ...................................................................................................... 128 6.4 Conclusion ...................................................................................................... 130 Chapter 7 Reduced Complexity Clipping Algorithms ......................................... 132 7.1 Conventional clipping .................................................................................... 134

7.2 New Sector Clipping method ........................................................................ 143 7.2.1 Theoretical Analysis of Clipping Techniques ........................................... 143
7.2.1.1 SNR Analysis ............................................................................................................... 144 7.2.1.2 Conventional clipping ................................................................................................. 146 7.2.1.3 Sector Clipping ........................................................................................................... 149 7.2.1.4 Square Clipping .......................................................................................................... 158 7.2.1.5 Theoretical Results ..................................................................................................... 158

7.2.2 Extensions of Sector Clipping .................................................................. 160 7.2.3 Hardware Implementation ........................................................................ 162 7.3 New Vector Subtraction clipping method .................................................... 167 7.3.1 Lucent Algorithm ...................................................................................... 167 7.3.2 Vector Subtraction .................................................................................... 168 7.4 Comparison of new and existing clipping methods .................................... 173 7.4.1 BERF ......................................................................................................... 173 7.4.2 PSD results ................................................................................................ 184 7.5 New adaptive clipping method ...................................................................... 187 7.6 Conclusion ...................................................................................................... 196 Chapter 8 Conclusion .............................................................................................. 198 8.1 Future Work ................................................................................................... 200 Bibliography ............................................................................................................. 201 vii

List of Tables
2.1: DAB parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 2.2: DVB system parameters for 2K mode . . . . . . . . . . . . . . . . . . . . . 24 2.3: HIPERLAN2 parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 2.4: Data rates for HIPERLAN2 . . . . . . . . . . . . . . . . . . . . . . . . . . .25 4.1: Number of complex multiplications and magnitude operations required . . . 76 5.1: Complexity comparison of various techniques (adjacent partitioning) . . . . 109 7.1: Truth Table for 3 Sector clipping . . . . . . . . . . . . . . . . . . . . . . . .155 7.2: Synopsys reports for Square and 3 sector clipping. . . . . . . . . . . . . . 164 7.3: Cadence area utilization report on Square clipping . . . . . . . . . . . . . .165 7.4: Cadence area utilization report on 3 Sector clipping . . . . . . . . . . . . 166 7.5: Baseband clip level required to maintain a BER=10-4 at varying IBO in HPA. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183 viii

List of Figures
2.1: Block diagram of a basic multicarrier system . . . . . . . . . . . . . . . . . . . 9 2.2: Basic OFDM transmitter and receiver. Occupied frequency band shown in between. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.3: Basic OFDM transmitter and receiver pair utilizing Fourier transform. . . . . . 12 2.4: Frequency spectrum of 5 orthogonal subcarriers of an OFDM transmit signal . 14 2.5: OFDM symbol a) without cyclic prefix, and b) with cyclic prefix. . . . . . . . 16 2.6: Time and frequency properties of single carrier and OFDM techniques. . . . . 19 2.7: Effects of frequency offset F: reduction of signal amplitude (star), and ICI (circle). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

3.1: Simulated envelope for OFDM system (N=64) normalized by average power. 33 3.2: Simulated (solid line) and theoretical (3.13, dashed line) OFDM symbol CCDF for N=32, 64, 128, and 256 subcarriers. QPSK, 30000 runs. . . . . . . . . . 33 3.3: Simulated OFDM sample CCDF for N=32, 64, 128, and 256 subcarriers. QPSK, 30000 runs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 3.4: Simulated OFDM CCDF for M=4, 16, and 64 constellation mapping. N=64, 30000 runs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 3.5: Simulated OFDM symbol with no oversampling (dashed) with its oversampled version (solid) overlaid on top. The solid circle represents the 6dB level with respect to the average power. N=64, oversampling factors are 1 and 8. . . . 36 3.6: Simulated OFDM CCDF for oversampling rates of 1, 2, 4, and 8. N=64, QPSK, 15000 runs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 3.7: Theoretical OFDM CCDF from (3.15) and (3.21) for N=64, 512 with simulated results: QPSK, oversampling factor rate of 16, 20000 runs, N=64, 512. . . . 40 3.8: Theoretical OFDM CCDF from (3.22) for N=64, 512 with simulated results: QPSK, oversampling factor rate of 16, 20000 runs, N=64, 512. . . . . . . . . 41 3.9: Simulated OFDM CCDF QPSK, 64 point IFFT, 256 tap RRCF, _=0.15. Clipped at 3dB after IFFT, then filtered (solid line). No clipping, then filtered (dashed line). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 3.10: Zero padding of the IFFT, null carriers are set in the middle of the input. . . 44 3.11: Simulated OFDM CCDF after IFFT (os=1 and 2) and after filtering (os=1 and 2). QPSK, N=64, 256 tap RRCF, _=0.15, 15000 runs. . . . . . . . . . 44 ix 3.12: AM/AM properties of a Soft Limiter (SL). . . . . . . . . . . . . . . . . . . .47 3.13: AM/AM properties of a Traveling Wave Tube Amplifier (TWTA). . . . . . .48 3.14: AM/AM properties of a Solid State Amplifier (SSPA) for different values of P, and ideal amplifier. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 3.15: PSD of 64 subcarrier OFDM signal with 64 point IFFT, RRCF with excess bandwidth of 0.15. A SSPA with P=3 and various backoffs is used. . . . . 51 3.16: Signal constellation at the output of the HPA, P=3 after non linear amplification a) 4 QAM, 6dB IBO; b) 4 QAM, 0dB IBO; c) 16 QAM 6dB IBO; and d) 16 QAM 0dB IBO. . . . . . . . . . . . . . . . . . . . . . . . . 52 3.17: BER of 64 subcarrier OFDM signal with 64 point IFFT, RRCF with excess bandwidth of 0.15 for 4, 16, and 64 QAM constellations. A SSPA with P=3 and various backoffs is used. . . . . . . . . . . . . . . . . . . . . . . . 53 4.1: Block diagram of OFDM transmitter showing PAPR coding. . . . . . . . . . 56 4.2: Block diagram of PAPR reduction using the PTS approach. . . . . . . . . . . 68 4.3: An example of the 3 main PTS structures: (a) Interleaved (b) Adjacent (c) Pseudo-random. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 4.4: Simulated CCDF for PTS-OFDM with V=2 and varying W. N=64, adjacent subblock partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .71 4.5: Simulated CCDF for PTS-OFDM with W=4 and varying V. N=64, adjacent subblock partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .71 4.6: Generation of subblocks for PTS using Concatenated Pseudo Random Sub Block Partition Scheme (CPR SPS). . . . . . . . . . . . . . . . . . . . . . . 73 4.7: Block diagram of OLS-PTS transmitter. . . . . . . . . . . . . . . . . . . . . 76

4.8: Block diagram of an SLM OFDM transmitter. . . . . . . . . . . . . . . . . . 78 4.9: Simulated CCDF for SLM-OFDM for varying values of U. N=64, os=1. . . .79 4.10: Simulated CCDF for SLM-OFDM. U=1, 3, and 8. Oversampling rates are 1, 2, and 4, and 8. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 4.11: SLM transmitter block diagram employing technique to avoid explicit transmission of side information, a) serial form, b) parallel form. . . . . . . 81 4.12: Scrambler polynomial for new SLM technique. . . . . . . . . . . . . . . . . 81 4.13: a) Receiver structure of the proposed SLM system, b) Descrambler polynomial at the receiver. . . . . . . . . . . . . . . . . . . . . . . . . . . 82 4.14: Block diagram of a Tone Reservation (TR) OFDM transceiver. . . . . . . . .84 4.15: Block diagram of a Tone Injection (TI) OFDM transceiver. . . . . . . . . . 87 x 4.16: Example of possible expansions of constellation in TI for 16 QAM. . . . . .88 5.1: Block diagram showing the simulation model of Section 5.2. . . . . . . . . .93 5.2: Block diagram of PTS, CSS and TI transceiver. . . . . . . . . . . . . . . . .95 5.3: Simulated CCDF for PTS (V=2, W=8; V=4, W=4; V=4, W=8), CSS (V=2, S=8; V=4, S=4; V=4, S=8) and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT, (os=1). . . . . . . . . . . . . . . 96 5.4: Simulated CCDF for PTS (V=2, W=8; V=3, W=4), PTS/CSS (V=2, W=4, S=2; V=2, W=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT, (os=1). . . . . . . . . . . . . . . . . . . . . .98 5.5: Simulated CCDF for PTS (V=2, W=8), PTS/TI (V=2, W=4, S=2) and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in the IFFT, (os=1). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .99 5.6: Block diagram showing the simulation model of Section 5.2. . . . . . . . . 100 5.7: Simulated CCDF of discrete and filtered PTS (V=4, W=4), CSS (V=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 5.8: Simulated CCDF of discrete and filtered PTS (V=4, W=8), CSS (V=4, S=8), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 5.9: Simulated CCDF of discrete and filtered PTS (V=2, W=8), PTS/CSS (V=2, W=4, S=2), and PTS/TI (V=2, W=4, S=2), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15). . . . . . . . . . . . . . . . . 103 5.10: Simulated CCDF of discrete and filtered PTS (V=3, W=4), PTS/CSS (V=2, W=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in the IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 5.11: Simulated CCDF for PTS/CSS (V=2, W=4, S=4), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and oversampled (solid) filtered curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, 4, and 8, interpolated by 8, and filtered with RCF (_=0.15). . . . . . . . . . .105

xi 5.12: Simulated CCDF for PTS (V=3, W=4), and Uncoded OFDM. Discrete oversampled (dashed) curves move from left to right and oversampled (solid) filtered curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, 4, and 8, interpolated by 8, and filtered with RCF (_=0.15). . . . . . . . . . . . . . 106 5.13: Simulated CCDF for PTS (V=2, W=8), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15). . . . . . . . . . . . . . . . . . . . . . . . 107 5.14: Simulated CCDF for PTS/CSS (V=2, W=4, S=2), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15). . . . . . . . . . . . . 107 5.15: Simulated CCDF for PTS/TI (V=2, W=4, S=2), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15). . . . . . . . . . . . . 108 6.1: Analytical symbol error probability from (6.10) and (6.12) for 64 QAM and N=64. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 6.2: Probability of clipping DMT signal as a function of for p=1,2,3. N=64 subcarriers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 7.1: Average noise in the channel vs. BER for 4, 16, and 64 QAM . . . . . . . 133 7.2: IQ diagram showing conventional clipping region. . . . . . . . . . . . . . 134 7.3: Block diagram of simulation model used for clipping models. . . . . . . . . 135 7.4: Baseband clip level vs the BERF for varying RRCF parameters. 64 QAM symbols, 64 point IFFT, LPA, no channel impairments. AWGN=0 . 136 7.5a: Demapped constellation, M=64, with no clipping or channel impairments. 64 filter taps in RRCF, _=0.15. . . . . . . . . . . . . . . . . 137 7.5b: Demapped constellation, M=64, with no clipping or channel impairments. 128 filter taps in RRCF, _=0.15. . . . . . . . . . . . . . . . .137 7.6: Baseband clip level vs the BERF with varying IBO in HPA. 64 QAM xii symbols, 64 point IFFT (os=1), RRCF with 128 taps and _=0.15. . . . . . . 139 7.7: Baseband clip level vs. the BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. . . . . . 139 7.8: CCDF clipped in baseband at 5dB, HPA backoff set to 8dB for 64 and 128 IFFT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140 7.9: Baseband clip level vs the BERF with varying M-ary constellations. 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0. . . . . 141 7.10: Baseband clip level vs the BERF with varying p in the SSPA. 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. HPA backoff set equal to baseband clipping level. . . . . . . . . . . . . . . . . . . . . . . . . . . 142

7.11: I Q diagram showing different sector clipping regions and the direction of data reduction for 3 Sector Clipping. . . . . . . . . . . . . . . . . . . . 144 7.12: Input output relationship of clipping operation. . . . . . . . . . . . . . . . 144 7.13: I Q diagram of 1st quadrant of a 3 Sector clipping system showing the vector of an unclipped and clipped sample. . . . . . . . . . . . . . . . 150 7.14: Block diagram of 3 Sector clipping. . . . . . . . . . . . . . . . . . . . . . 156 7.15: Flowchart for the LUT in Figure 7.14 (3 Sector clipping) . . . . . . . . . . 157 7.16: Theoretical Clip level vs. SNR for Conventional (Standard), 3 Sector, and Square clipping. Theoretical (dashed), simulated (solid) . . . . . . . . 159 7.17: Clipping angle, , vs. the SNR for 3 Sector clipping, based on (7.41). . . . .160 7.18: I Q diagram showing the 1st quadrant Sector clipping regions of a) 4 Sector clipping and b) 5 Sector clipping. . . . . . . . . . . . . . . . . . . 161 7.19: Simulated clip level vs. SNR for Conventional, Square, 3, 4, and 5 Sector Clipping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162 7.20: Design flow for silicon implementation of 3 Sector clipping. . . . . . . . . 163 7.21: Block diagram of 3 Sector clipping implemented in VHDL. . . . . . . . . 163 7.22: Application Specific Integrated Circuit (ASIC) view of 3 Sector clipping algorithm implemented in Silicon using Cadence Silicon Ensemble. 0.5 process, 3 metal layers. . . . . . . . . . . . . . . . . . . 166 7.23: Block diagram of new Vector Subtraction scaling operation. . . . . . . . .169 7.24: IQ plane for Vector Subtraction showing vector of sample being clipped. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .170 7.25: Simulated CCDF for Lucent patent [101] for various iterations clipped xiii at 5dB showing the leakage of under clipped samples. . . . . . . . . . . . .171 7.26: Simulated CCDF for Vector Subtraction for various iterations clipped at 5dB showing the leakage of under clipped samples. . . . . . . . . . . . 171 7.27: Simulated clip level vs. SNR for Lucent clipping technique with varying iterations, as well as Conventional and Square clipping. . . . . . . 172 7.28: Simulated clip level vs. SNR for Vector Subtraction clipping with varying iterations, as well as Conventional and Square clipping. . . . . . . 173 7.29: Simulated 3, 4, and 5 Sector, Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15, AWGN=0 . . . . . . . . . . . . . . . . . . . . 174 7.30: Simulated Vector Subtraction (1, 2, 3, and 4 iterations), Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15, AWGN=0 . . . . . . 175 7.31: Simulated Lucent [101] clipping (1, 2, 3, and 4 iterations), Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15. AWGN=0 . . . . . . 176 7.32: Simulated 3 Sector clipping, vs. BERF with varying IBO in HPA, 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177 7.33: Simulated 4 Sector clipping, vs. BERF with varying IBO in HPA 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177

7.34: Simulated 5 Sector clipping, vs. BERF with varying IBO in HPA, 16 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178 7.35: Simulated Vector Subtraction (1 iteration), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0. . . . . . . . . . . . . . . . . . . . .179 7.36: Simulated Vector Subtraction (4 iterations), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0 . . . . . . . . . . . . . . . . . . . . 179 7.37: Simulated Lucent clipping (1 iteration), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15. AWGN=0. . . . . . . . . . . . . . . . . . . . . . . . . . . . 181 xiv 7.38: Simulated Lucent clipping (4 iterations), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15. AWGN=0. . . . . . . . . . . . . . . . . . . . 181 7.39: Simulated Square clipping vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and _=0.15. AWGN=0. . . . . . . . . . . . . . . . . . . . . . . . . . . . 182 7.40: PSD 0dB clipping in baseband with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 7.41: PSD 5dB clipping in baseband with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 7.42: PSD after 0dB clipping in baseband and receiver filtering, with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3, CL+4, LPA). RRCF, alpha=0.15, 128 taps. . . . . . . . . . . . . . . . . . 186 7.43: PSD after 5dB clipping in baseband and receiver filtering, with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps. . . . . . . . . . . . . . . . . . . . . .186 7.44: Block diagram of the Level Detection Algorithm (LDA). . . . . . . . . . .189 7.45: Detailed block diagram of the LDA. . . . . . . . . . . . . . . . . . . . . 189 7.46: a) Amplitude of zero padded input to filter b) amplitude of filtered output c) LDA correction vectors. . . . . . . . . . . . . . . . . . . . . . . 190 7.47: Vector representation of filtering with 3 active taps and the required correction vector to bring the output back to CL. . . . . . . . . . . . . . . 191 7.48: Block diagram of the simulation model used to evaluate LDA. . . . . . . .192 7.49: In band distortion for Conventional and Vector Subtraction (2 iter) with 64 and 128 taps in compensation filter. ACL=5dB. . . . . . . . . . . . . 193 7.50: In band distortion for Conventional and Vector Subtraction (2 iter) with 64 and 128 taps in compensation filter. ACL=8dB. . . . . . . . . . . . . 193 7.51: Clipping error vs. in-band distortion. Performance curves for LDA using Conventional clipping and Vector Subtraction with 2 iterations. Compensation filter has 64 and 128 taps. ACL=5dB. . . . 195 7.52: Clipping error vs. in-band distortion. Performance curves for LDA

using Conventional clipping and Vector Subtraction with 2 xv iterations. Compensation filter has 64 and 128 taps. ACL=8dB. . . . 195 xvi

Acronyms and Symbols


ADC Analog to Digital Converter ACI Adjacent Channel Interference ADSL Asymmetric Digital Subscriber Line AGC Automatic Gain Control AM/AM Amplitude Modulation to Amplitude Modulation AM/PM Amplitude Modulation to Phase Modulation AWGN Additive White Gaussian Noise BER Bit Error Rate BPSK Binary Phase Shift Keying BSLM Blind Selected Mapping BW Bandwidth CCDF Complementary Cumulative Density Function CCOFDM Combined Coded OFDM CDF Cumulative Density Function CDMA Code Division Multiple Access CF Crest Factor CL Clip Level COFDM Coded Orthogonal Division Multiplexing CP Cyclic Prefix CPR-SPS Concatenated Pseudo Random Subblock Partition Scheme CSS Cyclic Shifted Sequences DAB Digital Audio Broadcasting DAC Digital to Analog Conversion DAR Decision Aided Reconstruction DFT Discrete Fourier Transform DMT Discrete MultiTone DQSK Differential Quadrature Phase Shift Keying DRL Data Rate Loss DSP Digital Signal Processing DVB Digital Video Broadcasting EDGE Enhanced Data rates for Global Evolution xvii FEC Forward Error Correction FDM Frequency Division Multiplexing FFT Fast Fourier Transform FPGA Field Programmable Gate Array GSM Global System Mobile HD-DIVINE High Definition-Digital Video Narrowband Emission HIPERLAN2 HiPERformance Local Area Network version 2 HF High Frequency

HPA High Power Amplifier IBO Input BackOff ICI Inter Channel Interference IDFT Inverse Discrete Fourier Transform IEEE Institute of Electrical and Electronic Engineering IFFT Inverse Fast Fourier Transform ISI Inter Symbol Interference LAN Local Area Network LDA Level Detection Algorithm LP Linear program LPA Linear Power Amplifier LUT Look Up Table MIMO Multiple Input Multiple Output MLD Maximum Likelihood Detection MMSE Minimum Mean Squared Error MSE Mean Squared Error MSR Multiple Signal Representation MQAM M-ary Quadrature Amplitude Modulation OBO Output BackOff OBR Out of Band Radiation OLS Optimal Limited Search PAP Peak to Average Power PAPR Peak to Average Power Ratio PDF Power Density Function PEP Peak Envelope Power PICR Peak-Intercarrier-to Carrier Interference xviii PMEPR Peak to Mean Envelope Power Ratio PRT Peak Reduction Tones PSD Power Spectral Density PSK Phase Shift Keying P/S Parallel to Serial PTS Partial Transmit Sequences QAM Quadrature Amplitude Modulation QCQP Quadratically Constrained Quadratic Program QPSK Quadrature Phase Shift Keying RF Radio Frequency RMS Root Mean Square RRCF Root Raised Cosine Filter RS Reed-Solomon SBC Sub Block Coding SC Single Carrier SER Symbol Error Rate SES Suboptimal Exhaustive Search SL Soft Limiter SLM Selected Mapping

SNR Signal to Noise Ratio S/P Serial to Parallel SSPA Solid State Power Amplifier STERNE System de Television En Radiodiffusion NumeriquE SVD Singular Value Decomposition TCM Trellis Coded Modulation TI Time Inversion TR Tone Reservation TWTA Travelling Wave Tube Amplifier VHDL Visual Hardware Design Language VLSI Very Large Scale Integration WCDMA Wide Band Code Division Multiplexing WLAN Wireless Local Area Network N Number of subcarriers n nth subcarrier in OFDM symbol xix

( ) m x t mth continuous OFDM symbol


x nth discrete sample of mth OFDM symbol W Total occupied frequency bandwidth of OFDM symbol f Frequency separation between subcarriers T Total OFDM symbol duration Ts Duration of one sample in OFDM symbol m,k X kth mapped transmit sample of mth OFDM symbol m,k Y kth mapped received sample of mth OFDM symbol Ng Length of Cyclic Prefix TE Number of taps in equalization algorithm P Mean envelope power Pav Average Power of an OFDM symbol Peak to Average Power Ratio of OFDM CF Crest Factor of OFDM mPB x mth passband OFDM symbol
m,n

()

P Probability density function of an OFDM symbol F [ ] Transfer properties of HPA min d Minimum code distance d(Ci, Cj) Code distance Rc Code rate RM(2, m) 2nd order Reed-Muller codes m,k p Phase rotation of PTS subblock W Number of phase rotations in PTS V Number of Subblocks os Oversampling rate in IFFT U Number of alternate SLM transmit symbols

v Cyclic shift in time domain for TI Rxx Autocorrelation of the input Rxy Cross correlation Ryy Autocorrelation of the output Roll off factor of pulse shaping filter xx R Clip level SNRconv SNR of conventional clipping SNR3sec SNR of 3 Sector clipping d data stream Chapter 1: Introduction 1

Chapter 1 Introduction
With the advance of communications technology comes the demand for higher data rate services such as multimedia, voice, and data over both wired and wireless links. New modulation schemes are required to transfer the large amounts of data which existing 3rd generation schemes such as Global System Mobile (GSM), its enhanced version Enhanced Data Rates for Global Evolution (EDGE), and Wideband Code Division Multiple Access (WCDMA) cannot support. These new modulation schemes must be able to act over point to point links and in broadcast mode, support bi-directional communications, and be able to adapt to different requirements of individual services in terms of their data rate, allowable Bit Error Rate (BER), and maximum delay. One new modulation scheme which has received significant attention over the last few years is a form of multicarrier modulation called Orthogonal Frequency Division Multiplexing (OFDM). OFDM has been used for Digital Audio Broadcasting (DAB) and Digital Video Broadcasting (DVB) in Europe, and for Asymmetric Digital Subscriber Line (ADSL) high data rate wired links. OFDM has also been standardized as the physical layer for the wireless networking standard HIPERLAN2 in Europe and as the IEEE 802.11a, g standard in the US, promising raw data rates of between 6 and 54Mbps. OFDM has various properties that make it desirable over existing single carrier systems, the main advantage is OFDMs immunity to frequency selective fading. "We are all interested in the future, for that is where you and I are going to spend the rest of our lives. And remember my friend, future events such as these will affect you in the future." ---Criswell, Intro to "Plan 9 From Outer Space" (1958) Chapter 1: Introduction 2 Single carrier systems can increase their data rate by shortening the symbol time,

thereby increasing the occupied bandwidth. Wideband channels are sensitive to frequency selective fading which require complex equalizers in the receiver to recover the original signal. OFDM overcomes this problem by dividing the wideband channel into a series of narrowband channels which each experience flat fading. Therefore only 1 tap equalizers are required in the receiver, reducing complexity greatly. Other factors such as advances in silicon and Digital Signal Processing (DSP) allow the use of efficient Fourier transforms in the transmitter and receiver to perform the modulation, demodulation respectively. Due to the orthogonality of the subcarriers the transmission bandwidth is used efficiently as the subcarriers are allowed to overlap each other and still be decoded at the receiver. Despite the many advantages of OFDM it still suffers from some limitations such as sensitivity to carrier frequency offset and a large Peak to Average Power Ratio (PAPR). The large PAPR is due to the superposition of N independent equally spaced subcarriers at the output of the Inverse Fast Fourier Transform (IFFT) in the transmitter. A large PAPR is a problem as it requires increased complexity in the wordlength at the output of the IFFT and the Digital to Analog Converter (DAC). Perhaps the most serious problem is the reduced efficiency of the High Power Amplifier (HPA) which must cater for these low probability large peaks. If the high PAPR is allowed to saturate the HPA out of band radiation is produced affecting adjacent channels and degrading the BER at the receiver. As portable devices have a finite battery life it is important to find ways of reducing the PAPR allowing for a smaller more efficient HPA, which in turn will mean a longer lasting battery life.

1.1 Structure of thesis


This thesis analyses the principles of OFDM concentrating on the PAPR problem in OFDM. The thesis is structured as follows: Chapter 1: Introduction 3 Chapter 2 provides an initial overview of OFDM starting with a brief history of multicarrier networks and their evolution towards OFDM. The multicarrier principle is explained mathematically encompassing the use of the Fourier transform and the principle of orthogonality. OFDM in time varying channels is discussed, its advantages in terms of multipath propagation, the use of a cyclic prefix, frequency selective fading, and equalization. Problems with OFDM are also discussed such as synchronization, which includes timing errors, carrier phase noise, and frequency errors. The issue of the PAPR is also briefly presented. The chapter concludes with a discussion on the applications of OFDM in society. Chapter 3 explores the issue of the PAPR in more detail starting with a mathematical definition of the PAPR. Theoretical Cumulative Complementary Distribution Function (CCDF) results are compared to simulated CCDF results identifying the processes which influence large peaks such as the number of subcarriers and oversampling. Non linearities are treated with a description of various models for the HPA, and finally the effect of saturation of the HPA is analyzed in terms of the PSD and the BER. Chapter 4 begins the literature review for PAPR reduction techniques

reviewing distortionless techniques. Distortionless techniques do not corrupt the data and encode it in such a way that it can be completely recovered at the receiver. Initially the family of coding techniques such as block codes, cyclic codes, Shapiro-Rudin Sequences, Golay complementary sequences, and Reed Muller codes are presented. Multiple representation techniques such as SeLective Mapping (SLM), Partial Transmit Sequences (PTS) are reviewed with and without oversampling. Finally modified constellation techniques Tone Reservation (TR) and Tone Insertion (TI) are examined. Chapter 5 introduces several new alterations to PTS called Cyclic Shifted Sequences (CSS) and Time Inversion (TI). PTS produces alternative transmit signals by dividing the bit source into a V sub-blocks which each have an IFFT performed on them. Sub-blocks are then rotated by a set phase rotation Chapter 1: Introduction 4 (which must be sent as side information to the receiver) and combined to produce a possible transmit symbol, after a number of set phase rotations the transmit symbol with the lowest PAPR is chosen for transmission. CSS and TI reduce complexity and improve performance of PTS by using time shifts of the data instead of phase rotations which can be combined with standard PTS to reduce complexity and in some cases allow for the removal of a whole IFFT operation without degrading performance. It is further shown that CSS and TI perform better after oversampling and filtering than PTS. This work is published in [1, 2]. Chapter 6 continues the literature review for distorted PAPR reduction techniques which do not attempt to create a transmit signal with a low crest factor, instead they take the output of the IFFT and then limit the amplitude of large samples which invariably causes distortion degrading the BER. Methods reviewed are pulse shaping (or windowing), and clipping at every stage from the output of the IFFT to limited backoffs in the amplifier. Results are analyzed in terms of their BER and affect on the PSD. Chapter 7 introduces new low complexity clipping techniques starting with a comprehensive analysis of an OFDM transceiver with clipping at various points in the transmission chain and under other variable conditions such as the amount of oversampling in the IFFT, pulse shaping taps and roll off, and the HPA parameters. New low complexity clipping methods are introduced which avoid complex hardware operations while maintaining similar performance to conventional clipping. The new clipping algorithms called Sector clipping and Vector Subtraction are then implemented in a new clip and filter algorithm which is much less susceptible to peak regrowth after baseband filtering. This work is published in [3]. 1.1.1 Contributions The contribution that this research work has made to the wireless communications field is summarized as follows: Chapter 1: Introduction 5 A thorough analysis of the theory, principles, and techniques of OFDM based

wireless systems including a detailed analysis of PAPR reduction techniques are presented (Chapters 2, 3, 4, and 6). A new method for producing Partial Transmit Sequences (PTS) signals and their performance under oversampling conditions is proposed (Chapter 5). A detailed analysis of the effect of clipping on an OFDM transceiver under various system conditions (Chapter 7). Several new low complexity clipping algorithms are proposed (Chapter 7). Implementation and analysis of a proposed clip and filter algorithm utilizing one of the new low latency clipping algorithms (Chapter 7). Chapter 2: Orthogonal Frequency Division Multiplexing 6

Chapter 2
Orthogonal Frequency Division Multiplexing
This chapter provides an initial overview of Orthogonal Frequency Division Multiplexing (OFDM). Section 2.1 provides a brief history of multicarrier networks and their evolution towards OFDM. Section 2.2 explains the multicarrier principle and Section 2.3 explains how it is applied to OFDM, detailing the use of the Fourier transform, and the importance of orthogonality. Section 2.4 explores OFDM in time varying channels describing its advantages in terms of multipath propagation, the use of a cyclic prefix, frequency selective fading, and equalization. Limitations of OFDM, such as synchronization, which includes timing errors, carrier phase noise, and frequency errors are discussed in Section 2.5. Non linearities are also introduced as a major hindrance to a practical OFDM system in this section. Section 2.6 looks at applications of OFDM in society and discusses where this new communications technology will be used. Finally Section 2.7 summarizes the chapter with a brief recap of the chapter.

2.1 History of multicarrier networks


Multicarrier networks such as Frequency Division Multiplexing (FDM) have been around since the late 1950s [4], however due to their implementational complexity and inefficient use of the frequency band they were restricted to military applications. A multicarrier system is basically a number of information bearing carriers Chapter 2: Orthogonal Frequency Division Multiplexing 7 transmitted in parallel. Multicarrier systems in wireless applications are less susceptible to channel induced distortions than single carrier systems at corresponding data rates. Chang [5] and Saltzberg [6] further developed FDM in the mid 60s by introducing multiple carriers which overlap in the frequency domain without interfering with each other, utilizing the frequency spectrum more efficiently, hence OFDM. However the complexity issue still remained. In the 1970s Weinstein and Ebert [7] used an Inverse Discrete Fourier Transform (IDFT) and Discrete Fourier Transform (DFT) to perform the modulation and

demodulation respectively, exploiting the sinusoidal nature of the Fourier Transform and significantly reducing the complexity of an OFDM system. In the last 10 years more advances in practical OFDM systems have been made, particularly in Europe where various projects and prototypes were initiated such as DIgital VIdeo Narrowband Emission (HD-DIVINE), System de Television En Radiodiffusion NumeriquE (STERNE), and digital Terrestrial Television broadcasting (dTTb). This has led to the adoption of OFDM in many European standards. OFDM has progressed to the point where it has now been used for various communication applications such as Digital Audio Broadcasting (DAB) and Digital Video Broadcasting (DVB) in Europe. It has also been adopted as the physical layer modulation scheme for wireless networking standards such as Hiperlan2 in Europe and the Institute of Electrical and Electronic Engineers (IEEE) 802.11a, g standards in the United States. However while OFDM successfully alleviates the problem of dispersive channels there are still some problems which need to be addressed such as time and frequency synchronization, frequency selective fading, and the Peak to Average Power Ratio (PAPR). Chapter 2: Orthogonal Frequency Division Multiplexing 8

2.2 Multicarrier principle


An early form of a multicarrier system is shown in Figure 2.1. The basic principle of multicarrier modulation is to divide the data stream, d, into N parallel data streams with a reduced data rate of d/N. Each low rate data steam is then modulated on a separate narrow band subcarrier and summed together for transmission, thereby providing the same data rate as an equivalent single carrier system. At the receiver a set of filter banks separate the wideband signal into the original narrowband subcarriers for demodulation. The advantage of this structure over single carrier systems is that the extended symbol time (due to lower data rate) makes the signal less susceptible to effects of the channel such as multipath propagation which introduces Inter Symbol Interference (ISI). Each subchannel will therefore experience flat fading reducing the equalization complexity in the receiver dramatically. This issue will be explored in more depth in Section 2.4. A disadvantage of the method shown in Figure 2.1 is the implementation complexity due to the large number of filter banks required in the transmitter and receiver as well as the inefficient use of the available frequency band [8]. The spectra of the different carriers cannot overlap as this would introduce distortion degrading system performance.

2.3 OFDM implementation of multicarrier modulation


A more spectrally efficient implementation of the aforementioned multicarrier system is OFDM (Figure 2.2). In OFDM the transmit signals are constructed in such a way that the frequency spectra of the individual subchannels are allowed to overlap thereby utilising the frequency spectrum much more efficiently. Mathematically the continuous time representation of the OFDM transmit signal depicted in Figure 2.2 is

()()2
2

, 21

1 ..
N j k ft mmkk N

x t X e w t mT N

= _ 0 > t > T (2.1) Chapter 2: Orthogonal Frequency Division Multiplexing 9


Figure 2.1: Block diagram of a basic multicarrier system.

RF
Filter f0 QAM

QAM QAM
Filter f1
Filter fN-1

f0 f1 fN-1
QAM Filter QAM Filter RF QAM Filter

f0 f1 fN-1
d/N b/s d/N b/s d/N b/s d0(t) d1(t) dN-1(t)

f0 f1 f2
Transmitter Receiver
Chapter 2: Orthogonal Frequency Division Multiplexing 10
Figure 2.2: Basic OFDM transmitter and receiver. Occupied frequency band shown in between.

f0 f1 f2 f3 f4
Transmitter

Parallel to serial convertor QAM

_ _ _

RF

f0 f1 fN-1
0

X
1

X
1

Receiver
QAM

Serial to parallel converter


RF
0 1

f f f

N1

. . . . . .
x(t)
1 0

( ) Re xp( )
N nn n =

x t X e j t __ =__ __

Time-limited signal (block processing)


X X 1X N1X R b/s
n 0

Chapter 2: Orthogonal Frequency Division Multiplexing 11 where m,k X is the mapped (QAM, PSK, etc) data to be transmitted on the kth subcarrier of the mth transmitted symbol, e j2 kft is the kth subcarrier, f is the

frequency spacing between subcarriers, and ( ) k w t mT is a rectangular window applied to each subcarrier, N is the number of subcarriers, and T is the total time of the transmit symbol. To ensure the orthogonal relationship between subcarriers f is set as W1 NT = (W is the total bandwidth of the signal). In the receiver an integrate and dump operation is performed over time T to recover the data. 2.3.1 Use of Fourier Transform for modulation and demodulation In order to make multicarrier systems a more practical technology an IDFT and DFT are used for the baseband modulation and demodulation respectively, as first suggested in reference [7], where the sinusoidal nature of the Fourier transform basic functions is exploited. Advances in silicon technology have made the production of the DFT more cost efficient [9-11]. Figure 2.3 shows a block diagram of a basic OFDM system in the baseband utilising the IDFT, DFT pair. A discrete time representation of (2.1) can be obtained by sampling the continuous signal. Under the condition that W = Nf and 1 f T = the signal can be determined by its samples if sampled at T t N = . Under this condition (2.1) then becomes (2.2)

{}
,,, 0

12

1 .
N j nk N mnmkmk k

x X e IDFT X N
=

= _ = 0 n N 1 (2.2) where n are the discrete sampling points. This equation describes exactly the IDFT operation. In hardware the more efficient form of the IDFT and DFT, the Inverse Fast Fourier Transform (IFFT) and Fast Fourier Transform (FFT) is used for the modulation, demodulation respectively, where N is set to be a power of 2. Chapter 2: Orthogonal Frequency Division Multiplexing 12 Figure 2.3 shows a baseband transceiver structure for OFDM utilising the Fourier transform for modulation and demodulation. Here the serial data stream is mapped to complex data symbols (PSK, QAM, etc) with a symbol rate of 1 s T . The data is then demultiplexed by a serial to parallel converter resulting in a block of N complex symbols, X0 to XN-1. The parallel samples are then passed through an N point IFFT (in this case no oversampling is assumed) with a rectangular window of length N.Ts, resulting in complex samples x0 to xN-1. Assuming the incoming complex data is random it follows that the IFFT is a set of N independent random complex sinusoids summed together. The samples, x0 to N 1 x are then converted back into a serial data stream producing a baseband OFDM transmit symbol of length T=N.Ts.
Figure 2.3: Basic OFDM transmitter and receiver pair utilizing Fourier transform.

A Cyclic Prefix (CP), which is a copy of the last part of the samples is appended to the front of the serial data stream before Radio Frequency (RF) up conversion and Serial to Parallel Parallel to Serial IFFT
Channel + noise

Data source Data sink Parallel to Serial Serial to Parallel FFT Constellation mapping

Constellation demapping CP CP Binary data y0 yN-1 y1 X0 XN-1 X1 x0 xN-1 x1 Y0 YN-1 Y1 Complex data Chapter 2: Orthogonal Frequency Division Multiplexing 13 transmission. The CP combats the disrupting effects of the channel which introduce Inter Symbol Interference (ISI) and is discussed in more detail in section 2.4.2. In the receiver the whole process is reversed to recover the transmitted data, the CP is removed prior to the FFT which reverses the effect of the IFFT. The complex symbols at the output of the FFT, Y0 .. YN-1 are then decoded and the original bit steam recovered. Mathematically the demodulation process (assuming no CP and no channel impairments) using the FFT is (2.3)

{}
() ()

[]
,, 12 , 0 112 , 00 112 , 00 1 , 0 ,

1 1 1 1
mkmn N j nk N

mn n NNjndk N md nd NNjndk N md dn N md d mk

Y FFT x xe N Xe N Xe N XNdk N X

= == == =

= = = = = =

_ __ __ _
(2.3) 2.3.2 Orthogonality in OFDM One of the key advantages of OFDM is its efficient use of the frequency band as the subcarriers are allowed to overlap each other in the frequency domain. The N equally spaced subcarriers will be orthogonal if the frequency separation between subcarriers is 1 1
s

f = N T = T , where N.Ts is symbol duration, and rectangular windowing of the IFFT is performed. Under these conditions the subcarriers will have a sinc waveform frequency response. Figure 2.4 shows the frequency response of a 5 carrier system where it is seen that because of the orthogonal relationship the maximum of a particular sample corresponds to a null in all other carriers, therefore eliminating the effects of interference. Smoother window functions (eg. Raised Cosine Filter) reduce Chapter 2: Orthogonal Frequency Division Multiplexing 14 the out of band emissions and Inter Carrier Interference (ICI) susceptibility to system imperfections (e.g. frequency offset) but they increase the symbol period. Mathematically, orthogonality of two signals, k (t ) and ( is described in reference [12] and expressed here as (2.4)

) l t over time period N.Ts

()()0
0, ,
NTs kl

kl t t dt Ckl _ =_ _=

(2.4) where C is a constant.


Figure 2.4: Frequency spectrum of 5 orthogonal subcarriers of an OFDM transmit signal

2.4 OFDM transmission over time varying channels


OFDM is being primarily deployed in the wireless environment. This section describes properties of the wireless channel and describes the advantages and disadvantages of OFDM in this environment.
-6 -4 -2 0 2 4 6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Carrier Spectrum with no carrier offset

1/NTs
Chapter 2: Orthogonal Frequency Division Multiplexing 15 2.4.1 Multipath propagation The wireless channel is a harsh one, electromagnetic signals travelling through this medium are fraught with disruptive and warping effects. The transmitted signal does not only have a direct path to the receiver (in the case of line of sight). The signal is reflected of buildings and mountains and other obstacles so that multiple delayed copies of the same transmitted signal arrive at the receiver affecting other symbols. This causes ISI which degrades the Bit Error Rate (BER). The longer the delay of the

paths the greater the ISI, a measure of the delay is given by the root-mean-square (rms) delay spread which is a measure of the delay experienced by a single pulse. It is this effect which restricts single carrier systems from achieving high data rates. The data rate in a single carrier system can be increased by shortening the symbol time of the transmitted pulses, but they will be even more affected by the rms delay spread and require more complex equalisation in the receiver. As the rms delay spread is a result of the physical channel it cannot be changed and systems must be designed to accommodate it. This phenomenon has prompted the use of multicarrier techniques where the transmitted bandwidth is divided into many narrow band channels which are then transmitted in parallel. Each subcarrier is modulated at a sufficiently low data rate so that it is not affected by the delay spread. 2.4.2 Use of a Cyclic Prefix In order to protect successive OFDM symbols from multipath a CP of length Ng is used which is a copy of the last part of the samples of a OFDM transmit block appended to the front before transmission as depicted in Figure 2.5. The transmitted signal is therefore N+Ng samples. Provided that the length of the CP is chosen so that it is longer than the longest expected delay path successive OFDM symbols will be free of ISI [13]. At the receiver a window of N samples is chosen from the N+Ng length block for maximum power, the rest of the repeated samples are discarded. After cyclic shifting to get the samples back into the original order a FFT is performed to demodulate the data. Chapter 2: Orthogonal Frequency Division Multiplexing 16 Obviously the use of a CP decreases the data rate by a factor of
g

N N+N (2.5) as the repeated samples are discarded in the receiver so it is important to keep the length of the CP as short as possible with respect to the rms delay spread. A loss in the SNR of the received signal is also incurred due to the lost energy in the CP. However there are techniques which use the CP for both frequency offset estimation and symbol synchronization [14]. Also when filtering the signal there is a delay before the filter is at full power, by using a CP the delay will occur in CP so that N of the samples will be at full power. The CP (with repeated samples) retains the cyclic nature of the symbol by creating a periodic received signal for processing, eliminating ICI.
Figure 2.5: OFDM symbol a) without cyclic prefix, and b) with cyclic prefix.

0 N-1 time a) Original N point OFDM symbol Original N samples time Ng b) OFDM symbol with Cyclic Prefix (CP) Chapter 2: Orthogonal Frequency Division Multiplexing

17 More recent work has shown that it is possible to use null values for the CP provided additional processing is done at the receiver. The received signal in the null sample positions is wrapped around and added to the samples at the start of the symbol to restore orthogonality and eliminate ICI. This is one of the proposals for the new IEEE 802.15.3a ultra wideband standard. Other forms of the CP have also been investigated, in particular reference [15] examines the effect that using null values in the CP will have. It was concluded that null samples have a detrimental effect through loss of orthogonality increasing ICI. 2.4.3 Frequency selective fading Multipath propagation as discussed in the previous section can be combated successfully through the use of a cyclic prefix. Frequency selective fading is the reciprocal effect of multipath propagation in the time domain and can be defined thus. If the channel has a constant magnitude and phase response over a bandwidth that is smaller than the bandwidth of the transmitted signal the channel creates frequency selective fading [16]. Under this condition the signal experiences multipath introducing ISI, this effect shows itself in the frequency domain where certain frequency components in the received spectrum have greater or less power than the transmitted spectrum. Narrow pulses in time (such as high data rate single carrier transmission) occupy a wide frequency bandwidth, conversely pulses with a long duration (such as OFDM) occupy a relatively narrow frequency band. Figure 2.6 compares a single and multicarrier signal in the time and corresponding frequency domain with equivalent data rates. Here we see that the many narrowband channels of the OFDM signal experience fading, however each subchannel has a constant gain within its own frequency band. Constant fading over the occupied bandwidth is known as frequency flat fading and is a much easier effect to correct in the receiver than frequency selective fading. Chapter 2: Orthogonal Frequency Division Multiplexing 18 2.4.4 Equalization In order for the receiver to correct the effect of fading equalization is performed in the receiver which is the process of measuring the channel response and using this information to correct the received signal. Several methods have been suggested, one of the more popular methods utilize pilot tones [17-19] which are certain (usually evenly spaced) subcarriers with a known amplitude and phase at the receiver. By measuring the difference between the received and the transmitted value a picture of the channel can be extrapolated. Other methods use blind estimation techniques [20, 21] which do not require pilot tones. The equalization technique depends on the modulation scheme and the channel properties. Equalization algorithms are usually implemented as tapped delay lines. The relation between the number of taps required, TE and the occupied bandwidth, W, of the signal is [22].

( )2
TE = BW (2.6) Signals which experience frequency selective fading such as single carrier systems require complex equalization (i.e. more taps) where the complexity is directly

proportional the bandwidth of the signal. Signals such as OFDM which experience frequency flat fading only require a 1 tap equalizer. This reduction in equalization complexity is a driving force for the use of OFDM.

2.5 Limitations in OFDM


Previous sections have detailed the advantages of OFDM, however the advantages are offset by some problems that are unique to OFDM, namely time and frequency synchronization problems and non linearities. Chapter 2: Orthogonal Frequency Division Multiplexing 19
Figure 2.6: Time and frequency properties of single carrier and OFDM techniques.

2.5.1 Synchronization Both time and frequency synchronization are a major drawback in OFDM, the following sections detail the problem and provide a basic introduction into solutions for these problems. 2.5.1.1 Timing errors Timing synchronization is the process of finding the start of a symbol in the receiver. If the timing mismatch is within the CP the demodulation produces a linear phase rotation at the output of the FFT which can be corrected with a channel estimator.

( ) [ ] ()1
2 0
c

N j k ft f k f t b k

x t a k e
__ + + __ =

=_ (2.7) freq freq time time time time OFDM time and frequency properties Single carrier time and frequency properties Frequency fading envelope Frequency fading envelope NTs Ts N 1

2
1 s NT 1 sT

Chapter 2: Orthogonal Frequency Division Multiplexing 20 where [ ] [ ] j2 ( fc n f ) t ta k a k e

= + (2.8) is the phase shift. If the timing mismatch is not corrected additional interference (ISI) is generated. Therefore a sufficient length of the CP needs to be chosen. An alternative approach is to use pilot based methods [13] which uses certain carriers with a known amplitude and phase at the receiver. By analyzing the phase rotation and amplitude change, an estimate of the channel can be made. 2.5.1.2 Carrier phase noise Carrier phase noise is caused by a mismatch in the RF oscillators in the transmitter and receiver and manifests itself in the baseband as additional phase rotation and amplitude attenuation [13]. No distinction can be made between phase rotations introduced by timing errors and carrier phase offset [14]. The effect of phase noise is more pronounced in differential detection schemes than coherent detection schemes [13]. Several references [23, 24] have analyzed the effect of carrier phase noise on the performance of OFDM schemes. 2.5.1.3 Frequency errors Frequency offset errors are caused by mismatch between the RF oscillators, Doppler shifts, and phase noise introduced by non linear channels [14]. Frequency offset causes the received signal to not be sampled at the peak, this means that the sample under consideration is not at maximum power. Power from adjacent subcarriers is also sampled as well. A simplistic representation of this effect in the frequency domain is visualized in Figure 2.7. Chapter 2: Orthogonal Frequency Division Multiplexing 21
4 3 2 1 0 1 2 3 4 0.4 0.2 0 0.2 0.4 0.6 0.8 1 frequency (normalised)

Amplitude

Figure 2.7: Effects of frequency offset _F: reduction of signal amplitude (star), and ICI (circle).

OFDM is more sensitive to frequency offset than single carrier systems due to the tight orthogonal packing of the subcarriers. Reference [14] concludes that to avoid severe degradation the frequency accuracy should be better than 2%. Suggested solutions to frequency synchronization (like symbol synchronization) are based on pilot symbols and the cyclic prefix are used in reference [14] where it is

noted that time and frequency synchronization are closely related. Frequency sensitivity can be made more robust by reducing the number of subcarriers within a set bandwidth thereby increasing the frequency distance between subcarriers. However this shortens the symbol time which increases the demands on timing synchronization, therefore a trade off must be made.

F
Chapter 2: Orthogonal Frequency Division Multiplexing 22 2.5.2 Non linearities Another limiting aspect of multicarrier and OFDM modulation is the high instantaneous signal peak with respect to the signals average power. Large peaks are due to the superposition of N random phase sine waves in the IFFT. Hardware components such as the Digital-to-Analog Converter (DAC), IFFT/FFT with limited word length and most importantly the High Power Amplifier (HPA) will be driven into saturation unless they are designed to operate over large dynamic ranges. If the signal is allowed to go into saturation both in band noise which degrades the BER and out of band radiation introducing ICI will result. Therefore many papers (refer to Chapters 4, 5, 6, and 7) have been published on ways to overcome the PAPR and can be divided into two methods: distortionless techniques which attempt to create a transmit signal with a low PAPR without affecting BER of the data, and distorted techniques which deliberately reduce peaks but increase distortion and therefore the BER. This problem is the area of research of this thesis and will be treated with more detail in the following chapters.

2.6 Applications of OFDM


The previous section detailed some of the problems with OFDM, it should be noted that depending on the application and medium different design issues take precedence. This section identifies some of the current and future applications of OFDM. OFDM takes its place in the next generation of communication systems because of its high data rates and low complexity. 2.6.1 COFDM Coded OFDM (COFDM) is a practical form of OFDM where redundant bits are inserted into the bit stream at the transmitter. These specially chosen bits allow powerful error correction codes in the receiver to reduce the BER. The more bits used for error correction the better the error correction properties, however the useful data rate is decreased. Chapter 2: Orthogonal Frequency Division Multiplexing 23 Types of error correction codes used for example DAB-OFDM are Trellis Coded Modulation (TCM) combined with frequency and time interleaving. In practice all the following technologies use some form of COFDM. 2.6.2 Digital Audio Broadcasting Digital Audio Broadcasting (DAB) using OFDM has been standardized in Europe [25] and is the next step in evolution beyond FM radio broadcasting providing interference free transmssion. The standard for DAB is known as Eureka-147 [26] and is a multi-service digital broadcasting method transmitting at around 1.5Mbps in the 1.536MHz band. In DAB between 192 and 1536 carriers are used with

Differential Quadrature Phase Shift Keying (DQPSK), which allows the system to avoid channel estimation techniques. The very long symbol time means that large echos can be tolerated and that the redundancy due to the CP is not that great. Large echoes are expected as the broadcasting is over large distances so that long delay paths will be present. The PAPR is a problem but as DAB only uses DQPSK modulation it is more impervious to noise generated through saturation of the amplifier. The DAB data payload contains audio, data associated with audio, and other optional data services. Table 2.1 displays system parameters for DAB.
Table 2.1: DAB parameters.

Parameters Mode I II III Application SFN Terrestial Satellite Modulation DQPSK DQPSK DQPSK Total number of subcarriers 1536 384 192 OFDM symbol duration 1246S 312S 156S Guard interval 246S 62S 31S Frequency range _375MHz _1.5GHz _3GHz Chapter 2: Orthogonal Frequency Division Multiplexing 24 2.6.3 Digital Video Broadcasting Digital Video Broadcasting (DVB) [27, 28] is also using OFDM as the carrier modulation scheme. DVB promises to deliver full multimedia in digital form in a broadcast format. DVB adapts the baseband TV signal from the output of the MPEG2 [29] transport multiplexer to the terrestrial channel characteristics. Maximum spectral efficiency within the VHF and UHF bands is achieved by utilizing Single Frequency Network (SFN) operation. There are two modes defined in DVB: 1/ 2K mode, and 2/ 8K mode. The 2K mode is used for single transmitters and small SFNs where the distance for transmission is limited. The 8K mode encompasses the 2K mode as well as larger SFNs. One of the many advantages of OFDM is that different mapping types can be used on different subcarriers, this aspect is taken advantage of in DVB so that the data rate on a channel mirrors its quality. Table 2.2 shows the system parameters for DVB in 2K mode [28].
Table 2.2: DVB system parameters for 2K mode.

Parameters Value Information data rate 5-30 Mbps Modulation QPSK, 16 QAM, 64 QAM FEC code Reed Solomon outer code Convolutional inner code Code Rates , 2/3, Total number of subcarriers 1705 (2K mode) OFDM symbol duration 303S Guard interval 75.9S Signal bandwidth 5.62MHz 2.6.4 HiperLan2/802.11a Wireless networking standards such as HIPERLAN2 and 802.11a use OFDM as the physical layer modulation scheme and operate in the unlicensed 5GHz frequency band. Hiperlan2 promises to deliver raw data rates of up to 56Mbps which puts them

in the ballpark of wired LANs which have data rates of up to 100Mbps. Wireless Chapter 2: Orthogonal Frequency Division Multiplexing 25 LANs applications are for home and office networking over short distances (<50 metres) as well as community spaces such as Starbucks which operates the 802.11b wireless standard free of charge for customers and provides data rates up to 11Mbps [52]. Tables 2.3 and 2.4 list Hiperlan2/802.11a specifications and data rates respectively.
Table 2.3: HIPERLAN2 parameters

Parameter Value Sampling Rate fs=1/T 20MHz Useful symbol part duration TU 64 T 3.2S Cyclic Prefix duration TCP 16 T 0.8S (mandatory) 8T 0.4S Symbol Interval TS 80 T 4.0S (TU+TCP) 72 T 3.6S (TU+TCP) Number of data sub-carriers NSD 48 Number of pilot sub-carriers NSP 4 Total number of sub-carriers NST 52 (NSD+NSP) Subcarrier spacing _f 0.3125MHz (1/TU) Spacing between two outmost sub-carriers NST 16.25MHz (NST _f)
Table 2.4: Data rates for HIPERLAN2

Modulation Coding rate R Nominal bit rate (Mbps) BPSK 1/2 6 BPSK 3/4 9 QPSK 1/2 12 QPSK 3/4 18 16QAM 9/16 27 16QAM 3/4 36 64QAM 3/4 54 (optional) 2.6.5 ADSL Asynchronous Digital Subscriber Lines (ADSL) utilize OFDM over wired links [30]. Data rates for ADSL standard [14] are 1.54Mbps to 6.1Mbps in the downlink and 9.6 to 192Kbps in the uplink over several kilometers of ordinary twisted pair telephone line, while still supporting the standard telephone. The unbalanced data rates make Chapter 2: Orthogonal Frequency Division Multiplexing 26 ADSL particularly applicable to internet type applications where the downlink rate is typically much larger than the uplink rate. Stationary channels like wireless links do not change over time, therefore a technique called bit loading is used. Bit loading assigns a mapping type to sub-carriers depending on its quality, using the available bandwidth efficiently. Bit loading used

in conjunction with OFDM over wired links is usually called Discrete MultiTone (DMT). 2.6.6 MIMO OFDM Multiple In Multiple Out (MIMO) [31] OFDM combines OFDM with multiple antennas at the transmitter and receiver. This structure allows greater diversity when techniques such as Singular Value Decomposition (SVD) are used. This process, called spatial multiplexing, proportionally boosts the data-transmission speed by a factor equal to the number of transmitting antennas. In addition, since all data is transmitted both in the same frequency band and with separate spatial signatures, this technique utilizes spectrum very efficiently.

2.7 Conclusion
This chapter introduced fundamental properties of OFDM, identifying its advantages and discussing its limitations. Specifically the history of muticarrier networks was discussed and its evolution towards OFDM. The introduction of the Fourier transform, advances in silicon technology, and the efficient use of the frequency spectrum with orthogonally spaced subcarriers were shown to make OFDM a practical technology for the next generation of digital communications. OFDM transmission over wireless channels was discussed focusing on multipath propagation and the advantages of OFDM in this medium. The use of the cyclic prefix in OFDM was shown to reduce the equalizer complexity dramatically down to one tap per subcarrier. This is one of the great advantages of OFDM over single Chapter 2: Orthogonal Frequency Division Multiplexing 27 carrier networks which require prohibitively high complexity equalizer structures as the carrier frequency is increased to cater for higher data rates. Frequency selective fading was also introduced as a major advantage of OFDM where due to the long effective symbol time OFDM subcarriers experience flat fading. Limitations of OFDM were analyzed next, with the two main disadvantages; synchronization errors and non linearities treated. Synchronization errors were shown to include timing errors, carrier phase noise, and frequency errors. Non linearities due to the Rayleigh distributed samples at the output of the IFFT were also briefly introduced and shown to have a degrading affect on the quality of OFDM systems. Finally applications of OFDM were presented detailing where this new technology has manifested itself in society. Areas of application in the wireless field were shown to be COFDM, DAB, DVB where OFDM is used in a broadcast mode, Hiperlan2/802.11a for wireless networking potentially taking the place of large wired networks. OFDM in DMT form is being used in wired networks for ADSL. Extensions of OFDM such as MIMO OFDM were also shown to expand the reach of OFDM systems. Chapter 3: Peak to Average Power in OFDM 28

Chapter 3 Peak to Average Power in OFDM

Chapter 2 discussed fundamental principles of OFDM and showed how it is a practical technology for the next generation of high data rate communication systems. However several design issues need to be addressed, one of the most important being the Peak to Average Power Ratio (PAPR) of the highly fluctuating transmit signal envelope. Due to the nature of the IFFT which, as described in Section 2.3, sums N sinusoids through superposition, some combinations of the sinusoids create large peaks. The drawback of a large dynamic range is that it places pressure on the design of components such as the word length of the IFFT/FFT pair, DAC and ADC, mixer stages, and most importantly the HPA which must be designed to handle irregularly occurring large peaks. Failure to design components with a sufficiently large linear range results in saturation of the HPA. Saturation creates both in band distortion, increasing the BER and out of band distortion, or spectral splatter, which causes ACI. One obvious solution is to design the components to operate within large linear regions, however this is impractical as the components will be operating inefficiently and the cost becomes prohibitively high. This is especially apparent in the HPA where much of the cost and ~50% of the size of a transmitter lies. This chapter provides a mathematical definition of the PAPR and identifies the processes which contribute to large peaks. Specifically section 3.1 gives a Chapter 3: Peak to Average Power in OFDM 29 mathematical definition of the PAPR, section 3.2 provides a statistical analysis of PAPR and identifies contributing factors to large peaks. Section 3.3 introduces oversampling of OFDM signals while section 3.4 investigates the effect of non linearities on OFDM, finally section 3.5 summarizes the chapter reiterating the main points of this section. Note that the terms subcarrier, tones, and N will be used interchangeably to signify the number of subcarriers in an OFDM symbol.

3.1 Peak to Average Power Ratio


The PAPR is the relation between the maximum power of a sample in a given OFDM transmit symbol divided by the average power of that OFDM symbol. The mean envelope power of the baseband expression (assuming same constellation on each subcarrier) is defined as (3.1)

()1
22 0, 0

1 T1 N
tmmk k

P x t dt X TN
= =

= _ = _ (3.1) where ( ) m x t is defined in (2.1), Xm,k are assumed to be complex Quadature Amplitude Modulated (QAM) data which are statistically independent, identically distributed (i.i.d) random variables with 0 mean and variance 2 2

E_X_ __ . The average power is defined as (3.2)


m,k

[]()2
av m

P = E P = E _ x t _ _ _ (3.2) The PAPR can then be defined

()2
0 av

max t T m

xt P = (3.3)

where ( ) 2 0 max t T m x t is the maximum instantaneous power within the period 0tT. Chapter 3: Peak to Average Power in OFDM 30 Another definition for the PAPR is crest factor, defined in reference [32] as (3.4)

()2
0 CF av

max t T m

xt P = (3.4) and results in a 3dB shift in results compared to (3.3). Throughout this thesis the definition of the PAPR given in (3.3) will used unless specified otherwise. For passband transmission the OFDM symbol is modulated onto a carrier frequency, fc ,

{() }
j2 fct mPB m

x=xte
mcmc

{ ( )}cos (2 ) { ( )}sin (2 )

= x t f t j x t f t

( )cos ( 2 ) ( )sin ( 2 ) mI c mQ c = x t j f t jx t j f t (3.5)


The carrier frequency is usually much higher than the signal bandwidth, i.e. fc>>_f, therefore the maximum of the passband signal is approximately equal to the baseband expression, i.e. max ( ) max ( ) mPB m x t x t (3.6) Most OFDM schemes usually employ QAM mapping for the modulation where

( ) ( ) 22
mI mQ

x t = x t therefore

{ ()} { ()} { ()}

222

2 2 m mI mQ E x t = E x t = E x t (3.7)

The average RF power of the passband signal can be derived as

{ ( ) } { ( ) ( ) ( ) ( ) } cos 2 sin 2 { ( ) ( ) } { ( ) ( ) } cos 2 sin 2


22 22

mPB mI c mQ c

E x t = E x t f t jx t f t

mI c mQ c

=Extft+Extft

Chapter 3: Peak to Average Power in OFDM 31

{ )} { ()} 1
2

12

2 2 mI mQ = E x t + E x t

{ ()} 1
()
2 0 2

2 m= E x t 2 av = P (3.8) Substituting (3.8) back into (3.3) gives the PAPR in the passband.

{ ()}
max t T mPB
PB mPB

xt Ext =

()2
0

max 2
t T mPB av

xt P = (3.9) The problem with OFDM is that theoretically the PAPR can be up to log2(N), which is huge. But as will be shown in the next section the general distribution of samples is much lower.

3.2 Statistical distribution of OFDM samples


Section 3.1 provided a worst case scenario or upper bound of the PAPR, but only a few combinations of input data sequences produce large peaks, therefore it is more pertinent to define the statistical distribution of the PAPR in OFDM. (2.2) describes a

Nyquist sampled baseband OFDM symbol with N subcarriers, from the central limit theorem [33] the sum of these elements are zero mean complex random near Gaussian (provided N>64) distributed variables with variance, _2, of . It then follows that the amplitude, an, of the OFDM symbol has a Rayleigh distribution [13] with a Probability Density Function (PDF) of

()
2

22 n2

pe

= (3.10) Chapter 3: Peak to Average Power in OFDM 32 Substituting _2=

()
n

pe = (3.11) Figure 3.1 shows the simulated envelope (N=64) of a baseband OFDM system which as expected follows a Rayleigh distribution. It is seen here that the probability of any given sample having a magnitude above 3dB decays rapidly. The probability that the magnitude of a sample is below a certain threshold, 0 , is given by the Cumulative Distribution Function (CDF)

{ } ( ) 0 Pr
n

= _ 02
0

2e = _ 2 =1 e 0 (3.12) Under the assumption of statistically independent samples the Complementary Cumulative Distribution Function (CCDF) can be found for the case where at least one sample in an OFDM symbol exceeds the magnitude, 0 .
00 00

Pr max 1 Pr max
nNnN <<

_____>_=__

____

( { })

1 Pr N =

()
110
N

= e (3.13) Figure 3.2 displays simulated and theoretical results of (3.13) with varying N. The simulation model passes N QPSK symbols through a N point IFFT, the maximum sample of each OFDM symbol is stored and plotted. As in Figure 3.1 the probability that large peaks occur is very irregular, doubling the number of subcarriers results in a modest increase in the PAPR leading to the assumption that using a large number of subcarriers makes sense as this will allow for greater data throughput (provided the CP length is constant). However a larger number of subcarriers leads to increased sensitivity to carrier and sampling frequency offsets as described in section 2.5.1.2 and 2.5.1.3. Therefore, balance must be met between these design constraints. Comparing the theoretical and simulated results we see that the results only converge Chapter 3: Peak to Average Power in OFDM 33 at N=128, the theoretical results for N<128 are slightly more pessimistic at higher PAPR levels.
Figure 3.1: Simulated envelope for OFDM system (N=64) normalized by average power.

.
2 3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100

0
0

(dB) Pr( > ) N=32 N=64 N=128 N=256

Figure 3.2: Simulated (solid line) and theoretical (3.13, dashed line) OFDM symbol CCDF for N=32, 64, 128, and 256 subcarriers. QPSK, 30000 runs

0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 Amplitude, volts Probability Chapter 3: Peak to Average Power in OFDM

34 To provide further insight into the total distribution of the PAPR Figure 3.3 plots all simulated OFDM samples for the same case as Figure 3.2. Here it is seen that the sample distribution is lower than the symbol distribution as is predicted by (3.13).
-4 -2 0 2 4 6 8 10 12 10-4 10-3 10-2 10-1 100

(dB) Pr(>0) N=256 N=128 N=64 N=32

Figure 3.3: Simulated OFDM sample CCDF for N=32, 64, 128, and 256 subcarriers. QPSK, 30000 runs.

The effect of various QAM mapping constellations is simulated in Figure 3.4 where it is seen that changing the constellation has a minimum affect on the PAPR which is to be expected considering the M-ary constellations are normalized to have the same average power. This same principle applies to non active subcarriers which also do not influence the PAPR as the average power decreases in line with a reduction in the number of active subcarriers. However null subcarriers have the advantageous side effect of increasing the resolution of the OFDM symbol due to the oversampling effect. Chapter 3: Peak to Average Power in OFDM 35
2 3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100 Pr(>0)

0
(dB) M=4 M=16 M=64

Figure 3.4: Simulated OFDM CCDF for M=4, 16, and 64 constellation mapping. N=64, 30000 runs.

This section has shown that the PAPR per symbol is only a function of the length, N, of the IFFT. The constellation type and number of active subcarriers have a negligible affect on the PAPR after modulation with the IFFT.

3.3 Oversampling discrete OFDM symbols to find true (continuous) peaks


Section 3.2 provided an analysis of the PAPR for critically sampled baseband OFDM symbols. However this analysis does not reveal the peak of the band limited OFDM signal. Oversampling the data in the IFFT increases the resolution of the OFDM symbol giving a closer approximation to the band limited signal after filtering. This is best explained in Figure 3.5 where the complex components of one OFDM symbol with no oversampling is overlaid with the same symbol oversampled at the IFFT by a factor of 8 (to approximate the continuous filtered signal). Note the parabolic trajectory from one discrete sample to another, the growth in new peaks occurs in between the discretely sampled peaks. Chapter 3: Peak to Average Power in OFDM 36
-0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4

-0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 I channel Q channel

P
3

P
1

P
2

Figure 3.5: Simulated OFDM symbol with no oversampling (dashed) with its oversampled version (solid) overlaid on top. The solid circle represents the 6dB level with respect to the average power. N=64, oversampling factors are 1 and 8.

Some interesting observations that can be made viewing Figure 3.5 are at P1 where the only sample of the critically sampled OFDM symbol is above 6dB, when oversampled the true peak grows slightly larger still. At P2 it is seen that the two critical samples that make up its end points are well under 6dB, but after oversampling a peak is produced in between the 2 critical samples. At P3 we see that the second largest peak in the oversampled case occurs at a position where no peak existed in the critically sampled symbol. These results show how the critically sampled OFDM symbol and its oversampled version can diverge greatly in PAPR. The CCDF for various oversampling rates at the IFFT is shown in Figure 3.6, here it is seen that an oversampling factor of 8 is sufficient to represent the continuous signal and results in around 0.5dB increase in the PAPR. Chapter 3: Peak to Average Power in OFDM 37
2 3 4 5 6 7 8 9 10 11 10-3 10-2 10-1 100 os=1 os=2 os=4 os=8 0 (dB) Pr(>0)

Figure 3.6: Simulated OFDM CCDF for oversampling rates of 1, 2, 4, and 8. N=64, QPSK, 15000 runs.

Equation (3.13) assumes that samples are mutually independent and uncorrelated, however Parsevals theorem states that
1 2 0 N m m

xN
=

= (3.14) Therefore the independent assumption of (3.13) is not true, especially in the

oversampling case where adjacent samples are highly correlated to each other. Various papers [34-36] have been published which address the issue of oversampling. Reference [37] suggests that adding a number of extra independent samples to (3.13) will give a closer approximation to the oversampled signal, (3.15)

()
0

0 0

Pr max 1 1
N nN


<

_ _ _ >_ = __ (3.15) Chapter 3: Peak to Average Power in OFDM 38 where _=2.8 gives a good approximation to oversampled signals. Reference [35] takes exception to this non theoretical approximation of the over sampled signal and states that the bound is not close to the theoretical bound for large numbers of N. Reference [35] developed a method for finding the exact peak distribution of band limited Rayleigh processes giving an expression for the CCDF of the PAPR as (3.16)

( ) (0) ( ( )) (0)
00

() ()
(0)

Pr 1 Pr p p N N < = >

1 0
0 Np p p

N N __ = __ __ __ (3.16) where

( ) ()()(
22 22 0

)()

5 112422 00

455

11 15 2 2

uerfc
uu pa

N N u e e u d du

_ _ __ _ _ __ __

__
which is the mean number of peaks above the level 0 in one OFDM symbol, and

(0) 0.64 p N N (3.17)

which is the mean number of total peaks. As this method is numerically cumbersome to solve due to the double integration in (3.16) a simpler approximation of the distribution is developed which is as accurate as the previous method for a large number of N. The simpler method derives the peak distribution of the band limited Rayleigh process based on the level crossing rate approximation and then applies the result to the derivation of the distribution of the CF in OFDM signals. This is made under the assumption that 1) the complex components of the signal, x(t) are ideally band limited Gaussian processes (i.e. N>64) and 2) the peaks are statistically uncorrelated. Also a suitiably high level for a0 must be chosen well above 0 to make the assumption valid, Chapter 3: Peak to Average Power in OFDM 39 i.e. each positive crossing of the level 0 has a single positive peak that is above the level 0 . The CDF of the CF is given as (3.18)

( ) ( ) ()
000 C

Pr Np FC > = < > (3.18) where (

p0

N is the mean number of peaks above 0 , and can be approximated for

high 0 by (3.19)

()
0

00

An expression for the CDF, (

3 p N N e = (3.19)

) C 0 F is then obtained:

()(
2 0

C0C00

F F C >

20 0 2 0

3 0 0

1 0
Ne

e e

_ _ _ __ = ___ __ _ __
00 00

> (3.20) The CCDF can then be expressed as ( ) 0 1 C F (3.21) [35] suggests 0 = for QPSK modulation and a marginally lower value for 16 QAM. Figure 3.7 plots simulated CCDF of the PAPR for QPSK with N=64 and 512 with an oversampling factor of 16, and 20000 OFDM symbols against (3.15) and (3.21). Here we see that the bound from (3.15) is closer to the simulated results for N=64 as a small number of subcarriers does not have a completely Gaussian distribution. On the other hand (3.21) is an excellent bound for N=512 where the Gaussian assumption is true. for for Chapter 3: Peak to Average Power in OFDM 40
4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 Pr(>0) Eq (3.15) Eq (3.21) Simulated N=64 N=512

0
(dB)

Figure 3.7: Theoretical OFDM CCDF from (3.15) and (3.21) for N=64, 512 with simulated results: QPSK, oversampling factor rate of 16, 20000 runs, N=64, 512.

Reference [36] finds bounds for the peak of the continuous envelope based on the maximum of the oversampled sequence. This bound is used to derive a closed form expression for the upper bound of the CCDF in an uncoded OFDM system with large N. Unlike [35] where the PMEPR is derived under the assumption that OFDM signals behave as band limited Gaussian processes, only the Gaussian assumption for each sample is used, there is no assumption on the joint distribution of the samples. The CCDF is given as

{}
2 21

2 0

Pr k PMEPR kNe

__ __ __

> < _ _ (3.22) where k is the oversampling factor, N is the number of subcarriers, and 0 is the clip value. Note that k must be > 2 . Figure 3.8 plots the simulated results (QPSK, os=16) against the theoretical results of (3.22). This method predicts a much higher distribution of the peaks than is seen in simulation and is therefore very pessimistic. Chapter 3: Peak to Average Power in OFDM 41
4 6 8 10 12 14 16 18 20 10-4 10-3 10-2 10-1 100 0 (dB) Pr(>0) (3.22) Simulated N=64 N=512

Figure 3.8: Theoretical OFDM CCDF from (3.22) for N=64, 512 with simulated results: QPSK, oversampling factor rate of 16, 20000 runs, N=64, 512.

Reference [38] extends the theory of [37], [35], and [36] to find new upper bounds for different constellation types such as QAM and PSK, rather than just QPSK. The theoretical results are useful at low probability regions where simulations are time consuming. The upper bound on the CDF for QAM constellations is given as

()
() ()
22 0 222

41 34 0 , ; 1, 2

min k L

Ma MCC LKLK + >>

F KLNe __ __ __ __ __

(3.23) 0 0 where 1 cos 2 LC L = __ __ __ L >1 (3.24) Chapter 3: Peak to Average Power in OFDM 42

() () ()
1 cos 3 cos 1 cos
k

K C K K ____

=_ __ + __ 3 3 K K > Keven Kodd (3.25) where 0 is the clip level, L is the oversampling factor, N is the number of subcarriers, M is the constellation type: M=2 (4 QAM), 4 (16 QAM), or 8 (64 QAM), =0 (4QAM), 1 (16 QAM), or 5 (64 QAM). For BPSK constellation the CDF is given by

()()
2 1222 1 12

1 212,2,2 0 0 , ; 1, 2 10

4 min K L
LN N KllCNCC LN K LKLK ll

FBe

_ ___ __ >>==

__ __ __ + _ _ __ __ __

__
0

0 (3.26) where

()() ( ) (( ) ) 1
1 sin , , cos 1 2 2 2 sin

NN CNN = + + (3.27) and

()()
0 /

1 2/2
L

N N k N C keven

N B Nk
=_ _ _ _

__ =____

(3.28) Results comparing the new CDF calculation against the Gaussian model of (3.13) show that for BPSK the new bound is tighter below 10-4 probability for large N. Interestingly the Gaussian approximation provides better results than the new bound for all N. For 16 QAM and 64 QAM the Gaussian bound again looks better. The CCDF results from Figure 3.6 imply that using an oversampling factor of 1 predicts the PAPR within 0.5dB, however this is not true when PAPR reduction Chapter 3: Peak to Average Power in OFDM 43 techniques are applied to the OFDM symbol [30],[39]. Figure 3.9 shows the CCDF of a 64 point IFFT OFDM modulator clipped at 3dB, interpolated by 8 and then filtered with a 256 tap RRCF, as well as the same OFDM data set left unclipped at the IFFT and then interpolated and filtered with the RRCF.
2 3 4 5 6 7 8 9 10 11 12 10-3 10-2 10-1 100

Pr(>0) Clip at 3dB and filter No clipping and filter

Figure 3.9: Simulated OFDM CCDF QPSK, 64 point IFFT, 256 tap RRCF, _=0.15. Clipped at 3dB after IFFT, then filtered (solid line). No clipping, then filtered (dashed line).

Peak regrowth after clipping and filtering is dramatic, much worse than without clipping and filtering. The degree of peak regrowth is determined by the sequence of the filtered data, the value of the excess bandwidth, _, of the RRCF (smaller _ greater

peak regrowth), the length of zero padding in the IFFT, and the degree of clipping (harder the clipping the greater the regrowth). Oversampling means the introduction of null samples at the input of the IFFT as shown in Figure 3.10. The null samples are introduced in the centre of the IFFT input to ensure that they occupy the outer samples in the frequency spectrum. Chapter 3: Peak to Average Power in OFDM 44
Figure 3.10: Zero padding of the IFFT, null carriers are set in the middle of the input

Figure 3.11 shows the simulated CCDF for a non oversampled IFFT compared to a 2 times oversampled IFFT, both of which are interpolated by a factor of 8 and filtered with a RRCF.
4 5 6 7 8 9 10 11 12 10-3 10-2 10-1 100 Pr(>0) after IFFT -128 pts after filtering -128 pts after filtering -64 pts after IFFT -64 pts 0 (dB)

Figure 3.11: Simulated OFDM CCDF after IFFT (os=1 and 2) and after filtering (os=1 and 2). QPSK, N=64, 256 tap RRCF, _=0.15, 15000 runs.

IFFT
X0 X1 X(N/2)-1 X(N/2) 0 0 X(N/2)+1 X(N/2)-2 XN-1 XN x0 x1

xNos X(Nos)-1
Chapter 3: Peak to Average Power in OFDM 45 This time we notice that the oversampled IFFT has a slightly higher PAPR distribution, but what is of most importance is that the PAPR distribution after filtering has less peak regrowth and is below the critically sampled IFFT. We can conclude that oversampling of the data at the IFFT is an effective way to counter the peak regrowth after filtering. Oversampling beyond a factor of 2 provides a law of diminishing returns in terms of peak regrowth (see Figure 4.10: U=1).

3.4 Effect of Non Linearity on OFDM


The previous sections described the causes of large PAPR and the problems of non linearity, this section describes common models for HPAs which are used in wireless

communications and the effect they have on the OFDM signal in terms of the Power Spectral Density (PSD) and the increase in the BER. Section 3.4.1 describes commonly used HPA models, section 3.4.2 details the corruption of the frequency spectrum due to changing backoffs in the HPA, and finally section 3.4.3 describes the degradation in BER due to changing backoffs in the HPA. 3.4.1 Description of memoryless Non Linearity Non linearities provide the greatest obstacle to OFDM as a practical system due to their distorting effect on the quality of the system. Here we concentrate on the most common form of non linearity, distortion in the RF amplifier due to a limited linear range in the amplifier. Papers [40-44] tend to focus on the distortion due to the RF amplifier stage as this is the most expensive component in a transmitter and takes up to 50% of the cost and space in a unit. The RF amplifier must be driven as close as possible to the maximum signal in the linear region to make it efficient, however when operating near the saturation point it exhibits non linear behavior distorting the transmitted signal. This distortion causes spectral regrowth in the transmitter which can adversely affect adjacent frequency bands, and an increased BER at the receiver. A balance must be met between allowable distortion and the linear region of an amplifier. Therefore it is pertinent to evaluate the performance of OFDM signals through different non linear devices. Chapter 3: Peak to Average Power in OFDM 46 A convenient form of expression for non linear devices is in polar coordinates reference [45]. The input can be expressed as (3.29) x = x earg(x) = e j (3.29) Therefore the complex envelope of the output signal can be expressed as (3.30)

( ) [ ] j( [ ]) g x F e + = (3.30)
where F [ ] and [ ] represent the AM/AM and AM/PM conversion characteristics of the memoryless non linear amplifier respectively. Some commonly used models follow. Soft Limiter (SL) The Amplitude Modulation to Amplitude Modulation (AM/AM) and Amplitude Modulation to Phase Modulation (AM/PM) characteristics of a SL can be expressed as [43]

[]
A F A __ =___ if A if A A if A

< > (3.31)

and [ ] = 0 . There is no phase distortion in this model, only amplitude distortion, A is the clipping level of the amplifier. The AM/AM characteristics of a SL are plotted in Figure 3.12. Chapter 3: Peak to Average Power in OFDM 47
0 0 Input voltage Output voltage -A -A A A

Figure 3.12: AM/AM properties of a Soft Limiter (SL)

Travelling Wave Tube Amplifier (TWTA) According to reference [40] the AM/AM and AM/PM functions are

[ ] ( 2) 1 a
F = + (3.32) and

[]()
2

12

= + (3.33) A common choice for the above parameters is 0.25 a = , 12 = , and 0.25 = . The AM/AM characteristics of a TWTA are plotted in Figure 3.13. Chapter 3: Peak to Average Power in OFDM

48
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 Input voltage Output voltage Ideal TWT

Figure 3.13: AM/AM properties of a Traveling Wave Tube Amplifier (TWTA)

Solid State Power Amplifier (SSPA) Probably the most common and practical model for amplifiers is the SSPA [40]. The AM/AM and AM/PM transfer characteristics can be modeled as

[]1
22

1
PP

F A = _____+___ __ __ (3.34) and

[ ] = 0 (3.35) The parameter P controls the smoothness of the transition from the linear region into the saturation region. When P the SSPA acts as a SL, P=3 is a good Chapter 3: Peak to Average Power in OFDM 49 approximation of a practical amplifier. The AM/AM characteristics of a SSPA and an ideal amplifier are plotted in Figure 3.14.
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 Input voltage Output voltage Ideal P=12 P=3 P=1

Figure 3.14: AM/AM properties of a Solid State Amplifier (SSPA) for different values of P, and ideal

amplifier.

In all amplifiers discussed in this section A represents the saturating amplitude of the amplifier. The non linear distortion depends on the backoff of the amplifier and can be calculated as either the Input BackOff (IBO) or the Output BackOff (OBO), and is defined as
2 10 IN

10log s

A IBO P (3.36)
2 10 OUT

10log

A OBO P (3.37) Chapter 3: Peak to Average Power in OFDM 50 where As is the amplifier input saturation voltage, PIN is the average power at the input, A is the saturating amplitude at the output, and POUT is the average power at the output. 3.4.2 Impact on Power Spectral Density The simulated spectrum for N=64 subcarriers is shown in Figure 3.15. The PSD is measured for each OFDM block then averaged over 2000 blocks to eliminate the effect of the rectangular time window [46]. Blocks of 4 QAM data were mapped to a 64 point IFFT for modulation, the data is then interpolated by a factor of 8 and filtered. A RRCF was used for pulse shaping with 128 taps and an excess bandwidth of 0.15. The low rolloff factor of the filter results in a fast drop off of the spectral splatter outside the normalised FFT bandwidth. A SSPA with P=3 is used as the amplifier model. The frequency axis is normalized. Changing the IBO of the SSPA results in inband distortion and spectral regrowth, or splatter outside the normalized frequency bandwidth of the filter. An amplifier backoff of 0dB results in noise power that is only 16dB lower than signal power. With a backoff of 3dB the noise power is 19.5dB below the signal power. A 6dB amplifier backoff results in a slight amount of spectral splatter 23dB below the signal power and an almost indistinguishable amount of in band distortion. With an infinite amplifier backoff the PSD matches exactly the PSD after transmit filtering. OFDM standards such as Hiperlan2 and 802.11a require adjacent OFDM carriers to be closely packed in the frequency domain with overlapping spectrums. This is achieved by Chapter 3: Peak to Average Power in OFDM 51
-4 -3 -2 -1 0 1 2 3 4 -50 -45 -40

-35 -30 -25 -20 -15 -10 -5 0 Hz Absolute power (dB) HPA=0dB HPA=3dB HPA=6dB 16dB 19.5dB 23dB LPA

Figure 3.15: PSD of 64 subcarrier OFDM signal with 64 point IFFT, RRCF with excess bandwidth of 0.15. A SSPA (HPA in figure) with P=3 and various backoffs is used.

setting null subcarriers at the edge of the spectrum, where the rolloff of the filters occurs in the unused subcarriers, easing the design constraints on the filters. Filtering after the SSPA to reduce spectral splattering is complex, therefore reference [46] proposed clipping in the baseband after the IFFT followed by filtering. This successfully reduces the out of band distortion but the BER due to the in band distortion remains. Also peak regrowth after filtering remains a problem. These issues are explored in more depth in chapters 5, 7. 3.4.3 Impact on Bit Error Rate The inband distortion due to non linear amplification at the transmitter results in ISI when filtered with the matched filter at the receiver, resulting in an increase in the BER when the data is decoded. Two observed effects of clipping (whether due to the amplifier non lineraity or baseband clipping) are a Gaussian like spreading of the Chapter 3: Peak to Average Power in OFDM 52 decoded constellation points and a compaction of the whole received constellation, shown in Figure 3.16 where the two constellation types (4 and 16 QAM) are normalized to have the same average power.
Figure 3.16: Signal constellation at the output of the SSPA, P=3 after non linear amplification a) 4 QAM, 6dB IBO; b) 4 QAM, 0dB IBO; c) 16 QAM 6dB IBO; and d) 16 QAM 0dB IBO.

The noise due to clipping is evenly spread across all subcarriers [47], this is because each sample that is clipped is a conglomeration of all the input samples to the IFFT. Therefore the BER is nearly equal on all subcarriers. Figure 3.17 shows the BER due to a non linearity in the HPA for 3 M-ary constellation types. 4 QAM is very impervious to clipping noise as there is a greater Euclidean distance between constellation points. Indeed 4 QAM has a very acceptable BER performance without any coding or PAPR correction. Mapping types 16 and 64 QAM are much more susceptible to clipping in the amplifier. The BER of Figure 3.17 could be improved with Automatic Gain Control (AGC) in the receiver which would expand the constellation to the correct size before decoding [48]. Chapter 3: Peak to Average Power in OFDM 53
-4 -2 0 2 4 6 8 10-6 10-5 10-4 10-3 10-2 10-1 100

HPA backoff (dB) BER 4 QAM 16 QAM 64 QAM

Figure 3.17: BER of 64 subcarrier OFDM signal with 64 point IFFT, RRCF with excess bandwidth of 0.15 for 4, 16, and 64 QAM constellations. A SSPA (HPA on x-axis) with P=3 and various backoffs is used.

3.5 Conclusion
This chapter defined the problem of PAPR in OFDM beginning with a mathematical analysis of the PAPR in both the baseband and passband. Next the stochastic distribution of OFDM samples was shown through analytical and simulated means to only be a function of the number of subcarriers. It was also shown that the general distribution of samples is quite low and that the transmit envelope follows a Rayleigh distribution. Oversampling of the IFFT was also introduced as an important issue, it was shown that oversampling the IFFT increased the CCDF by ~0.5dB. These results were supported analytically by various papers which analysed the issue of oversampling. Chapter 3: Peak to Average Power in OFDM 54 Furthermore, it was shown through simulation that filtering has a dramatic affect on the CCDF when combined with PAPR reduction techniques such as clipping. The peak regrowth is much worse than when no PAPR reduction techniques are used. Oversampling at the IFFT was shown to both increase the discrete CCDF and reduce the continuous, filtered CCDF. Non linearities were also examined with models of the most common form of non linearity, the SSPA, described mathematically. The impact of non linearity on the PSD and BER was simulated. It was shown that reducing the IBO of the SSPA causes both spectral spreading, affecting the adjacent channels, and in band distortion corrupting the BER at the receiver. Higher order constellations were shown to be more susceptible to lower backoffs in the SSPA. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 55

Chapter 4 Peak to Average Power Solutions Distortionless Techniques


The previous chapter outlined the disruptive effects of an uncontrolled OFDM signal envelope on system performance. This chapter and chapter 6 provide an analysis of solutions to the PAPR problem, each solution has advantages and disadvantages in terms of PAPR reduction, distortion of data, and complexity. This chapter reviews distortionless PAPR reduction techniques. Distortionless techniques do not corrupt the data and encode it in such a way that it can be completely recovered at the receiver, however they are usually more complex. Specifically Section 4.1 explains the family of coding techniques for PAPR reduction, Section 4.2 elaborates on Multiple Signal Representation (MSR) and phase rotating techniques, Section 4.3 details modified constellation techniques, and finally Section 4.4 summarizes the

chapter and provides a comparison in terms of complexity and performance of the detailed techniques. The other side of PAPR reduction are distortion introducing techniques, these techniques deliberately attenuate the envelope of the signal corrupting the BER as described in Section 3.4, however things can be done to lessen the effect of the introduced distortion. These techniques will be explored in Chapter 6. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 56

4.1 Coding techniques


Many early papers considered how standard coding techniques could be applied to OFDM. The basic premise of coding is to insert redundant bits into the data stream which can be used for error correction at the receiver. Their application to PAPR reduction is in creating sequences of bits which will exhibit low PAPR after the IFFT. There are 2 types of error detection and correction codes, block codes and convolutional codes. Most papers relate to the block coding family for PAPR reduction. During the encoding process k information bits are encoded into n code d bits, therefore (n-k) redundant non information bits are added to the k information bits [16]. The block code is referred to as an (n,k) code, and the rate of the code as Rc=k/n. Figure 4.1 is a block diagram showing where coding for PAPR reduction is located in an OFDM transmitter.
Figure 4.1: Block diagram of OFDM transmitter showing PAPR coding

The ability of a code to correct errors is a function of the code distance, (4.1)

()()
1

,,

, mod
N ijiljl l

dCCCCq
=

=_ (4.1) where d is the distance of the codeword and q is the number of possible values of Cj and Cl. The smallest distance dmin is the minimum distance for a given set, (4.2)

{ ( )}
Bit source PAPR coding

min

, i j d = Min d C C (4.2)

IFFT
Serial To parallel

n k n+k
Chapter 4: Peak to Average Power Solutions Distortionless Techniques

57 Different codes exhibit different degrees of error correction ability. Another important property of codes is the weight of the code, which is the number of non zero elements in the codeword. Types of block codes are Hamming, Golay, and ReedSolomon, some of which are used for PAPR reduction. 4.1.1 Block Codes The first paper to apply coding techniques for PAPR reduction in OFDM was reference [49]. The basic premise of this paper was to determine which combinations of data at the IFFT input produced large peaks at the output and to avoid transmitting these sequences by adding redundant bits to the input. Initially a simple (impractical) OFDM system with 4 subcarriers and BPSK modulation is considered. A 3 bit data word is mapped on to a 4 bit code word, ie (3,4) block code, so that the set of allowable code words does not create excessive envelope spikes. They identify the code as an odd parity code and state that the PEP is reduced from 6.02dB to 2.48dB, a reduction of 3.54dB. An 8 subcarrier system with BPSK constellation is also evaluated to illustrate how block coding can be traded of against PAPR reduction. The permissible number of codewords CWperm is traded off against the total possible number of codewords, CWposs. With no block coding the PAPR is 9.03dB. If half the code words are allowed, i.e. a (7,8) rate code, the resultant PAPR is 4.45dB. If a quarter of the code words are allowed, a (3,4) rate code the PAPR is 3.01dB. The PAPR reduction comes at the cost of an increase of bandwidth for the same data rate and a reduction of the energy per transmitted bit for the same transmit power. However the increase in bandwidth is small and is offset by the high spectral efficiency of OFDM, as is the reduction in energy per transmitted bit which is offset by the possible error detection/correction potential of block coding. No results on the minimum distance of the code are given but the authors indicate that a large number of the codes found are Golay complementary sequences. These codes are explored in greater depth in later papers. Practical OFDM systems employ at least 64 subcarriers and higher order mapping types which make the complexity of a block coding scheme Chapter 4: Peak to Average Power Solutions Distortionless Techniques 58 grow exponentially as the coding must be done in combinational logic or a Look Up Table (LUT). Aware of the limitations of the previous paper Jones and Wilkinson extended the block coding principle in reference [50]. 4 and 8 subcarrier systems are evaluated using QPSK mapping. They prove that the code rate is unaffected by changing the number of carriers and mapping types. 3 areas are identified to make coding techniques more practical Selection of suitable code words for any number of carriers and any M-ary mapping type. Selection of code words that allow efficient implementation of the coding/decoding. Selection of code words that also offer error detection/correction properties. The authors review a set of papers published in the 80s which addressed a similar problem for multitone test signals with low PAPR. Although certain parameters

differ much of this work can be applied to the OFDM case. The reverse of a code word will result in the same PAPR, i.e. for the 4 subcarrier BPSK case 1000 will give the same PAPR as 0001. Codes such as these are derived from Shapiro-Rudin sequences which are a subset of Golay complementary sequences. A definition of Golay complementary sequences is given in the paper as a pair of equally long, finite sequences whose aperiodic autocorrelations sum to zero for all non-zero displacements. Analysis of the code words that exhibited low PAPRs for 4 and 8 subcarriers revealed them to be Golay complementary sequences, however they did not give the optimum minimum solution, but they are amenable to mathematical encoding and decoding and the authors note that they may have error correction properties. For an 8 subcarrier BPSK OFDM signal based on Golay complementary sequences a (5,8) rate code can give a PAPR of 3.01dB. Larger code sets can be found by rewriting the definition of an OFDM signal (4.3) Chapter 4: Peak to Average Power Solutions Distortionless Techniques 59

( ) ( ) (2 )
1
nn

nN jft n n

s t d t e
= + =

= _ (4.3) as (4.4)

( ) ( ) () ( ) () /2
2121 1
ss

nN jnftjnft nn n

x t d t e d t e
= + =

= _ + _ _ _ _ (4.4) This form gives a term in the summation as 2 carriers that are equidistant from and on either side of the centre frequency of the complex envelope representation. If the dn data words are chosen so that the resultant vector lie on one of two orthogonal lines, then the PAPR is less than or equal to N/2. The principle of these sequences can be extended to give polyphase sequences suitable for multilevel phase modulation. The search for better sets of code words with error detection/correction properties is addressed in later papers. The authors of [49, 50] presented another paper, [51], in which they describe Combined Coded OFDM (CCOFDM). CCOFDM attempts to exploit the error detection/correction properties presented in reference [50] while still maintaining the

PAPR suppression. Polyphase weighting codes are applied to the encoded data (chosen from a low PAPR set) which are known at the receiver. Thus they can be compensated for without affecting the distance properties of the code. The BER in a non linear channel for CCOFDM is compared to the COFDM channel and significant improvement in the BER is seen in a highly non linear channel. Still, the work is limited to 8 subcarriers. Reference [52] expanded on the work of reference [51] by developing an algorithm to compute the phases that minimize the PAPR for larger sets of data of practical interest. The authors found sets of phase values (calculated offline) which are known at both the transmitter and receiver that reduce the PAPR without affecting the error correction properties. These phase shifts that calculated for various coding rates in the Hiperlan2 [53] standard where there are 48 information bearing subcarriers. For rate BPSK with 90 phase shifts a reduction of 4.09dB is reported. The reduction Chapter 4: Peak to Average Power Solutions Distortionless Techniques 60 can be improved slightly by using smaller phase shifts such as 8PSK and 16PSK. The authors elaborated on the results given in [52] in [54] where the computation of the PAPR is given in a rigorous mathematical proof. Going back in time another early paper [55] extended the work in reference [49] by identifying Quadature Phase Shift Keying (QPSK) message structures which produce high PAPR. They are grouped into equivalence classes where messages which have the same PAPR are in the same equivalence class. Up to N=5 subcarriers are analyzed with different coding rates. They show that that a small amount of redundancy can significantly reduce the PAPR. The authors of reference [55] developed their results in references [55, 56] to provide bounds for the PEP of OFDM using basic coding techniques as described in reference [49] for up to 16 subcarriers. They prove through analytic and simulated results that only 4 bits of redundancy are required to reduce the PEP to within 10% of its optimum value as the number of subcarriers is increased. They show that further redundancy provides little benefit in terms of PAPR reduction. Reference [57] presented an idea again based on reference [49] where vectors or messages of data which exhibit a high PAPR are attenuated. The amplitude of subcarriers which are above a given threshold are uniformly reduced to achieve equality between the maximum of the envelope power and the threshold level. Also where the envelope power is below the threshold these subcarriers are increased to obtain equality. This allows the PAPR to be reduced without affecting the net bit rate. Results are given for a BPSK and QPSK with less than 20 subcarriers. The PAPR is reduced significantly however this comes at the cost of a reduction in the SNR which becomes more pronounced as the number of subcarriers is increased. Reference [58] is another early paper which draws a link between the number of subcarriers and the mapping type used (4 Phase Shift Keying (PSK), 8 PSK, etc). It is stated that if sub carriers in the same group (where a group is a set of subcarrier frequencies sharing some relation) are phase shifted by the same amount, then the envelope remains unchanged. This concept is used to create code sets with low PAPR properties. Reed-Solomon codes are examined with parameters N=16 and M=4. Chapter 4: Peak to Average Power Solutions Distortionless Techniques

61 They also note that the constraints on the code can be loosened up if at least some variation in the envelope is allowed. 4.1.2 Bounds on PAPR Reference [52] provided an efficient computational method for finding offsets which can be used with coding in OFDM, which could allow large reductions in the PAPR, but no concrete level on the PAPR has been proven. Reference [59] addressed this issue by providing bounds for the PAPR for different error correction coding schemes coined trace codes. Duals of primitive BCH codes are identified as good error correction codes. Reference [60] presented an interesting paper which clarified the relation between the PAPR and the out of phase aperiodic autocorrelation of the message or data sequence. The PAPR as defined in (3.3) can be bounded as (4.5)

()1
1

2 1
N k

k N
=

+ _ (4.5) where

()*
1 Nk nkn n

kaa
+ =

= _ for k=0, N-1 (4.6) and an is the input data sequence to the IFFT. (4.6) shows that binary or polyphase sequences with low out of phase aperiodic correlation values can be used to construct low PAPR signals. The problem remains to find sequences that reduce the PAPR. (4.6) is useful as it is a much less complex process to calculate than (3.3). 4.1.3 Cyclic Codes Reference [61] developed a simple method based on rate code rate cyclic coding which can reduce the PAPR by 3dB when the number of subcarriers is a multiple of 4. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 62 The phase of every 4th subcarrier is calculated so as to minimize the amplitude giving a rate code. The method is simple to implement. Reference [62] achieves the same results as [61] but with reduced complexity. The authors also introduced Sub Block Coding (SBC) where systems with a large number

of subcarriers are divided into sub-blocks with the last bit of each subblock altered according to the method described in [49]. By dividing the OFDM frame into subblocks larger number of subcarriers can be used while still maintaining a reduced PAPR, which is a problem with other earlier coding schemes. The idea is developed to optimize the positions of the odd parity checking bits for further PAPR reduction at the cost of the introduction of side information to inform the receiver of the positions of odd parity checking bits. 4.1.4 Shapiro-Rudin codes A very early paper [63] applied Shapiro-Rudin and Newmen phases to multitone frequency response testing. Although the application was not for multicarrier applications the theory still holds with multicarrier signals. Reference [63] stated that a multitone signal can achieve a CF under 6dB for Shapiro-Rudin phases and around 4.6dB using Newman phases for an arbitrarily large number of subcarriers. Many definitions which were to become convention in later papers as far as the statement of the problem of large PAPR, as well as identifying some very pertinent parameters in the makeup of multitone signals. It was also noted that the set of tones needs to be a power of 2 (which is the case in all OFDM standards) in order for the codes to be optimum. Reference [63] closes the paper with an open question as to whether the CF can be reduced lower than 3dB. The authors also state that Newman phases which vary quadratically exhibit a lower CF than Shapiro-Rudin sequences which vary linearly for all N. Eleven years later reference [64] revisited the application of Shapiro-Rudin sequences to reduce the PAPR in OFDM. In [64] a rate code is employed for QPSK signals with 16 information carriers giving 32 subcarriers in total. 4 bit input messages are concatenated to 4 bit codewords which are determined according to the message data. A simple digital circuit using eight two input XOR gates can be used for the encoder. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 63 Results taken from a limited subset of possible messages suggest that the coded signal will reduce the PAPR to of the original uncoded message. 4.1.5 Golay complementary codes Golay codes are linear binary block codes which are the only non trivial example of a perfect code. As every codeword lies within distance 3 of any codeword they can be used in conjunction with maximum likelihood detection [16] for decoding which is not overly complex to implement. In reference [65] the use of Golay complementary codes is examined. Golay sequences were first recognized to have good PAPR properties for application in OFDM in reference [50]. An algorithm was developed where certain subsets of codes up to length 16 have a minimum distance of half the code length and have a PAPR of 3dB. Existing Forward Error Correction (FEC) codes are incorporated into PAPR techniques with a new decoding algorithm developed which utilizes the efficient inverse Walsh-Hadamard transform. However the new decoding algorithm has 3dB worse performance than the optimum maximum likelihood detection. The author notes that the scheme may be unfeasible for a large number of subcarriers as the length of the codes is the same as the number of subcarriers. However reference [65] proposed ways to nullify this effect at the cost of the PAPR reduction

made and error correction capability by breaking the total number of subchannels into smaller groups and applying a complementary code to each group. 4.1.6 Reed-Muller Codes Golay complementary sequences were further developed for PAPR reduction in reference [66] where again the PMEPR is found to be at most 2 (3dB) when the data sequence is constrained to be a member of a Golay complementary pair. However Golay pairs have a high overhead in terms of redundancy and may not be a practical coding solution for OFDM. Therefore the authors developed a more suitable coding method by recognizing the relation between Golay complementary sequences and Chapter 4: Peak to Average Power Solutions Distortionless Techniques 64 second order Reed-Muller codes (RM(2,m)), i.e. in the binary case Golay sequences occur as cosets of the first order Reed-Muller code within the second order ReedMuller code. Standard decoders can be used in the receiver for the RM(2,m). RM(2,m) allow the code rate to be improved by allowing a slight increase in the PMEPR. RM(2,m) further developed the theory of reference [65] by allowing a trade off to be made between the Hamming distance, PMEPR reduction made, code rate, and the number of phases allowed in the PSK mapping type. The authors of reference [66] advanced their work further in reference [67] providing many mathematical proofs for Golay sequences, Reed-Muller codes, decoding using fast Hadamard transforms. Basically this paper formalized the results given in reference [66] and extended the results for larger sets of variables such as an increase in the PSK mapping type. However a limitation which the authors note is that the codes are limited to 32 subcarriers where the resulting code rate will be high. Reference [68] examined the Reed-Muller coding scheme presented in reference [67] through simulation of an end to end system with various non linearitys and 16 subcarriers. It was shown that at -40dB ACI the new coding scheme had around a 12dB gain in IBO over an uncoded system in experimental results, however this gain drops to 4dB for the simulated results. Reference [69] also uses Reed-Muller codes in a simulation environment to test their performance in the presence of AWGN to determine the BER with QPSK, and 8 PSK mapping types, and different code lengths, m. The number of subcarriers is chosen such that N=2m, the length of the code. Results indicated that increasing the code length improves error correction capabilities for high SNR values, but at low values of SNR the uncoded system has better performance. As the code rate decreases performance is improved. QPSK and 8PSK are compared in terms of their BER, and it was shown that to maintain the same BER the code length for 8PSK must be increased. Reference [70] also enlarged on the work of [67] by developing new decoding algorithms with generalized fast Hadamard transforms. The complexity and performance of their decoding algorithm is compared to the standard maximum Chapter 4: Peak to Average Power Solutions Distortionless Techniques 65 likelihood method. Suboptimal algorithms are also presented with reduced complexity which were shown to have minimal degradation over optimal methods. Another paper to address decoding issues is [71] which used soft decision decoding

methods for block codes. The performance is evaluated in an AWGN channel with PSK modulation and 8 and 16 subcarriers. The authors point out the inherent problem with coding schemes i.e. the decrease in coding rate with the increase in the number of subcarriers but state as others have that the PAPR reduction gained can be traded off for reduction in complexity and code length. The new block coding method coined Ordered Statistic Codewords (OSD). BERs for (8,4) and (16,5) block codes utilizing 8 and 16 PSK show that with partial order decoding almost optimum results are achieved compared to the union bound. Complexity compared to MLD methods is greatly reduced at the expense of a minor error increase. A limitation of references [67, 68] is that PSK mapping types are used. Most practical OFDM standards use QAM mapping [53]. Reference [72] addresses this limitation by combining QPSK constellations to form any M-ary QAM constellation. This is achieved by using shift and rotation operations as defined in (4.7)

( )( )
1 1

1 2 0

2 2 exp 24
i i

n ix M i

j QAM j
=

_ _ _ _ = __ __ _ _ _ _ _ _

(4.7)

where the QPSK constellations can be expressed as xi j , where { Any QPSK sequence (

) 0 1 1 ... N a a a a = can be associated with another sequence

} 4 0,1,2,3 i x Z = .

... N 1 ) i i i i x = x x x where the elements of xi are in Z4. Golay and Reed-Muller sequences can now be created using the theory developed in reference [67]. Reference [73] extended on the coding theory of references [49, 50] by finding codes that simultaneously reduce both the PAPR and the ICI which is introduced by frequency offset between the transmitter and receiver. They define a new measurement term, Peak-Intercarrier-to-Carrier Interference (PICR) to quantify their results. As minimum PICR and PAPR do not occur in the same code word, a balance Chapter 4: Peak to Average Power Solutions Distortionless Techniques 66 is found where both are at an acceptable level. Simulations performed on a rudimentary system with 16 subcarriers and BPSK modulation in AWGN show that the PAPR is reduced by 3dB with a decrease in the PICR. The authors note that finding codes for higher mapping types and more subcarriers is difficult.
01

Coding techniques while popular in early OFDM papers have since fallen out of favour. Their performance would be largely negated in a practical communications system due to the interleaving stage which follows the coding. It is also worth noting that none of the papers referenced consider the effect of over sampling.

4.2 Multiple Signal Representation


Multiple Signal Representation (MSR) techniques are another distortionless method for PAPR reduction. The basic premise of MSR is to produce a set of alternative transmit signals seeded from the same data source. Various techniques are used to encode the alternative sets of transmit signals, which are encoded in such a way so that they will have different PAPR properties. The transmit signal with the lowest PAPR is chosen for transmission. 4.2.1 Partial Transmit Signals Reference [74] first proposed Partial Transmit Sequences (PTS). PTS generates a signal with a low PAPR through the addition of appropriately phase rotated signal parts. The original signal is given by (2.2) and is reproduced in (4.8)
12 0

1 .
N j nk N mm k

xXe N
=

= _ where 0 n N 1 (4.8) The signal to be transmitted is broken up into several sub blocks, m,k X , of length N/V (where N is the number of sub carriers and V is the number of sub blocks). All subcarrier positions which are occupied in another block are set to 0, i.e.
, 1 V mmk k

XX
=

=_ Next a constant phase rotation, ,


, j mk mk

p e + = , [ ) , 0, 2 m k 1<k<V is Chapter 4: Peak to Average Power Solutions Distortionless Techniques 67 performed on each subblock except for the first one which is kept constant, giving (4.9)
,, 1

.
V mmkmk k

XpX

=_ (4.9) The information in m X is the same as m X but with an added phase rotation, which must be known at the receiver. An IFFT is performed on each subblock which are then all summed together to create a possible transmit symbol, (4.10)

{}
11

,,,,

..
VV mmkmkmkmk kk

x p IFFT X p x
==

=_ =_ (4.10) The process is repeated again with a different phase rotation, m,k p , to produce another alternative transmit signal. The optimum parameters for the transmit symbol are (4.11)

{}
{ } ,1 ,
,1 , , , 01 1

argmin max .
mmV

V mmVmkmk nN ppk

pppx
<=

__ =__ __

(4.11) The optimum phase angle, m,k p , is obviously the one where the PAPR is minimized. Therefore the actual transmit signal is given as (4.12)
,, 1

.
V mmkmk k

xpx
=

=_ (4.12) Figure 4.2 shows a block diagram of a PTS transmitter. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 68
Figure 4.2: Block diagram of PAPR reduction using the PTS approach.

PTS requires side information to be sent to the receiver to inform it of the phase rotation used so the data can be decoded. Reference [74] noted that the number of angles should be kept low to keep the side information to a minimum. If each phase rotation is chosen from a set of W admissible angles then the required number of bits for side information is, Rap=(V-1)log2W bits per OFDM symbol. In order to reduce complexity the phase angles should be restricted to {1, j} , i.e. W=4, this allows multiplications to be performed with sign changes. Simulations shown in Chapter 5 reveal that increasing the number of allowed phase angles has a minimum impact of PAPR reduction. Reference [75] noted that explicit side information can be avoided if differential encoding is used for the modulation across the subcarriers within each subblock. In this case only the block partitioning need to be known at the receiver and one subcarrier in each subblock must be left unmodulated as a reference carrier. Side Information (if needed) x m Bit source Coding Mapping Sub block partitioning

IFFT

+
IFFT IFFT
Peak value optimization
m,1 X m,2 X m,V X xm,1 xm,2 xm,V pm,2 pm,V

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 69 PTS is flexible as the number of blocks and phase rotations can be increased providing more alternative transmit signals to choose from. The disadvantage of this scheme is the complexity, especially with an increase in V and W. Also, a large amount of memory is required to store the alternative transmit signals (if check performed in parallel) in order to compare them to find the one with the lowest peak value. Alternatively the optimisation can be performed in an iterative fashion where the current best transit signal is stored until a better one is found, at the cost of increased latency.

It should also be noted that the data can be divided into sub blocks in different ways as noted in reference [76] and shown in Figure 4.3. Different PTS sub block structures have varying performance with pseudo random having the best and interleaving having the worst. Of course there is a trade off with complexity, interleaved sub blocks are the least complex PTS structure to implement (The size of the IFFTs can be halved by interleaving the input data to the IFFT and performing the last stage of the IFFT operation at the IFFT output [77].) and pseudo random PTS is the most complex (random sequences require more hardware complexity to implement in this case). Also computationally efficient IFFTs can be used to exploit the number of zeros in the PTS sub-blocks. Results from reference [74] indicate that pseudo-random partitioning is 0.5 to 0.9dB better than adjacent partitioning. It should be noted that no guaranteed level of PAPR reduction can be provided with MSR techniques, all they can do is reduce the probability of large peaks. Figures 4.4 and 4.5 show CCDF results with varying factors of V and W, N is set at 64. The simulation model is described in Figure 5.1. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 70
X(0 ) n X(1) n X(2)
N -1 n

(a)
X (0 ) n X (1 ) n X (2 ) N -1 n (b )

X (0 ) n X (1 ) n X (2 ) N -1 n (c )

Figure 4.3: An example of the 3 main PTS structures: (a) Interleaved (b) Adjacent (c) Pseudo-random.

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 71


2 3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100 o (dB) Pr( > 0) Uncoded V=2, W=4 V=2, W=8

Figure 4.4: Simulated CCDF for PTS-OFDM with V=2 and varying W. N=64, adjacent subblock partitioning.
2 3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100 0 (dB) Pr( > 0) Uncoded

V=2, W=4 V=3, W=4 V=4, W=4

Figure 4.5: Simulated CCDF for PTS-OFDM with W=4 and varying V. N=64, adjacent subblock partitioning.

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 72 4.2.2 Oversampling PTS Early on in the development of PTS a problem was identified which skewed the true gains of the proposed technique. As already mentioned in section 3.3 oversampling at the IFFT is important to get a true reflection of PAPR reduction. Reference [34] first identified this issue and showed that for a PTS system with V=4, W=4 the PAPR reduction gained using the True Peak Factor (TPF) i.e. oversampled signal is only 1dB better than the uncoded case. In reference [34] it is shown that the true peak will move away from the discrete sampled points. There is a 3dB difference in the reduction between TPF and Lower Peak Factor (LPF), i.e. the discrete PAPR. Reference [34] goes on to suggest a new optimisation technique based on the aperiodic autocorrelation of m,k x , k .

{}
{ } ,1 ,
1 ,1 , 1

argmin
mmV

N mmVk ppk =

pp __ =__ __

(4.10) A reduction of 2.5dB can be achieved when using the optimisation criteria of (4.10) at Pr(_>_0)=10-5. However this process is complex, for QPSK the calculation of k can be achieved with less multiplications and replaced with integer additions giving a total complexity of O(4V-1N2). Reference [39] acknowledges the limitations of discrete sampled PTS and notes that the mismatch between discrete and continuous CCDFs occurs due to DSP filtering after the IFFT, peak regrowth becomes more pronounced with a sharper rolloff, _, of the filter. Reference [39] suggests that oversampling the IFFT by a factor of 2 (zero padding) will reduce the peak regrowth affects after filtering. Oversampling of PTS is explored in more depth in Chapter 5. Another paper, [78], to explore the PTS approach looked at alternative ways to create

sub-blocks. The new method coined Concatenated Pseudo Random Subblock Partition Scheme (CPR SPS) divides the OFDM symbol into multiple disjoint subblocks and assigns signals randomly in each subblock. The partial sub band is then Chapter 4: Peak to Average Power Solutions Distortionless Techniques 73 duplicated and assigned to the rest of the subbands repetitively to form a complete sub block. This procedure is depicted in Figure 4.6 for 3 subcarriers.
partial subband duplication

X(0 ) n X(1) n X(2)


N-1 N-1 n

Figure 4.6: Generation of sub-blocks for PTS using Concatenated Pseudo Random Sub Block Partition Scheme (CPR SPS).

C is the concatenation factor, i.e. the number of sub-blocks. C=1 is equivalent to pseudo random PTS. Results indicate that CPR SPS can achieve similar results to the optimum pseudo random block allocation method but with a decrease in complexity, especially when efficient FFT structures such as the Cooley-Tukey algorithm are used. Best results are obtained with low values of C (2 or 4) which results in a marginal decrease in complexity compared to pseudo random PTS. For a significant reduction in complexity C=16 has similar performance to the adjacent subblock allocation method. References [79, 80] also examined the PTS complexity reduction. The basic principle of these papers is to produce a suboptimal iterative algorithm which uses a similar structure to PTS but with reduced complexity. This method is coined Suboptimal Exhaustive Search (SES) algorithm and exhibits good performance with a minimal number of trials. One of the problems of PTS is that in order to work out which Chapter 4: Peak to Average Power Solutions Distortionless Techniques 74 transmit signal has the largest peak, every sample in each possible signal has to be checked to work out its peak value. The process of finding large peaks contributes significantly to the complexity of PTS. If the time domain signal is given as (4.9) then the phase optimization values are restricted to [ ] , 1 m k p = . The SES algorithm works as follows: 1/ Assume pm,k=1 for all k and compute the PAPR of the combined signal (ie. no optimisation). 2/ Next, invert the first phase factor (pm,k=-1) and recompute the resulting PAP for the first value in each sub block, V. If the new PAP is lower than the previous value use b1 as part of the final phase sequence, otherwise use the original value for pm,1 (ie. 1). 3/ The algorithm continues until all V points have been given this treatment. Simulations for V=16 sub blocks, and N=256 subcarriers exhibited around a 1dB degradation compared to standard PTS with adjacent subblock partitioning at Pr(_>_0)=10-3. However the optimisation process has been reduced to N2 additions, a considerable saving in terms of hardware operations. Note that in the simulations an oversampling factor of 4 is used, which is sufficient to catch all peaks.

Various parameters were varied in the simulation such as a) the number of sub blocks, V. b) number of allowed phases, pm,k for optimization and c) data constellation size. It was concluded that the performance improves with an increase in V. However as V is increased the performance improvement becomes less pronounced. Increasing the number of phases to four [1, j] provides a minor improvement for higher values of V and up to a 1dB improvement at low values of V. The data constellation size has a minimal effect on performance. The SES algorithm has similar performance to traditional PTS with optimum phase selection (within 1dB of optimum solution at Pr(_>_0)=10-4) but with reduced complexity. However complexity increases substantially with an increase in V. Reference [81] again looked at developing optimum phase factors for PTS with reduced complexity. The new method called Optimal Limited Search (OLS) also Chapter 4: Peak to Average Power Solutions Distortionless Techniques 75 addresses the oversampling issue by including an oversampling factor, L. The output of the PTS algorithm can be written as (4.11)
1 2

11 21 1 12 22 2 12

... ... ... ... ... ... ... ... M


j M j M j LN LN MLN

AAAe AAAe S AAAe


____ ____ =________ ____ __ __ __ __ (4.11) where

( ) ( ) 1 ,....., T
S = __S S __ (4.12) contains the optimized signal samples. Figure 4.7 shows a block diagram of an OLSPTS transmitter. OLS sorts same subcarrier positions over V IFFTs in order of magnitude as depicted in (4.13) 1 2 ..... r i r i rMi A > A > > A (4.13)
LN

then chooses
rli rl rli

A A _ = _ _ 1,3,... 2,4,... l l = = (4.14) The minimum amplitude sum for one IFFT point is then given by (4.15)

( ) 1 2 3 ... i r i r i r i S = A A + A (4.15)

The phase selection (4.14) nearly always yields the maximum amount of amplitude cancellation for the ith signal sample. Next all LN solutions are calculated and the one that minimises the maximum signal samples is chosen. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 76
Figure 4.7: Block diagram of OLS-PTS transmitter.

The effect of quantization and the PSD is also examined for OLS. Quantization is shown to have a negligible effect on performance and the out of band radiation is also shown to be better than the OBPS case. OLS out performs the methods of references [79, 80] for larger values of V (>8), also the number of iterations is independent of V. Simulations (QPSK N=256) show OLS to be slightly better than the OBPS of reference [74] for V>4 and much better for V<4. The complexity of the 3 preceding methods is summarized in Table 4.1. Reference [82] developed another scheme to find optimal phases based on orthogonal vectors called Orthogonal Projection-based PTS (OP-PTS). The authors state that OP-PTS can reduce the PAPR within 0.4-0.8dB (using adjacent partitioning) of OBPS using pseudo random partitioning and is suitable for a large number of subcarriers.
IFFT IFFT A11 A1LN A13 A12 S/P S/P X1 XV Phase

optimisation Av1 AvLN Av3 Av2 X Data source S1(_) SLN(_) S3(_) S2(_) Table 4.1: Number of complex multiplications and magnitude operations required.

L=4; N=256; W=2 V=2 V=16


OBPS (Huber et al) 33,554,432 5.52*1070 SES (Cimini, et al) 2048 33,554,432 OLS (Tellambura et al) 1,048,576 1,048,576 Chapter 4: Peak to Average Power Solutions Distortionless Techniques 77 The complexity of OP-PTS is better than OBPS when a large number of subcarriers and sub-blocks are used (N=128, W=4, V<=5), but improvement using OP-PTS reduces with an increase in V. The complexity of OP-PTS is also slightly lower than SES for high N and V. Note that no oversampling is used in OP-PTS. A suboptimum solution for OP-PTS approach to find near optimum phases with negligible affect on performance is also presented. The optimum phases are chosen from a set of 32 selected phases. 4.2.3 Selective Mapping In reference [83] another multiple signal representation method is presented called SeLected Mapping (SLM). The basic idea of SLM is to produce U alternative transmit sequences seeded from the same data source and then to select the transmit signal exhibiting the lowest PAPR. The idea stems from the fact that as the PAPR is determined by the sequence of the transmit data vectors, Xm, multiplying the data vectors by some random phase will change the PAPR properties after the IFFT. Mathematically, a set of U markedly different, pseudo random fixed vectors are generated, ( ) ( ) 01uu, , u DP P P = _ _ _ _ , ( ) ( ) , ( ) [0,2 )
u ku

P = e+ , 0 k < N,1 u <U . The data, Xm, is multiplied subcarrier wise with each one of the U vectors, (u) P , resulting in a set of U different possible transmit symbols, (u) m X as depicted in (4.16)
kk

()()
,,

. uu X = X P , 0 k < N,1 u <U (4.16)

mkmkk

Next, all U possible transmit vectors are transferred to the to the time domain via the IFFT, (u)
mm

{ ( )}
u

x = IFFT X , and the transmit symbol with the lowest PAPR, m x , is chosen for transmission. A SLM transmitter block structure is depicted in Figure 4.8. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 78
Figure 4.8: Block diagram of an SLM OFDM transmitter.

Simulations performed with N=128 and 4-PSK mapping indicate that at Pr(_>_0)=104, with U=4 a PAPR reduction of ~3dB is gained compared to the uncoded case. Complexity reduction can also be achieved by restricting the random generated data to ( ) { 1, } u m P j avoiding complex multiplications. Figure 4.9 shows the CCDF of SLM with N=64, os=1, and U=1 to 32. At Pr(_>_0)=10-4 the PAPR is reduced from 2 to 4.5dB for U=1 to U=32 respectively. Figure 4.10 shows the CCDF of SLM with N=64 and U=1 (uncoded) 3, and 8. The oversampling rate is set at 1, 2, and 4, and 8. Here it is seen that oversampling has a minimal affect on the PAPR, only increasing it by ~0.5dB. An oversampling rate of 4 is sufficient. This is due to all the alternative transmit signals being uncorrelated as shown in Figure 3.6. This is one of the main advantages of SLM, i.e. that oversampling and filtering does not increase the PAPR dramatically as it does in PTS (shown in Chapter 5). Again oversampling is seen to have a negligible affect on the PAPR, the purpose of this plot is to show that oversampling still has a minimal affect at higher values of U. Side information Bit source Coding Mapping Serial to parrallel

IFFT
Select transmit symbol with lowest PAR

IFFT IFFT
(U ) p (2) p
m

X X

(1)
m

(2)

X X x x

(U )
m

(1)
m

(2)
m

(U )

x x Chapter 4: Peak to Average Power Solutions Distortionless Techniques 79 Side information is also an important issue in SLM as the receiver needs to be informed which vector, (u) P , was used. Log2(U) bits are required to send this information increasing redundancy. As loss of this information (in a fading channel) means the complete loss of the transmit symbol channel coding is required to ensure correct recovery of the data at the receiver, increasing redundancy further. In reference [74] SLM is compared to PTS for various combinations of V, W (PTS) and U (SLM). In terms of redundancy (side information) SLM with U=1..5 outperforms PTS using pseudo-random subblock partitioning (W=4, V=1..5) at the cost of greater system complexity. In terms of performance PTS has better PAPR reduction if the number of IFFTs is fixed as PTS can vary W with no additional IFFTs. 4 alternative signals produced with PTS require 3 IFFTs while the same performance with SLM can only be achieved with 4 IFFTs.
m m
2 3 4 5 6 7 8 9 10 11 12 10
-4

10
-3

10
-2

10-1 100 0 (dB) Pr(>0) U=1 U=2 U=3 U=4 U=8 U=16 U=32

Figure 4.9: Simulated CCDF for SLM-OFDM for varying values of U. N=64, os=1.

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 80


3 4 5 6 7 8 9 10 11 12 10
-4

10
-3

10
-2

10-1 100 0 (dB) Pr(>0) os=1 os=2 os=4 os=8 U=1 U=3 U=8

Figure 4.10: Simulated CCDF for SLM-OFDM. U=1, 3, and 8. Oversampling rates are 1, 2, and 4, and 8.

The issue of side information for SLM is further explored in [84] where the need for explicitly sent side information is avoided. The new technique employs a scrambling sequence and inserts U different labels, (u) b , of length log2U as a prefix to the data sequence as shown in Figure 4.11a. The data with prefix is then fed into a scrambler polynomial as shown in Figure 4.12. The labels drive the scrambler into one of U different states before scrambling the data itself. The scrambled data is then processed as usual. The process is repeated with the other different U labels to produce U alternative transmit sequences as in standard SLM. Figure 4.11b is a variation where the linearity of the scrambler is exploited, here only a single codeword (0) q needs to be generated. U different subcarrier vectors can then be generated by applying U different vector mappings to (0) q . The advantage of this structure is that uth calculated vector mapping can be calculated once from the label (u) b and can be stored in memory. The receiver structure complexity is hardly Chapter 4: Peak to Average Power Solutions Distortionless Techniques 81 increased at all, only the label needs to be removed after descrambling. The complexity of the new scheme is increased by around U compared to standard SLM. (a) (b)
Figure 4.11: SLM transmitter block diagram employing technique to avoid explicit transmission of side information, a) serial form, b) parallel form. Figure 4.12: Scrambler polynomial for new SLM technique.

Simulations were performed with N=256 carriers, 219 of which are active (to decrease the design constraints of the transmit filter), 16 QAM mapping is used. The IFFT was oversampled by a factor of 2 yielding more accurate results. The data was then interpolated by a factor of 8 and a RRCF with a rolloff of 0.12 was used, results were shown after the transmit filter. At Pr(_>_0)=10-5 the new SLM algorithm can
Label inserter Scrambler Mapper
IFFT

Select RF

up conv. S/P P/S


(0 ) b (U 1 ) b

q
(u) q (u) A (u ) a (0 ) a (U 1 ) a

a (u ) b
Label inserter Scrambler

Mapper(0) IFFT
Select

RF up conv. Mapper(U- IFFT S/P S/P P/S P/S 0 (0) q

(0) A

(U 1 ) A
(0) a

(U 1) a

Delay Delay Delay Delay

(u) q (u ) q
Chapter 4: Peak to Average Power Solutions Distortionless Techniques 82 reduce the PAPR by 1.8 and 2.5dB for U=4 and 8 respectively. Figure 4.13 shows the receiver structure of the new SLM technique.
Figure 4.13: a) Receiver structure of the proposed SLM system, b) Descrambler polynomial at the receiver.

The impact on the BER is also analyzed where perfect knowledge of the channel and noise is assumed. The descrambler can multiply errors in receiver, but the effect is negligible as the BER is degraded by 0.2dB. PSD plots showed that between 1 and 2dB can be saved in power backoff of the LPA with 4 bits redundancy per OFDM symbol. Reference [85] again looks at the issue of side information in SLM and proposes a variation of reference [84] called Blind SLM (BSLM) which does not require the labels to be sent with the data for descrambling in the receiver. The U sets pseudo scrambling noise vectors are restricted to a known set at the receiver and all U sets are sufficiently different. In order for the method to work u
jn

c e Q , where Q is the constellation mapping type. At the receiver the decision metric is (4.17)
n

RF down conversi on FFT Demapper Descramble r Label dumper S/P P/S Y y q q q

a)
Delay Delay Delay Delay q q

b)
Chapter 4: Peak to Average Power Solutions Distortionless Techniques 83
{}
0, 1, 1,

12 ... 0 , 0,1,... 1

min u
n N

N j nnn cccC n PuU

D r e H c

__ __ =

= _ (4.17) is performed for P1, P2,.PU-1, the global minimum-distance solution provides the best estimate for the transmitted data. A suboptimal metric is also presented to reduce complexity in the receiver. Simulations of the BER revealed that the new technique has almost the same performance as SLM with perfect side information available with a infinite backoff in the LPA. When a SL is employed with varying backoffs, BSLM slightly outperforms the standard SLM approach. However there is a slight degradation when the new algorithm is used in a fading channel.

4.3 Tone Reservation/Injection


Tone Reservation (TR) and Tone Injection (TI) were first introduced in reference [86] and further detailed in reference [30]. These methods use an iterative algorithm to provide increasingly better CCDF results. 4.3.1 Tone Reservation In TR subcarriers, called Peak Reduction Tones (PRTs), are set aside for PAPR reduction as shown in the transceiver block diagram in Figure 4.14. The signal plus Peak Reduction Tones (PRTs) are represented in (4.18) x + c = Q(X + C) (4.18) where Q is the IFFT matrix, X is the transmit data before the IFFT, and C are the PRTs. __ __ _ + = mC
k m mk k m

Xk Ck XC , , (4.19) Chapter 4: Peak to Average Power Solutions Distortionless Techniques 84


k

Figure 4.14: Block diagram of a Tone Reservation (TR) OFDM transceiver.

where = 0 kX , {

} L k i ,....,i 1 and = 0 k C , { } L k i ,....,i 1


Coefficients for Ck can be found with a reduced complexity iterative algorithm which achieves a solution close to the optimum in a few steps to reduce the PAPR. This method is distortionless as the data lies in disjoint frequency bins, i.e. m = 0
k m

XC, which introduces redundancy. The new transmit signal has PAPR defined as (4.21)
k

{}[

] Nx

xc PAR x c /2
2

+ + = (4.21) Vector c is computed so that the maximum peak value is reduced as (4.22)
^^
^

+=+

min x c min x QC
C c

(4.22) Solving 2 x + c is a convex problem, where the optimum solution lies at the bottom of the parabola. When the terms are expanded out it can be solved as a Quadratically Constrained Quadratic Program (QCQP), which can also be solved as a Linear Program (LP). The complexity of the LP is O(NlogN).
FFT X0 X2 C1 CN-1

S/P

N
IFFT X0 0 X2 0

P/S

N
IFFT 0 CN-1 0 C1

P/S

N x(t) c(t) Chapter 4: Peak to Average Power Solutions Distortionless Techniques 85 L/N is the ratio of redundancy where L is the number of PRTs and N is the total number of data tones or subcarriers. To reduce the bit redundancy, PRTs can be assigned to subcarriers which have a small number of bits assigned to them (in the case of ADSL), i.e. QPSK as opposed to 16 QAM subchannels. The Data Rate Loss (DRL) can be given as (4.23)

_ _
=

= =1
0 1 N kk L ki

b b DRL k (4.23) Simulation results indicated that for a L/N ratio of 5%, the clip probability is reduced from 15dB to 9dB at Pr(_>_0)=10-5. For a L/N ratio of 20% the clip probability is reduced from 15dB to 5dB at Pr(_>_0)=10-5. Basically an increase in the number of PRTs improves the PAPR reduction made. All simulations are based on N=512 subcarriers. The performance of TR is influenced by the position of the peak reduction tones. As with PTS it was found that random positions of PRC gave the best peak reduction results of 6.2dB at Pr(_>_0)=10-5 probability. Adjacent PRTs on the other hand only had a peak reduction of 3.4dB. The number of iterations required to achieve various peak reduction values for random PRT also has a pronounced affect on the performance of TR. For example with L/N= 5%, 1 iteration gives a 2dB reduction at Pr(_>_0)=10-5 , 5 iterations provides a 3.8dB reduction at Pr(_>_0)=10-5 , which is 2.1dB away from optimum solution. Note that law of diminishing returns applies in that the more iterations performed the less the PAPR reduction achieved, i.e. after 40 iterations the new peak value is still 0.5dB away from optimum solution. These results are for the optimum solution where the complexity is quite high. The complexity can be reduced using general purpose iterative algorithms which find suboptimal solutions such as the gradient algorithm. Chapter 4: Peak to Average Power Solutions Distortionless Techniques 86 Gradient algorithm for computing c Reference [86] uses gradient algorithms with reduced complexity, to reduce the hardware requirements while still providing a good approximation to the optimum

solution. By taking the gradient of the clipping noise Mean Square Error (MSE), simple iterative algorithms were produced which achieved a solution close to the optimum in a few steps. The gradient algorithm searches for the largest terms in x+c and subtracts scaled replicas of them to cancel large peaks. The complexity of this algorithm is O(N) per iteration compared to O(NlogN) in the optimum case. The clipping noise of the gradient algorithm is defined as (4.24)

()()()_
=

=
1 0 22 N n

x clip x x clip x (4.24) Including the PRTs in the transmitted time signal x+c, the Signal to Clipping noise Ratio (SCR) is given by (4.25)
AnAn

()2
2

x c clip x c x SCR A+ + = (4.25) In order to maximize the SCR the denominator is minimized. Instead of solving for c (c=QC) [30] solves for the gradient of c giving (4.26)

_
+> + xcA n row k n kk
k ii

c c Qq
()

^^ ( 1) ( )

( ) (4.26)

where sign(x c ) x c k A
nn k nn k n

( ) = + ( ) + ( ) . As is seen in (4.26), each iteration requires an IFFT to be performed. Basically the algorithm searches for the largest terms in x + c(k ) and subtracts scaled, circularly shifted replicas of the vector p0 to cancel large peaks ( 0 0 row p = Qq ).

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 87 4.3.2 Tone Injection Another method developed in reference [86] called TI maps data that cause large peaks to new positions which will not produce peaks when the IFFT is performed. The data can then be easily decoded correctly with a modulo D operation in the receiver as shown in the block diagram in Figure 4.15.
Figure 4.15: Block diagram of a Tone Injection (TI) OFDM transceiver.

If 2 3 2
kk k

X = A = d + j d (4.27) and A is changed so that A = A + pD + jqD (4.28) where p and q are any integer values and D is a positive real number known at the receiver. This process effectively increases the size of the constellation, as well as the average power of the transmit symbol. Reference [30] indicated that careful selection A can reduce the PAPR by over 5dB with only a 2% increases in the average power.
S/P

N
X0 IFFT XN-1 X2 P/S

N
IFFT C0 CN-1 C2 C1

P/S

N x(t) c(t)
Mod D
X0 X2 X1 XN-1

FFT
X0+C0 X1+C1 X2+C2
XN-1+CN-1

X1

Chapter 4: Peak to Average Power Solutions Distortionless Techniques 88 The possible expansions of the selected tone for a 16QAM constellation are shown in

Figure 4.16. The new transmitted signal can be given as (4.29)


Figure 4.16: Example of possible expansions of constellation in TI for 16 QAM.

()_
=

=++
1 0

1 2/
N k j kn N nkkkkk

X p D jq D e

N x (4.29) The maximum peak reduction per tone shift is (4.30) MN M


k

k 2 1 6
2 2
0 0

= (4.30)

_ _ _ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

_ _ _ _ _ _ _ _

_ _ _ _ _ _

I Q I I QQ D
Chapter 4: Peak to Average Power Solutions Distortionless Techniques 89 where k M is the number of levels per dimension. The maximum peak reduction factor (assuming normalized data) is K , where K is the number of real/imaginary dimensions. The peak reduction factor decreases as N increases, so to keep the peak reduction factor constant K must be increased. There is no redundancy in TI as the receiver uses a modulo D operation to decode information, thereby separating the original constellation point from the expanded one. The complexity in TI comes from finding values of pk and qk which produce low PAPR which in turn require (as in TR) the solving of an integer programming problem, which has exponential complexity. Assuming L duplicate points per constellation, if K dimensions are to be modified, the algorithm must search over ( )K NL combinations for vectors pk and qk, where L is the oversampling factor. Low complexity iterative algorithms are used to find approximations close to the optimum solution. The algorithm starts with the original multicarrier symbol (pk=0, qk=0). After the maximum sample, n0, is located, the tone, k0, is found (4.29) is updated. If more than 1 value of n x is large, a new value for k0 that reduces as many peaks of possible is found. This procedure can be repeated several times until the desired PAPR is met or the maximum of iterations is reached. Each iteration decreases the PAPR by around 1 dB up to a max of 6dB of reduction.

4.4 Conclusion
This chapter introduced distortionless techniques for PAPR reduction in OFDM. Distortionless techniques have the advantage of not corrupting the data thereby

maintaining a low BER but come at the cost of increased complexity and bandwidth. Coding techniques for PAPR reduction were first discussed in Section 4.1 where redundant bits are added to the bit stream before the IFFT. Properly chosen, these codewords ensure that the PAPR after the IFFT is kept low. These codes can be Chapter 4: Peak to Average Power Solutions Distortionless Techniques 90 combined with existing COFDM to reduce the redundancy and complexity inherent in coding. A disadvantage of coding is that the complexity becomes prohibitively high with an increase in the number of subcarriers (>32). Various codewords were presented such as cyclic codes, Shapiro-Rudin Sequences, Golay Complementary codes, and Reed-Muller codes. Golay codes and their subset, second order Reed Muller codes were found to have excellent PAPR properties restricting the PAPR to 3dB. This reduction could be traded off with reductions in complexity and the code length. Still complexity remains a restrictive issue in coding. Section 5.2 introduced MSR methods, where a set of alternative transmit signals are seeded from the same data source. Various techniques are used to encode the alternative sets of transmit signals, which are constructed in such a way so that they will have different PAPR properties. The transmit signal with the lowest PAPR is chosen for transmission. The effect of MSR OFDM is to shift the CCDF curve from the right to left, i.e. reduce the amplitude peaks in a uniform manner to lower PAPRs. Both PTS and SLM require side information to be sent with the chosen transmit signal. This information must be protected as loss of the side information means loss of the whole transmit symbol. The redundancy required for PTS is (V-1)log2W while SLM requires log2U bits per OFDM symbol. Variations of SLM and PTS have also been presented which alleviate the need for explicit side information. Increasing the number of alternative transmit symbols reduces the PAPR but also increases the complexity, reductions in the PAPR also become less pronounced with larger numbers of alternative transmit symbols. The number of alternative transmit symbols that can be produced for 1 OFDM symbol is WV-1 for PTS and U for SLM. An advantage of MSR over coding techniques is that the reductions are independent of the constellation type, and only marginally affected by the number of subcarriers. However unlike coding no specific level of PAPR can be guaranteed with MSR methods, only a shift to lower PAPR as shown in the CCDF curves. The effect of oversampling PTS is examined in the next chapter where filtering is shown to degrade the gains made with PTS by regrowing peaks. Oversampling at Chapter 4: Peak to Average Power Solutions Distortionless Techniques 91 the IFFT is required to maintain the reductions gained. SLM is much more robust to filtering due to the greater independence of the alternative transmit symbols. Generally SLM outperforms PTS in terms of reduction versus redundancy, but PTS is much better in terms of PAPR reduction versus additional system complexity. Section 4.3 described TR and TI for PAPR reduction, both methods prevent distortion by reducing the PAPR before the HPA. In TR the additional complexity is only in the transmitter. Suboptimal solutions to the PAPR minimization problem were presented which successfully reduced the complexity without sacrificing the amount of PAPR

reduction to a great degree. In TI a similar method to TR is presented where most of the complexity is in the transmitter with some additional complexity in the receiver which is a simple modulo operation of the demodulated complex vectors. Again suboptimal solutions were proposed with similar performance to the optimal case. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 92

Chapter 5 Cyclic Shifted Sequences and Time Inversion of PTS


This chapter introduces new techniques to produce alternative transmit signals in PTS. Section 5.1 reviews standard PTS subblock creation methods. Two new techniques for subblock creation in PTS are described in section 5.2. Section 5.2.1 describes Cyclic Shifted Sequences (CSS) where shifts in the data are used in place of phase rotations. Section 5.2.2 looks at a combination of traditional PTS using phase rotations and CSS to produce alternative transmit signals. Section 5.2.3 describes Time Inversion (TI) where the data sequence is reversed to provide an alternative transmit signal. Section 5.3 analyses the effect of filtering on traditional phase rotated PTS as well as CSS and TI. Section 5.4 provides results for the aforementioned techniques when oversampling of the IFFT is done before filtering. Section 5.5 compares the complexity of standard PTS and the new techniques and Section 5.6 closes the chapter with a conclusion.

5.1 PTS subblock creation


As was shown in section 4.2.1 PTS can provide promising reductions in the PAPR. However issues such as complexity and filtering after PTS limit the techniques usefulness. In traditional PTS alternative sub-blocks are created by increasing V, the number of IFFTs, and W, the number of allowable phase rotations. Increasing the Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 93 number of phase rotations provides increasingly less PAPR reduction for a set V. In addition the hardware operations are non trivial for W>4. This limitation in PTS is the impetus for finding alternatives to phase rotations in order to create unique transmit sequences. Note that all simulations use adjacent block partitioning.

5.2 New techniques for PTS subblock creation


Of all the MSR techniques PTS suffers the most after filtering [39] as the alternative sequences are not necessarily independent of each other as in the case of SLM. The following sections propose new techniques to create alternative sequences for PTS and shows through simulation that the peak regrowth of the proposed techniques after filtering is not as severe as in traditional PTS. This advantage is combined with a reduction in complexity of the new algorithms. Figure 5.1 shows a block diagram of the simulation model, note that the CCDF is measured after the nyquist sampled IFFT.
Figure 5.1: Block diagram showing the simulation model of Section 5.2.

5.2.1 Cyclic Shifted Sequences As detailed in Section 4.2.1 PTS produces alternative signals by breaking up the transmit bit stream and phase rotating whole parts before performing the IFFT. These phase rotations can be kept trivial if W is restricted to 4 or less. In other words for phase rotations of 90, 180, and 270 degrees only the sign of the I or Q value needs to be changed, making them trivial hardware operations. If the number of phase

Data source M-ary


Mapping PTS /CSS /TI

Measure CCDF
Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 94 rotations, W, is greater than four, then the rotation becomes non trivial as the I and Q values have to be manipulated requiring a more complex circuit. CSS does not suffer from this limitation, the number of positions that can be shifted to provide alternative transmit signals can be up to N/2 (before identical signals are produced) where N is the number of subcarriers. Figure 5.2 shows a block diagram of a PTS system depicting PTS, CSS, and TI which is introduced in the next section. The system represents a PTS/CSS/TI system with V=2 sub-blocks and W/S=4 rotations/shifts. Figure 5.3 compares the CCDF of CSS to PTS where the phase shifts are replaced with cyclic shifts, they are equivalent in all other aspects. Unless otherwise stated all simulations use N=64 subcarriers with QPSK mapping. No oversampling is performed. Here we see that CSS outperforms PTS in all cases, also the complexity of CSS when W>4 is reduced. CSS: V=2, S=8 performs 0.4dB better at Pr(_>_0)=10-4 than PTS: V=2, W=8. CSS: V=4, S=4 has a minor improvement of 0.2dB over PTS: V=4, W=4 at Pr(_>_0)=10-4. CSS: V=4, S=8 has 0.6dB better performance than PTS: V=4, S=8 at Pr(_>_0)=10-4. CSS has a PAPR reduction of 3dB, 4.5dB, and 5.5dB compared to the uncoded conventional case for CSS: V=2, S=8; V=4, S=4; and V=4, S=8 respectively at Pr(_>_0)=10-4. Mathematically CSS can be expressed as (5.1):
()

~~() 1
v

Vv nn v

a a +
=

=_ (5.1) Where (v) is a cyclic shift in the time domain.

It seems that with CSS not only is complexity reduced but the independence of the generated alternative signals is greater thereby giving better PAPR reduction. The performance of CSS at lower probability regions should be increasingly better than PTS due to the steeper slope of the curve, especially for V=4, S=8. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 95
Figure 5.2: Block diagram of PTS, CSS and TI transceiver.

For a CSS structured OFDM transmitter with V=2 and 64 subcarriers, 32 alternative transmit signals can be created. In a PTS transmitter, with V=3 and W=4 only 16 alternative transmit signals with trivial phase rotations can be produced. Increasing the number of subcarriers in CSS allows for more alternative transmit signals to be produced with trivial operations. The effect of double shifts has also been analysed
FFT A,A,A,A B, B, B, B Correction

A, A, A, A, B, B, B, B

a) Transmitter b) Receiver
' nn

a a =

0,0,0,0,B,B,B,B A,A,A,A,0,0,0,0
A,A,A,A,B,B,B,B Store Store

Modify

Select
IFFT IFFT

Correction Store v=0 v=1 v=2 v=3

a' = ae jv
' nnm

a a +=

PTS CSS TI
Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 96 where it was found that large shifts produce better results. This comes at the cost of reducing the possible number of transmit signals that can be constructed.
3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2

10-1 100

0
(dB) Pr(>0) Uncoded CSS PTS V=4 S/W=8 V=4 S/W=4 V=2 S/W=8

Figure 5.3: Simulated CCDF for PTS (V=2, W=8; V=4, W=4; V=4, W=8), CSS (V=2, S=8; V=4, S=4; V=4, S=8) and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT, (os=1).

5.2.2 PTS with CSS Another variation of CSS combining the trivial phase rotations (W=4) in standard PTS with CSS shifts to create more alternative transmit sequences was first presented in [1]. The PTS signal can be expressed as (5.2)

_=
=
V v1 j (v) n ~ n ~ (v)a

a .e (5.2) Adding cyclic shifts to PTS gives (5.3) Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 97
(v)

j V v1 (v) n ~ n ~
(v)

a a .e

_=

= + (5.3) Where (v) is a cyclic shift in the time domain. The number of alternative transmit signals is now WS, where S is the number of cyclic shifting options. Cyclic shifting generates a linear phase shift at the output of the FFT in the receiver. This linear phase component must be removed prior to demodulation and requires one complex multiplication for each data symbol. This process can be made trivial if the linear phase shift is constrained to 2 g radians per frequency bin (g=0,1,2,or 3). This corresponds to a signal shift of 4

(v) = N.g samples (g=0,1,2, or 3). Therefore for each of the W different phase rotations, S=4 additional signals can be produced by cyclic shifting with trivial operations in the transmitter and receiver. Figure 5.4 shows CCDF plots comparing standard PTS to PTS/CSS. There is a negligible performance difference in the two methods. The advantage here is that for 8 alternative transmit sequences (PTS: V=2, W=8; PTS/CSS: V=2, W=4, S=2) 4 complex multiplications per sample are avoided in the transmitter. For 16 alternative transmit sequences (PTS: V=3, W=4; PTS/CSS: V=2, W=4, S=4) a whole IFFT can be avoided, reducing complexity significantly. The reduction in PAPR from the uncoded case is ~2.5dB and ~3.5dB for (PTS: V=2, W=8; PTS/CSS: V=2, W=4, S=2) and (PTS: V=3, W=4; PTS/CSS: V=2, W=4, S=4) respectively at Pr(_>_0)=10-4. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 98
3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100

0
(dB) Pr(>0) Uncoded PTS/CSS PTS V=3 W=4 V=2 W=4, S=4 V=2 W=8 V=2 W=4, S=2

Figure 5.4: Simulated CCDF for PTS (V=2, W=8; V=3, W=4), PTS/CSS (V=2, W=4, S=2; V=2, W=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT, (os=1).

5.2.3 Time Inversion Another new proposed PAPR technique based on a variation of PTS first presented in [2] is Time Inversion (TI), a block diagram description of which is shown in Figure 5.2. TI involves reversing the output sequence of the IFFT sub block before addition and transmission of the sub blocks. Mathematically TI can be expressed as (5.4) ( . ) ()
()() ()

1 () ( 1) ( ) ~~v
ivv vj

V v v n i

an conj a e

= (5.4) where i takes the value 0 or 1 for a time inversion of the Vth sub block in the time domain. TI of a sub-block must be accompanied by conjugation of its elements to stop the information from jumping into image sub-channels after the demodulation process. The effect that this has on the demodulated sub-channels can be reversed by Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS
+

99 taking the conjugate of the output after the FFT process in the receiver circuit. No extra multiply operations are required in the transmitter or receiver. Simulation results for PTS with TI are shown in Figure 5.5 where it is seen that TI has slightly worse performance than PTS with an equivalent number of alternative transmit sequences. TI has ~0.5dB degradation compared to PTS at Pr(_>_0)=10-4. TI reduces the PAPR by ~2.2dB over the uncoded case while PTS has a reduction of ~2.7dB at Pr(_>_0)=10-4.
3 4 5 6 7 8 9 10 11 12 10-4 10-3 10-2 10-1 100

0
(dB) Pr(>0) Uncoded PTS/TI: V=2, W=4, S=2 PTS: V=2, W=8

Figure 5.5: Simulated CCDF for PTS (V=2, W=8), PTS/TI (V=2, W=4, S=2) and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in the IFFT, (os=1).

5.3 Filtering new techniques


As noted in [34, 39] the reductions made with PTS are drastically reduced when passed through a pulse shaping filter due to peak regrowth. Furthermore as shown in Figure 3.10 peak regrowth after some form of PAPR reduction is more severe than when no PAPR reduction is performed. Certain PAPR reduction schemes such as Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 100 PTS and clipping display greater peak regrowth after interpolation and filtering than coding and SLM (Figure 4.11). This section analyses the effect of interpolation and pulse shaping filtering on CSS and TI, comparing them to standard PTS through simulation. Simulations performed in this section use N=64 subcarriers with no oversampling in the IFFT (i.e. os=1) and adjacent partitioning. As shown in reference [39] the partitioning type has a minimal effect on the peak regrowth after filtering. A RCF is used with a rolloff factor of 0.15 and 128 filter taps with a normalized sampling rate. The data is interpolated by a factor of 8 before filtering. The simulation model is represented in Figure 5.6.
Figure 5.6: Block diagram showing the simulation model of Section 5.2.

In Figure 5.7 the discrete CCDF after the IFFT is plotted with the CCDF after filtering for PTS and CSS. Both PTS and CSS have 16 alternative transmit symbols. CSS has a peak regrowth of ~3.5dB from the discrete level and PTS has a peak regrowth of ~4dB at Pr(_>_0)=10-4. This is contrasted with only 1dB of peak regrowth in the uncoded case at Pr(_>_0)=10-4. So the PAPR reduction after filtering is only ~1.5dB and ~2dB for PTS and CSS respectively at Pr(_>_0)=10-4. It is also worth noting that CSS has a steeper rolloff than PTS leading to the increasingly better performance than PTS at low probability levels.

RCF
Measure

CCDF Measure CCDF


Data source M-ary mapping
PTS /CSS /TI
(Figure 5.2)

8
IFFT size = Nos Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 101
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 0 (dB) Pr(>0) discrete filtered CSS PTS Uncoded

Figure 5.7: Simulated CCDF of discrete and filtered PTS (V=4, W=4), CSS (V=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15).

Figure 5.8 maintains the same number of IFFTs (V=4) as Figure 5.7 while increasing the number of phase rotations/shifts from 4 to 8. The peak regrowth of CSS above the discrete level after filtering is 4.3dB while PTS peak regrowth is 4.1dB at Pr(_>_0)=10-4. The peak power reduction after filtering is only ~1.7dB and ~2.2dB for PTS and CSS. Comparing these results with Figure 5.7 it is seen that increasing the number of phase rotations/shifts to 8 provides only a further 0.2dB PAPR reduction for both PTS and CSS after filtering. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 102
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 0 (dB) Pr(>0) discrete filtered Uncoded PTS CSS

Figure 5.8: Simulated CCDF of discrete and filtered PTS (V=4, W=8), CSS (V=4, S=8), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15).

Figure 5.9 compares combined PTS/CSS and PTS/TI with standard PTS, with 8

alternative transmit signals. Both PTS/CSS and PTS/TI have very similar performance after filtering with a peak regrowth of ~2.5dB and 2.1dB respectively from the discrete level, while standard PTS is ~0.4dB worse with a peak regrowth of ~3dB at Pr(_>_0)=10-4 . Note that again the slope at Pr(_>_0)=10-4 is slightly steeper for the new techniques after filtering implying further PAPR improvement over PTS at lower probability levels. The PAPR reduction after filtering is very poor in all cases, only ~1dB for the new techniques and 0.6dB for standard PTS at Pr(_>_0)=10-4. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 103
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100

0
(dB) Pr(>0) discrete filtered PTS/TI PTS/CSS PTS Uncoded

Figure 5.9: Simulated CCDF of discrete and filtered PTS (V=2, W=8), PTS/CSS (V=2, W=4, S=2), and PTS/TI (V=2, W=4, S=2), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15).

In Figure 5.10 the CCDF of PTS and PTS/CSS is compared, where 16 alternative transmit sequences produced. The peak regrowth is ~2.8dB and ~3.1dB above the discrete level for PTS and PTS/CSS respectively at Pr(_>_0)=10-4. Again the PAPR reduction after filtering is heavily penalized with only ~1.8dB and ~1.5dB gained in PAPR reduction at Pr(_>_0)=10-4 for PTS and PTS/CSS respectively. As stated earlier the advantage of PTS/CSS in this case is the removal of 1 IFFT operation per transmit symbol. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 104
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100

0
(dB) Pr(>0) discrete filtered PTS/CSS Uncoded PTS

Figure 5.10: Simulated CCDF of discrete and filtered PTS (V=3, W=4), PTS/CSS (V=2, W=4, S=4), and uncoded OFDM. N=64, adjacent partitioning, with no oversampling in the IFFT (os=1), interpolated by 8, filtered with RCF (_=0.15).

5.4 Oversampling new techniques


Oversampling [39] is required to improve the performance of PTS techniques. Oversampling the IFFT increases the convergence between the discrete PAPR and the filtered PAPR. Oversampling increases the PAPR in the discrete domain, while

reducing the PAPR after filtering. The simulation model here is the same as Figure 5.6 except that oversampling is performed at the IFFT before filtering, i.e. os=1, 2, 4, or 8. Again N=64 subcarriers, and adjacent partitioning is used. The data is interpolated by 8 before filtering with a RCF with 128 taps and a rolloff factor of 0.15. Figure 5.11 and 5.12 shows PTS/CSS (V=2, W=4, S=4) and PTS (V=3, W=4) under oversampled and filtered conditions respectively, with 16 alternative transmit signals. It is seen that oversampling by 2 brings the discrete and filtered CCDF curves to Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 105 within 1dB of each other at Pr(_>_0)=10-4. Furthermore oversampling by a factor of 8 brings them almost to convergence with PTS/CSS slightly closer (~0.1dB) than PTS (~0.3dB) at Pr(_>_0)=10-4. The PAPR reduction for PTS/CSS and PTS after filtering is 3.3dB and 3dB respectively at Pr(_>_0)=10-4. As oversampling the IFFT by a factor of 8 provides minimal improvement over 4, all further simulations will use oversampling rates of os=1, 2, and 4.
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 0 (dB) Pr(>0) discrete filtered Uncoded os=1 os=2 os=4 os=8 os=1 os=2 os=4 os=8

Figure 5.11: Simulated CCDF for PTS/CSS (V=2, W=4, S=4), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and oversampled (solid) filtered curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, 4, and 8, interpolated by 8, and filtered with RCF (_=0.15).

Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 106


3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 discrete filtered

0
(dB) Pr(>0) Uncoded os=1 os=2 os=4 os=8 os=8 os=4 os=2 os=1

Figure 5.12: Simulated CCDF for PTS (V=3, W=4), and Uncoded OFDM. Discrete oversampled

(dashed) curves move from left to right and oversampled (solid) filtered curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, 4, and 8, interpolated by 8, and filtered with RCF (_=0.15).

Figures 5.13, 5.14, and 5.15 compare PTS (V=2, W=8), PTS/CSS (V=2, W=4, S=2), and PTS/TI (V=2, W=4, S=2) under oversampled and filtered conditions respectively, with 8 alternative transmit signals. Oversampling by a factor of 2 brings the discrete and filtered CCDF curves to within 0.8dB of each other at Pr(_>_0)=10-4. Oversampling by a factor of 4 brings them almost to convergence (0.2dB). PTS/CSS and PTS/TI have slightly better performance at any given oversampling rate. For example, with an oversampling factor of 2 PTS has a PAPR of 10.2dB at Pr(_>_0)=104, while PTS/CSS and PTS/TI has a PAPR of 9.9dB and 10dB respectively at Pr(_>_0)=10-4. The PAPR reduction for PTS, PTS/CSS, and PTS/TI after filtering (os=4) is 2.2dB, 2.5dB, and 2.3dB respectively at Pr(_>_0)=10-4. Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 107
3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 discrete filtered 0 (dB) Pr(>0) Uncoded os=1 os=2 os=4 os=4 os=2 os=1 3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 discrete filtered 0 (dB) Pr(>0) Uncoded os=1 os=2 os=4 os=1 os=2 os=4

Figure 5.13: Simulated CCDF for PTS (V=2, W=8), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15). Figure 5.14: Simulated CCDF for PTS/CSS (V=2, W=4, S=2), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15).

Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 108


3 4 5 6 7 8 9 10 11 12 13 10-4 10-3 10-2 10-1 100 discrete filtered

0
(dB) Pr(>0) Uncoded os=1 os=2 os=4 os=4 os=2 os=1

Figure 5.15: Simulated CCDF for PTS/TI (V=2, W=4, S=2), and Uncoded OFDM. Discrete oversampled curves (dashed) move from left to right and filtered (solid) oversampled curves move from right to left. N=64, adjacent partitioning, with oversampling rates in the IFFT of 1, 2, and 4, interpolated by 8, and filtered with RCF (_=0.15).

5.5 Complexity evaluation


The performance in terms of the CCDF and PSD of CSS, PTS/CSS, and PTS/TI has been established. Table 5.1 compares them in terms of hardware operations where it is seen that multiplications are avoided when appropriate values of V, W, and S are chosen. For example with a PTS system (N=64, V=3, W=4) 85 non complex multiplications, 170 squaring operations, and 178 compares are required. The equivalent PTS/CSS system (N=64, V=2, W=4, S=2) requires no multiplications in exchange for an increase in 512 squaring operations, and 528 comparisons as well as the removal of a whole IFFT operation. A PTS system (N=64, V=2, W=8) requires 256 multiplications, half of which Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 109
Table 5.1: Complexity comparison of various techniques (adjacent partitioning)

Type Comments Operations PTS


- V blocks - W rotations - (N/V)W multiplications (trivial if W<4) - (N/V)W(V-1) squaring - (N/V+1)W(V-1) comparisons - V*(N/V) IFFTS

PTS/CSS
- V blocks - W rotations - S shifts - (N/V)WS(V-1) squaring - (N/V+1)WS(V-1) comparisons - V*(N/V) IFFTS

PTS/TI
- V blocks - W rotations - T(=2) inversion - (N/V)TW(V-1) squaring - (N/V+1)TW(V-1) comparisons

- V*(N/V) IFFTS are complex, 256 squaring operations, and 264 comparisons. The equivalent PTS/CSS system (V=2, W=4, S=2) requires no multiplications, and has the same number of squaring and comparison operations as the PTS system.

5.6 Conclusion
This chapter introduced new techniques for the creation of PTS OFDM signals. In section 5.2 it was shown through simulation of the CCDF that CSS has better performance (up to 1dB) than PTS for the same number of alternative signals over various combinations of V and W. Combining PTS with CSS and TI was shown to provide similar performance to standard PTS but with the advantage of a reduction in complexity. PTS (V=3, W=4) was shown to have the same performance as PTS/CSS (V=2, W=4, S=4) but with the added advantage of the removal of one whole IFFT operation. The performance after filtering was examined in Section 5.3 where it was shown that the gains made with PTS techniques are dramatically affected when passed through a pulse shaping filter. In the simulations a RCF with 128 taps and a roll off factor of 0.15 was used. The resultant reduction in PAPR after filtering was between 1 and 2dB for various combinations of V, W, and S with the new techniques having up to Chapter 5: Cyclic Shifted Sequences and Time Inversion of PTS 110 0.7dB better PAPR for all combinations. The only exception is PTS/CSS (V=2, W=4, S=4) where the performance was slightly worse than PTS (V=3, W=4) for the same number of alternative signals. This small degradation is a trade off for removing 1 of the IFFTs. The effect of oversampling on PTS and the new techniques was also examined, where it was seen that an oversampling factor of 2 was sufficient to bring the discrete PAPR to within 1dB of the filtered PAPR. The effect of oversampling was to increase the PAPR of the discrete CCDF while reducing the CCDF of the filtered CCDF. Oversampling by a factor of 8 (implying IFFT sizes of N8)was necessary to bring both curves almost to convergence. However in a practical system an oversampling factor of 2 was shown to be sufficient bringing the discrete and filtered CCDF curves to within 1dB of each other. The avoidance of multiplications and in some cases IFFT operations, as well as a modest performance gain combine to make the new techniques viable alternatives to standard PTS OFDM. Two publications resulted from this chapter. Chapter 6: Peak to Average Power Solutions Distorted Techniques 111

Chapter 6 Peak to Average Power Solutions Distorted Techniques


This chapter details another approach to PAPR reduction, distortion introducing techniques. These methods do not attempt to create a transmit signal with a low crest factor, instead they take the output of the IFFT and then limit the amplitude of large

samples which invariably causes distortion. These methods include approaches such as pulse shaping (or windowing), and clipping at every stage from the output of the IFFT to limited backoffs in the amplifier. The advantages of clipping are a reduction in complexity and ease of implementation, the disadvantages are the inband distortion (increasing BER) and spectral splatter, affecting adjacent channels by increasing out of band distortion. Section 6.1 describes the most common form of clipping in the baseband and reviews papers which seek to quantify the effect of clipping on the BER and PSD under different OFDM system conditions. Section 6.1.1 looks at the relation between clipping and quantization. Section 6.2 looks at amplifier limiting where the unconstrained OFDM signal is allowed to saturate the amplifier introducing spectral regrowth. Section 6.3 analyses windowing and pulse shaping techniques. Chapter 6: Peak to Average Power Solutions Distorted Techniques 112

6.1 Clipping in the Baseband


Reference [46] is an early paper looking at the effect of clipping and filtering on OFDM. The authors note that clipping causes both inband distortion, deteriorating the BER and out of band noise which reduces the spectral efficiency. Filtering after clipping was noted to reduce the spectral splatter but at the cost of peak regrowth. The simulation model used N=128 subcarriers with a guard interval, Tg, of 32 samples. QPSK modulation was applied to the data samples before modulation with an IFFT. As direct clipping of the samples will cause all the noise to fall in band the OFDM symbol is oversampled by a factor of 8 before clipping. The complex baseband samples were modulated up to a carrier frequency 1/4 of the sampling frequency in order to reduce the complexity of the simulation model. The real valued bandpass samples, x, are then clipped at amplitude, A, according to (6.1) , , A yx A __ =___ if if if xA AxA xA < > (6.1) where the clipping level A is determined as CL = A , where is the rms level of the OFDM signal. Filtering is performed with an equiripple bandpass Finite Impulse

Response (FIR) filter with 103 taps, a stopband of 40dB, and 1dB ripple in the passband. PSD results for clipping (CL=0.8 to 1.6) with no filtering displayed both in band distortion and spectral splatter. For a CL=1.4 the out of band noise was only 16dB lower than the signal power. This demonstrated the need for filtering to suppress the sidelobes. When the filter was applied the sidelobes were suppressed to ~50dB below the signal power for the same CL. Results also indicated that the peak regrowth after filtering was significant. For example, for a CL=1.4 (3dB) the clipped and filtered signal had a CF of ~9dB at the 99.999 percentile, i.e. 6dB of peak regrowth. The unclipped signal at the same percentile had a CF of 13dB. Finally the BER was examined after clipping and filtering in an AWGN channel. For reasonable clipping levels (CL>1.4) it was shown Chapter 6: Peak to Average Power Solutions Distorted Techniques 113 that less than 1dB of degradation at the 10-2 BER level is encountered. However it should be noted as shown in Figure 3.17 that QPSK, unlike higher modulation types is inherently impervious to clipping. Reference [87] presented a theoretical analysis of both cartesian and envelope clipping for various oversampling rates. Cartesian clipping is where the I and Q values of the complex sample are clipped independently. As the magnitude is not required this method is much less complex to implement than envelope clipping, however it introduces more distortion than envelope clipping at a set clipping level. Considering z as the I or Q component of the complex OFDM signal the output of the Cartesian clipper is expressed as (6.2)

()()
0 00 0

y ygzyzz y __ = =__ _+
0 00 0

, , , zz zzz zz < <<+ > (6.2) Under the assumption of a Gaussian like nature of the samples (i.e. high number of subcarriers) the impact of non linear distortion was analytically derived , and included

the effects of AWGN as well as clipping noise. The analytical results were compared to a simulated system with N=128 subcarriers and QPSK modulation. Square, or Cartesian clipping was shown to have worse performance by around 1dB than envelope clipping (as expected). An oversampling factor of 2 was shown to be enough to have the same performance as an infinitely oversampled signal. A moderate deterioration was seen when the oversampling factor was set at 1 for both square and envelope clipping. Optimum clipping levels were shown to be clip 2.5 A = for square clippers and 2.0 clip A = for magnitude clippers. Another reference [88] to analyze the performance of Cartesian clipping developed analytical expressions for the PSD and SER for any M-ary constellation with various clipping backoffs. Clipping is performed according to the rule set in (6.2) Chapter 6: Peak to Average Power Solutions Distorted Techniques 114 Clipping is defined as
0

20log x ibo __=__ __ (6.3) The SER was found to be


10

SER = ps =1 (1 pe )2 (6.4) where 13 2 Pr 1e M pr MM __ __ = _ > _ __ __ (6.5) and 0 0


0

12 yx Q x

____=____ ____ , Q is the Gaussian error function, is the variance of the input signal, and M is the M-ary constellation mapping type. A simple simulation model was created where an IFFT was used to modulate the complex samples, which were then clipped using the Cartesian method. A FFT was used to demodulate the data. Many interesting aspects of clipping were revealed analytically and supported through simulation, namely The BER is not uniform across all the subcarriers, some subcarriers have slightly worse performance. Increasing the number of subcarriers (which increases the CF) distributes the clipping noise over more subcarriers improving the BER, especially in higher order M-ary constellation (M>64). The authors note the limitation of a Gaussian assumption of noise as identified in reference [89]. The Gaussian assumption only holds for hard clipping, at higher IBO backoffs, the noise tends to have an impulsive distribution. Chapter 6: Peak to Average Power Solutions Distorted Techniques 115 Reference [89] also analytically examined clipping in the baseband focusing on magnitude clipping and derived an expression for the SER and BER versus the clip level, as well as performance in both AWGN and Rayleigh fading channels. Bounds on the probability of error due to clipping are derived for both the transmitter and the receiver. The analysis treats clipping noise differently to the standard AWGN assumption which is sufficient if the clipping level is set low enough so that there are a number of clips per symbol. In practice, the clipping level is set higher so that a low probability of error is maintained making clipping a rare event. Under this condition the clipping noise is of an impulsive nature as identified in reference [88] leading to a different type of error mechanism. Clipping is performed as (6.6)

()
l yhxx l __ = =___ xl xl xl < (6.6) where x(t) is a continuous time baseband multicarrier signal, y(t) is the clipped output,

and l is the clipping level. Reference [89] identified from the distortion spectrum analysis that the probability of error varies across the subcarriers with the lower subcarriers dominating the errors, (assuming a constant constellation size on all subcarriers). The probability of symbol error for a discretely sampled signal is given as (6.7)

()
1 3 2 2

()
813 Pr( ) ( ). 81 NLl error Q l Q LL _ _ _ _ _ _ _ _ = _ _ _ _ __ _ _ __ _ _ (6.7) where N is the number of subcarriers, L is the constellation type (eg: L=2 is 4 QAM; L=8 is 64 QAM), and Q(.) is the Gaussian error function. An expression for the BER is given as (6.8) Chapter 6: Peak to Average Power Solutions Distorted Techniques 116

()
1 3 2 2 2

()
413 Pr ( ) ( ). log 8 1
b

NLl error Q l Q LLL _ _ _ _ _ _ _ _ = _ _ _ _ __ _ _ __ _ _ (6.8) The AWGN approach is also presented for comparison and is given as (6.9)
2

13 Pr( ) 4 1c L

error Q LL __=__ ____ (6.9) where

()()
2

2 exp 2 1 2c l llQl = + + (6.10) Reference [89] noted that the Gaussian model is pessimistic as it does not take into account that some of the noise power will fall out of band, and will therefore not contribute to the in band distortion. Comparing the two analytical methods in Figure 6.1 it is seen that even with the pessimistic assumption of (6.9) the error probability is much lower than (6.7). This is because instead of being spread uniformly over time, as is assumed in (6.9), clipping noise is actually concentrated in time as impulses, which leads to a greater error probability. The AWGN model is appropriate for hard clipping levels but it underestimates the error probability by several magnitudes at higher clipping levels. It should be noted that the y-axes in Figure 6.1 exaggerates the difference between the 2 equations. A probability of error of 10-6 is probably sufficient. The advantage of (6.7) is that the error probabilities for high clipping levels can be calculated analytically avoiding laborious simulations times which would be required to get accurate results at these levels. Chapter 6: Peak to Average Power Solutions Distorted Techniques 117
22

2 2.2 2.4 2.6 2.8 3 3.2 3.4 3.6 3.8 4 10-16 10-14 10-12 10-10 10-8 10-6 10-4 10-2 100 Clip level Probability of error Pe vs clip level using AWGN approach-Metsgadh AWGN model Impulsive noise model

Figure 6.1: Analytical symbol error probability from (6.7) and (6.9) for 64 QAM and N=64.

The affect of clipping in the presence of channel impairments is also analyzed both through analytical methods and simulation to observe the effect at the receiver. The simulation model uses the Hiperlan2 [52] specification with a guard interval length of 16 samples, perfect synchronization and a 1 tap equalizer. Analytical analysis

revealed that the error probability is further degraded in the presence of channel fades, together with clipping at the receiver. The BER performance was degraded by 1 to 1.5dB on the lower subcarriers. The analytical model is within an order of magnitude of the simulated results. Reference [90] presented another analytical derivation of the SER resulting from clipping in the baseband extending on the impulsive nature of noise at higher clip levels first presented in reference [89]. Reference [90] claims that the SER curves of reference [89] are too pessimistic and that their claim that approximations become tight in the higher OBO region is unsubstantiated. Chapter 6: Peak to Average Power Solutions Distorted Techniques 118 Reference [90] proposed a different technique to limit the upper bound by using the Chernoff bounding technique. The expression for the average SER due to clipping is given as (6.11)

()
N

( ()()())

000

4min SER e d N b J r p r dr


>

__ _++_ __

_
00

(6.11)

where ( : 12 J x ex d

) 2 cos

= _ is the modified Bessel function of the first kind. Unfortunately no closed form solution is provided, however the computational effort is much less compared to long simulation times required for smooth curves below 10-6 probability. Due to the Gaussian assumption of the non linearity the results are only within an order of magnitude of simulated results provided N>256 and the clip level, , is greater than 7dB. Further expressions for the SER in AWGN and Rayleigh fading channels using the previously described Chernoff method are provided. Oversampling at both the transmitter and receiver is also treated, resulting in the realization that the side lobes generated by the transmitting non linearity are suppressed at the receiver by the filter. Furthermore, reference [90] claimed that the average SER is the same for the nyquist and oversampling case, only the out of band radiation is reduced by oversampling. An interesting claim which is not supported by simulations of the BER in the next chapter. The asymptotic behavior is treated and it was found that for N , N the Gaussian model matches the Chernoff bound of this paper in that the

SER0 . Reference [91] looked at the out of band radiation produced by clipping and presented correcting functions which limit the signal while avoiding out of band radiation, and keeping the in band interference to a minimum. This is relevant when oversampling is performed at the IFFT as out of band radiation is created by the clipping process. Filtering after clipping reduces the out of band radiation but regrows previously clipped peaks. Chapter 6: Peak to Average Power Solutions Distorted Techniques 119 Two correction functions were suggested, the first is a Gaussian correcting function k(t) which is an additive correction of the OFDM signal. If the signal exceeds the amplitude threshold A0 at times tn, then the corrected signal is (6.12) c (t ) = s (t ) + k (t ) (6.12) where (
n

) ( ) nn ) g t = et 2
2 2

k t =_A g t t , (

, and

()0
n nn n

(())()

st AstA st = . The correcting function must be normalized so that g(0)=1, which limits the signal s(t) to A0 at the positions tn. However, the correction function may cause peaks in other positions, but this consequence is shown to have a minor effect. Other functions for g(t) are developed which cause no out of band interference and keep the in band interference to a minimum. A Gaussian function is defined as (6.13)

() ()
1 2 0 1 0

01
N j k ft k k N k k

gtGe gG

= =

= ==

_ _

(6.13) And a sinc correcting function is defined as (6.14)

()1
2 0

1N
j k ft k

gte N

=_ sin c ( Bt )e j Bt (6.14) The correcting function (6.14) can correct an amplitude peak in an OFDM signal with minimal in band distortion and no out of band radiation. Note that if the signal is not oversampled then the correction scheme is the same as normal clipping. Simulations with the correcting functions were performed with N=128 subcarriers and an oversampling rate of 4. The signal is corrected with k(t) and any peak regrowth after the correction is clipped at A0. An IBO level of 4dB was used and the algorithm was tested in both an AWGN and fading environment. It was shown that the Chapter 6: Peak to Average Power Solutions Distorted Techniques 120 correction functions introduce more inband noise than standard clipping, however the slight increase in inband noise is offset by the alleviation of out of band interference. For example, at an IBO of 4dB standard clipping has a Signal to Interference Ratio (SIR) of 21dB, Gaussian 13dB, and Sinc 16dB. The BER vs IBO in an AWGN channel with SNR of 18dB shows a magnitude greater degradation for the sinc correction function at 4dB IBO, but in the fading channel the BER degradation is negligible for the sinc case and only marginally worse for the Gaussian case. Clipping and filtering issues were addressed in reference [92] where an oversampled (LN- where L is the oversampling factor) IFFT zero padded in the middle is used to modulate the OFDM symbol. The resultant samples are then clipped by a SL in the normal way, as this results in out of band radiation as described section 3.4.2 the data is filtered by a FFT/IFFT pair of size LN. The filter passes the wanted in band samples while nulling out the out of band components. The advantage of this technique is twofold, firstly out of band radiation is greatly attenuated and secondly, by oversampling peak regrowth after filtering is greatly reduced (refer to Figure 3.11). Simulations were performed with 4QAM data and N=64 subcarriers. CCDF results for a CR of 6dB compared the new algorithm (L=2, N=64) with standard non oversampled clipping (L=1, N=64) where it is seen that the L=2 case has 1dB less

peak regrowth than the L=1 case at 10-5 probability region. It was noted that increasing the oversampling rate (L>2) at the IFFT provided minimal further improvement. Out of band radiation is also analyzed in the form of the PSD where a perfectly linear amplifier with a CR 1dB higher than the baseband clipping level is used after modulation with a carrier frequency. Out of band radiation was reduced down to 65dB using the new technique (L=2), compared to 55dB for clipping before interpolation (L=1) and 45dB with no clipping before amplification. In band distortion is also an affect of clipping, reference [92] stated that clipping adds a noise like component and a reduction in the constellation size (refer to Figure 3.16) which can be corrected at the receiver with AGC. Also as the noise from clipping is created at the transmitter it will lessen its effect in a fading channel. These properties will improve the BER. This technique can be implemented in existing OFDM systems and requires no redesign at the receiver only replacing the IFFT at the transmitter. Chapter 6: Peak to Average Power Solutions Distorted Techniques 121 Another technique to lessen the effects of clipping [93] called Decision Aided Reconstruction (DAR) reduces clipping noise with an algorithm in the baseband at the receiver. Like the method of [92] it uses a FFT/IFFT pair in an iterative algorithm at the receiver to try to estimate which samples were clipped at the transmitter. When the clipping noise is large compared to the AWGN in the channel, performance is limited by the clipping noise. Using the FFT/IFFT pair to make decisions in the frequency domain regrows samples that were clipped at the transmitter, and although they are still distorted decisions made on the new symbols are much less affected by clipping noise. However much of the gain from DAR may be achieved by simply correcting the constellation shrinkage which clipping causes (refer to Figure 3.16). The algorithm can predict false clipped peaks when the clipping level is set too low (<2dB) worsening performance more than standard clipping. Through simulation reference [92] found that DAR worked best when a small number of samples were clipped, and that it worked better with higher order constellations with a clip level >4dB. For example in a 64QAM OFDM system with a CL of 5dB in an AWGN channel the improvement was quite dramatic being only ~1dB lower than the theoretical lower bound. It was also noted that a slight further improvement is seen as N is increased. The number of iterations required for good performance was shown to be around 3. The problem with the methods of references [92, 93] is the latency and complexity of performing extra FFT/IFFT operations. 6.1.1 Quantisation and Clipping An early paper to investigate the relation between clipping and quantization in DMT transceivers is reference [94] where an analytical expression was developed to find the minimum number of bits required in the A/D, D/A converters when the signal is clipped to a predefined level. Due to the Rayleigh nature of the envelope of the DMT envelope 2 to 3 bits can be saved in the A/D, D/A operation without changing the SNR. An expression to calculate the number of bits that can be saved is (6.15) Chapter 6: Peak to Average Power Solutions Distorted Techniques 122

()
2

2 2 13 2

22

1 3 8 .2 . . .log 2 Re

__ __ =__ __ __ (6.15) where 1 2 = R R is the number of bits saved, R1 is the number of bits required for the A/D-D/A when no clipping is performed, R2 is the number of bits required for the A/D-D/A with clipping to keep the same SNR as when no clipping is

performed,. ( ) 1 2 1 .3. 21 L N L +__=___+_ is a parameter set by the number of subcarriers, N, the QAM constellation size L2, (L=4 equates to 16 QAM), and max A = is the clip level. Amax is the clip level in volts and is the rms voltage of the DMT transmit symbol. It was assumed in the paper that all subcarriers have the same constellation type, however the authors claim that different mapping types on subchannels would have a minimum effect. Clipping and quantization are further explored in reference [95] for DMT based transceivers where a improved clipping technique allows for up to an 8dB improvement in the SNR over standard clipping. The new method analyzed the samples after the IFFT, if a sample is above the clipping threshold, Aclip, then the phase of each QAM modulated carrier is changed by means of a fixed phasor rotation, and a new DMT symbol is generated by the IFFT. By careful selection of the phasor rotation, the probability of the new symbol requiring clipping at Aclip will be reduced. Otherwise the symbol is sent on to further processing unmolested. The overall probability of clipping for the 2 pass method described above is (6.16)
2

. Clip Total clip clip clip P = P P = P (6.16) which is determined to be


/ /1 / 2 2 2

1 2
/ N

P erf _ _ __ = _ _ __ _ _ (6.17) Chapter 6: Peak to Average Power Solutions Distorted Techniques 123 where = AClip / . Recalculating the IFFT can create a bottleneck slowing down system performance, but does not necessarily require a factor 2 increase of the IFFT as not every symbol will require clipping, especially at higher levels of Aclip. Side information is also required to inform the receiver of the number of passes, p, used (if any) of log2p bits per transmitted symbol. As seen in Figure 6.2, the clipping probability drops with an increase in the number of passes. At =4 the 2 pass and 3 pass methods reduce the probability of clipping down to ~10-4 and ~10-6 respectively.
Clip Total

2.5 3 3.5 4 4.5 5 10-14 10-12 10-10 10-8 10-6 10-4 10-2 100 Aclip/sigma Pclip/p p=1 p=2 p=3

Figure 6.2: Probability of clipping DMT signal as a function of for p=1,2,3. N=64 subcarriers.

Quantization effects are also examined in terms of noise from the DAC and ADC in the transmitter and receiver respectively for various wordlengths, b. For 16 QAM, Chapter 6: Peak to Average Power Solutions Distorted Techniques 124 N=256, b=12, 3.7 with p=1 and 3.4 with p=2 an improvement of 3 and 8dB can be achieved. Another reference [96] analyzed quantization effects on OFDM used a simulation model with Hiperlan2 [53] specifications. Clipping is performed on the I and Q outputs of the IFFT. As the wordlength at the IFFT output is decreased, the power consumption and complexity of the DAC/ADC decreases at the expense of quantization noise, which increases the BER. However as noted in reference [94] the wordlength can be reduced with minimal affect on the BER. Also clipping at the IFFT output increases the resolution giving a better average signal/quantization noise power ratio, of course this is at the expense of clipping noise. In a nutshell, lowering the clipping level increases the clipping noise while at the same time reducing the quantization noise. Results from reference [96] indicated that the optimum clipping level for wordlengths between 6 and 9 bits occurs at around 4_

(slightly lower for smaller wordlengths). An 8 bit wordlength is also recommended with an extra 2 bits for the receiver ADC to compensate for peak regrowth affects after transmit filtering (upsampled by 4 before filtering) and imperfect AGC in the receiver. Reference [97] examined the effect of rounding and saturation in fixed-point DSP implementation of the IFFT and FFT where optimum trade-offs are found between saturation and rounding. Results from simulation revealed that performance for fixed point FFTs is improved when overflow is allowed to occur with low probability. The distribution of error was shown to depend on the ratio of the maximum quantization level to the RMS power of the random variable. For 16-bit arithmetic the headroom was shown to be around 15dB. Doubling the size of the FFT resulted in an improvement in the new scaling method of 3dB.

6.2 Amplifier non linearities


As shown in Section 3.4 if no attempt is made to control the peak excursions of an OFDM symbol the HPA will saturate causing spectral regrowth and an increase in the Chapter 6: Peak to Average Power Solutions Distorted Techniques 125 BER at the receiver. This section reviews papers which analyze the effect of an uncontrolled OFDM symbol on the HPA. An early reference [42] to look at the effect of a non linearity on QPSK OFDM compared analytically derived and simulated results for the BER versus SNR. Specifically they assumed that the intermodulation products were generated by non linearities in the receiver IF module. These results are stated to be applicable to the transmitter amplifier, and that only 3rd order distortion will affect the system. The BER is calculated under the assumption that the intermodulation products cause an additive Gaussian interference and that the BER is approximately equal on all subcarriers. The non linearity is set at the 1dB compression point of the in band output and is related to the output signal, Vo, by V1dB = 0.27Vo . The simulated results for the amplifier which is linear up to 3dB, with 1dB backoff show that even with a 1dB backoff the BER is within 1 order of magnitude of the linear amplifier at SNR=10dB. The simulated results were shown to be in good agreement with theoretical results. Reference [98] focused on simulation comparing Single Carrier (SC) and OFDM systems with clipping in the baseband and RF amplifier non linearities. The simulation model used N=2K and 8K (DVB, DVA) subcarriers with QPSK and 16 QAM mapping to produce SER versus clipping and AWGN plots in a Monte Carlo simulation. It was shown that the SER versus SNR is almost the same for both systems with OFDM having slightly better performance due to the frequency guard interval used in OFDM which reduces the equivalent noise bandwidth. The number of subcarriers has a negligible affect on OFDM due to the Gaussian distribution of samples. The baseband clipping effect was simulated with the SNR set at 16dB for 16 QAM and the backoff was defined as
2

22 BO A = . Again the number of subcarriers was

shown to have a negligible effect on the BER. The performance of the SC and OFDM system converges at a baseband clipping of BO=6dB for OFDM when 16 Chapter 6: Peak to Average Power Solutions Distorted Techniques 126 QAM mapping was used, a backoff of 3dB was only required for QPSK due to the larger Euclidean distance of QPSK constellation points. For the RF amplifier the low pass equivalent was used, where the signal is of the form

( ) j (t ) A t e , the HPA output can then be expressed as (6.18) ( ) ( ) j( (t ) g A(t ) ) y t f A t e = _ _ + __ __ _ _ (6.18)

Where f [A] is the AM/AM characteristic of the amplifier and g [A] is the AM/PM characteristic of the HPA. The phase distortion was considered to be linear. Performance for 16 QAM OFDM under different amplifier backoffs is almost identical to the baseband clipping effects with a 6dB BO required to bring performance in line with its equivalent SC counterpart. It was also shown that the degradation increases as the third order interception point approaches the 1dB compression point. QPSK was again shown to be more robust to amplifier non linearity. Reference [40] both analytically derived and simulated an OFDM system in a AWGN channel with and without equalization (1 tap) to produce BER versus SNR plots. Amplifier non linearity (using TWT as described in Section 3.4.1) is stated to cause two effects on the detected samples: constellation warping (amplitude and phase distortion) non linear distortion which generates a Gaussian spread like cluster of received values around each constellation point (refer to Figure 3.16) For 16 QAM OFDM system it was shown that at high IBO (25dB) equalization had no effect on the BER. As the IBO is reduced (14dB) the equalized BER degraded only a little, however the non equalized OFDM model lost a further 7dB of SNR. At 12dB IBO the non equalized system lost so much SNR that its error floor became 103. For all cases a good agreement is shown between analytical and simulated results. Chapter 6: Peak to Average Power Solutions Distorted Techniques 127 Reference [47] provided an analytical analysis of smooth non linear distortion on the SER as a function of third order distortion (the most dominant distortion for smooth non linearities) in a memoryless non linear power amplifier. The analysis was performed on a matched filter pair. Initially the spectrum of the distorted OFDM signal is examined, which is then used to determine the detection error for the matched filter detection. It was found through analytical means that the variance is approximately equal on all subcarriers with the middle subcarriers experiencing the most noise, this means that the SER is similar on all subcarriers, a result supported by reference [88]. Simulated results are compared to analytical results for QPSK and 16 QAM OFDM. The simulation model used an equivalent low pass representation to avoid RF up and down conversion, N=1024 subcarriers were processed by an IFFT at which point the non linearity was performed. The data was demodulated with an FFT and the transmit symbols were compared to the received samples. The difference was

squared and stored and then averaged to find the noise variance. The simulation results for 16 QAM were found to better fit the analytical results than QPSK, the tails in both cases fall of quicker in the simulated case because they are not exactly Gaussian. However a good agreement is seen between the analytical and simulated curves. Reference [41] extended on earlier work presented in reference [40] to theoretically analyze the effect of non linear amplifiers in conjunction with phase noise on MQAM OFDM. Phase noise is caused by the oscillators in the RF stage and becomes a more dominant source of noise at higher carrier frequencies (up to 40GHz). The theoretical expressions were supported by simulations using SSPA and TWTA with different values of phase noise. Under the assumption of modeling the phase and amplifier distortions as additive Gaussian noise, the computed variances are used to get an estimate of the BER in an AWGN channel. Results indicated that while QPSK is rather impervious to both amplifier non linearity and phase noise, the performance of 16 and 64 QAM is greatly diminished in terms of the BER even with a large OBO in the amplifier. A SSPA with p=2 introduced 4th order distortions which further diminished the performance of 16 and 64 QAM. In Chapter 6: Peak to Average Power Solutions Distorted Techniques 128 general it was shown that the joint effects of amplifier non linearity and phase noise have two major effects. Amplifier non linearity generates a uniform amplitude attenuation and phase rotation which can be corrected at the receiver by AGC. Phase noise introduced a constant phase rotation within 1 OFDM symbol which can be estimated and corrected using the pilot tones. The second effect is constellation clustering due to the interference produced by the HPA to the ICI caused by the phase noise. Finally it was shown that the phase noise impairment was dependant on the relationship between the phase noise rate and the OFDM symbol period. It was shown that the phase noise can become a limiting factor if a large number of subcarriers and a high frequency carrier are used.

6.3 Windowing
Windowing or pulse shaping are similar to clipping in that they attenuate large peaks. However in windowing a corrective window is multiplied with the data so that not only the peak cancelled but surrounding samples are also affected. The advantage of this process is to keep the OBR lower than in standard clipping. Windows should be as narrowband as possible in the frequency spectrum domain, so as to have good OBR properties. However narrowband windows have the reciprocal affect of being long in the time domain which implies many signal samples being affected, which increases the BER. Reference [99] uses window types such as such as cosine, Kaiser, and Hamming, comparing their use to standard clipping in terms of the frequency spectrum and BER. The simulation model uses Hiperlan2 [53] specifications with a rate convolutional code and 16 QAM where it is shown that clipping the signal at 5dB has a minor affect on the BER with a 0.2dB loss in SNR. Windowing is shown to have almost identical affect on the BER above 5dB clipping but interestingly has worse performance at harder clipping levels.

Chapter 6: Peak to Average Power Solutions Distorted Techniques 129 To simulate the affect of the required backoff in the HPA a SSPA with p=3 as described in Section 3.4.1 is used. In order to keep the OBR to below 30dB for 64 subcarriers a backoff of 6.3dB was required, which could be reduced by 1 dB to 5.3dB when peak windowing is used. When 256 subcarriers are used the backoff of 6.3dB can be reduced by 0.8dB to 5.5dB with peak windowing, showing that windowing is independent of N. A later reference [100] uses broadband pulse shaping on individual subcarriers as a way to reduce the PAPR. By making the cross correlation between samples in the same block close an OFDM signal with a low PAPR can be created. The new OFDM signal is given as

( ) ( ) ( ) j2 m T
nm

Where (

) n X m is the modulated data symbol of subcarrier m, T is the duration of the OFDM block, and the waveform ( ) m p t is a pulse shape of duration T, on subcarrier
m which has a bandwidth less than or equal to the bandwidth of the OFDM signal x(t). Unique RRC waveforms are multiplied with each sample, which are cyclic shifts of each other within the same time interval 0<=t<T. As each RRC pulse is seeded from the same source they are easy to create, the RRC pulse for each subcarrier is defined as

x t =_X m p t e nT t (n +1)T (6.19)

( ) ( ) 122
N L mk k m j j t NN m kL

ptCkee
+ =

= _ 0 t < T (6.20) where

()()2
0

11k
Tjt

m C k p t e dt P TTT _ _ = = _ _
T

_ _ _ (6.21) Chapter 6: Peak to Average Power Solutions Distorted Techniques 130 is the Fourier series of p(t) and

( ) ( 2),

0, rc p t T pt __ =___ 0tT elsewhere < (6.22)

where ( / 2) rc p t T are the samples of the time domain RRC pulse. When the rolloff factor is increased to 0.5 the reduction in the CCDF at Pr(_>_0)=10-3 is around 6.5dB. However the effect of oversampling and the frequency spectrum is not treated. I anticipate that the spectrum is at least 50% larger than normal OFDM because the spectrum of the broadband pulse would have to be convolved with the basic OFDM linear spectrum. Note that reference [100] is actually mathematically equivalent to clipping and filtering with a linear time invariant filter.

6.4 Conclusion
This chapter presented distorted techniques for the reduction of PAPR in OFDM. Clipping in the baseband was first introduced as this is the simplest and most widely examined area in distorted PAPR reduction techniques. Clipping was shown both analytically and through simulated means to have a minor effect on the BER when QPSK modulation was used due to the large Euclidean distance between constellation points. Higher order mapping types such as 16 and 64 QAM were much more susceptible to clipping. The BER on individual subcarriers was also treated where it was shown that the probability of error was almost equal on all subcarriers. Increasing the number of subcarriers was shown to have a beneficial affect as the noise introduced by clipping would be spread over more subcarriers. The Gaussian like assumption of the noise which is assumed in most analysis of clipping noise was also shown to be unsubstantiated resulting in optimistic error probabilities. Clipping noise was shown to have an impulsive nature at higher clip levels resulting in a much smoother decay in error probability as the clip level was increased. Chapter 6: Peak to Average Power Solutions Distorted Techniques 131 Clipping at the nyquist rate (no oversampling) does not cause out of band radiation as all the noise was shown to fall in band. Oversampling before clipping was shown [46] to produce less in band noise but to increase the out of band noise, requiring filtering. Quantization in hardware and clipping was also treated where it was shown that some bits could be saved after clipping, due to the Rayleigh distribution of samples without degrading system performance. This also improves the resolution of the clipped samples. Furthermore it was shown that several steps could be taken to mitigate the errors caused by clipping. As clipping noise caused shrinking in the constellation size as

well as a Gaussian like spreading the AGC in the receiver could be used to correct this depending on the clip level. Also, the noise due to clipping will also experience fading along with the signal further lessening its effect. Early windowing and pulse shaping techniques displayed little improvement and sometimes a further degradation in the BER while attempting to reduce the OBR. Later work in this area produced a markedly greater improvement in the PAPR by the selection of appropriate pulse shapes which were applied to individual subcarriers rather than the whole transmit signal. Chapter 7: Reduced Complexity Clipping Algorithms 132

Chapter 7 Reduced Complexity Clipping Algorithms


The previous chapter covered distorted techniques for PAPR reduction. As described clipping suffers after filtering as clipped peaks can regrow resulting in saturation of the HPA. Also, although clipping is less complex in terms of hardware operations than distortionless techniques, estimates of the magnitude still need to be made in order to decide whether a sample needs to be clipped or not and multiplications have to be made to correct the signal. This chapter presents new low complexity clipping techniques which avoid complex hardware operations while maintaining similar performance to conventional clipping. The new clipping algorithms are then implemented in a new clip and filter algorithm which is much less susceptible to peak regrowth after baseband filtering. Section 7.1 describes conventional clipping and a simulation model is developed to quantify the effects that various transceiver components have on the BER1. This model is then used to test the new clipping algorithms. Section 7.2 details a new technique coined Sector Clipping and provides theoretical and simulated analysis of the new method. Section 7.3 presents another new technique which is similar to the CORDIC algorithm but with reduced complexity called Vector Subtraction. Section
1 In

this work the BER will be plotted against clipping level with the noise set to zero. All the errors are therefore caused by clipping noise. The BERs in these plots therefore represent the error floor of the more commonly used BER vs. SNR plots.

Chapter 7: Reduced Complexity Clipping Algorithms 133 7.4 compares the new and existing clipping methods in terms of their baseband Clipping Level (CL) vs. the BER. Section 7.5 implements the new algorithms in a new clip and filter algorithm which is less susceptible to peak regrowth. Finally section 7.6 concludes the chapter with a review of the advantages of the proposed clipping techniques. Note that the results in Sections 7.1 and 7.4.1 show the Bit Error Rate Floor (BERF), i.e. the BER due to clipping as that is the focus of this chapter. For comparison Figure 7.1 shows the BER with Additive White Gaussian Noise (AWGN) in the channel and the CL set at 3, 5dB and no clipping. 4, 16, and 64 M-ary QAM symbols are modulated with a 64 point IFFT and then pulse shape filtered with a Root Raised

Cosine Filter (RRCF) with _=0.35, and 128 taps. It can be observed that 16 and 64 QAM mapping is much more susceptible to clipping noise than 4 QAM. The error floors for 64 QAM can be cross referenced with figure 7.4. The OFDM transceiver system is shown in Figure 7.3.
0 5 10 15 20 25 30 10-4 10-3 10-2 10-1 100 AWGN (dB) BER No clipping CL=5dB CL=3dB

QPSK 16 QAM 64 QAM

Figure 7.1: Average noise in the channel vs. BER for 4, 16, and 64 QAM.

Chapter 7: Reduced Complexity Clipping Algorithms 134

7.1 Conventional clipping


Conventional clipping is defined here as any hardware method which reduces the amplitude of the signal to a predefined level in line with the origin as shown in Figure 7.2.
Figure 7.2: IQ diagram showing conventional clipping region. The vector r is reduced to rclip.

Clipping in this way only introduces amplitude distortion, the phase is unaffected. This method is the most hardware intensive of all methods described in the following sections. Some of the operations require iterative techniques when implemented in fixed point processes (e.g. Division and square roots) and therefore take a number of clock cycles. Other methods require vast LUTs which consume chip area or memory space. Mathematically conventional clipping can be described as '' ( ) r
clip

r x re = clip
clip

rr rr > (7.1) In order to analyze the performance of conventional clipping it is useful to quantify the effects of the different system components involved, i.e. the mapping type, IFFT size, number of filter taps, filter rolloff factor, and IBO of the HPA. A block diagram of the simulation model used is shown in Figure 7.3. rclip r

i
q

x Chapter 7: Reduced Complexity Clipping Algorithms 135


Figure 7.3 Block diagram of simulation model used for clipping models.

The baseband Monte Carlo simulation model of Figure 7.3 randomly generates M-ary mapped data and then modulates the signal with the IFFT. Note the data is buffered at the input to the IFFT so that N samples are fed into the IFFT, after modulation the data is converted back into serial form and sent to the baseband clipping block. After clipping to a predefined level relative to the mean power of the transmit symbol a cyclic prefix can be added to the data. The data is then interpolated by a factor of 8 before being filtered by a matched Root Raised Cosine Filter (RRCF). A SSPA (as described in Section 3.4.1) models the HPA. AWGN and multipath delayed versions of the signal can then be added in the channel. At the receiver side the process is reversed, a matched RRCF filters the received samples which are then decimated by 8 to retrieve the transmitted samples. The cyclic prefix (if used) is removed prior to demodulation with the FFT. Again note that the data is buffered at the input to the FFT until all N samples are ready. Finally, a simple Least Square (LS) algorithm is used to make decisions on the decoded data. A comparison between the gray encoded transmitted bits and the received bits is made to determine the BER. In the results that follow the conventional baseband clipping algorithm is used as described in Figure 7.2. Note that 64 QAM mapping is used unless otherwise stated
Data gen. M-ary mapping IFFT Clip block CP Interp by 8. RRC Filter AWGN Matched RRC Filter Deci. by 8 FFT Remove CP Data De-map decision

BER calc SSPA Chapter 7: Reduced Complexity Clipping Algorithms 136 since the more traditional QPSK has such a high tolerance to clipping excessive simulation times are required to get BER plots. In the following simulations 10,000

OFDM symbols are transmitted, each symbol has 52 information bearing subcarriers. For 64 QAM this means around 3 million bits are transmitted. Effect of filter on the BER Figure 7.4 shows the baseband clip level vs. the BER for a RRCF with different numbers of coefficient taps and roll off factors (named alpha in figure 7.4). There are no other sources of distortion or noise. The number of taps is set at 64, 128 and 256, and the roll off factor or excess bandwidth is set at 0.15 and 0.35.
012345678 10
-7

10
-6

10
-5

10
-4

10
-3

10-2 10-1 100 Clip Level (dB) BER alpha=0.35 alpha=0.15 64 filter taps 128 filter taps 256 filter taps

Figure 7.4: Baseband clip level vs. the BERF for varying RRCF parameters. 64 QAM symbols, 64 point IFFT, LPA, no channel impairments. AWGN=0.

Two points can be made reviewing Figure 7.4, the first is that when the easier roll off or excess bandwidth of _=0.35 is used the number of taps has little effect on the BER. The second point is that when a tighter roll off factor is used (_=0.15) the BER is substantially affected by the number of taps. 64 filter taps has a BER almost 2 magnitudes worse than 128 taps at CL=7dB. This is due to the inband amplitude Chapter 7: Reduced Complexity Clipping Algorithms 137 distortion (ripple) created when insufficient taps are used with a small excess bandwidth (refer to Figures 7.5a and b). This creates a linear spreading of the demodulated samples away from the origin. In all cases _=0.35 has worse performance than _=0.15 for the same number of filter taps. The 0.35 roll off factor curves will have a wide transition bandwidth in the frequency domain. The extreme subcarriers will then be effected by additional attenuation which will reduce the noise margin in the receiver decision (slicer). The 0.15 roll off curves have a steeper transition band and avoid the problem of the extreme subcarriers. However pass-band ripple will be introduced when the number of taps is low (taps=64). The Hiperlan2/802.11a physical layer specification requires a tight roll off, (__0.15) so that the filter skirts occur in the null subcarriers. In all further simulations 128 filter taps with a roll off of _=0.15 will be used unless stated otherwise. Effect of HPA amplifier backoff and IFFT on the BERF Figure 7.6 and Figure 7.7 again shows the clip level vs. BERF with 64 QAM mapping. This time the IFFT size is set to 64 and 128 while maintaining the same
Figure 7.5b: Demapped constellation, M=64, with no clipping or channel impairments. 128 filter taps in RRCF, _=0.15. Figure 7.5a: Demapped constellation, M=64, with no clipping or channel impairments. 64 filter taps

in RRCF, _=0.15.

Chapter 7: Reduced Complexity Clipping Algorithms 138 number of information bearing subcarriers, i.e. the data is therefore critically sampled (almost) and oversampled by a factor of 2 respectively. The other variation is the inclusion of a HPA with IBO. The HPA follows the Rapp model of (3.34) with p=3. This allows us to see the effect of peak regrowth on the BERF with and without oversampling. In Figure 7.6 (os=1) even with an IBO of 4dB above the baseband clipping level (IBO=CL+4dB) peak regrowth still causes a small amount of saturation in the HPA, degrading the BERF. With no extra IBO in the HPA (IBO=CL) the BERF is between 10-2 and 10-3 at CL=6dB while with a LPA the BERF is below 10-4 at 6dB baseband clipping, a 1.5 order of magnitude improvement. In Figure 7.7 (os=2) the performance is better for all IBO across the board. Some of the clipping noise falls into the null bins and is subsequently filtered away while os=1 systems would cause this noise to fold back into the inband subcarriers. For no extra IBO in the HPA the 64 point IFFT has a BERF between 10-2 and 10-3 at 6dB baseband clipping (Figure 7.6) while the 128 point IFFT has a BERF just above 10-3 at the same clip level, an improvement of half a magnitude. Note in Figure 7.7 that for the curve IBO=CL+4 to the LPA curve (IBO=CL+_) there are no errors occurring in over 3 million transmitted bits at 7dB clipping level. In order to further highlight the effect of oversampling, the CCDF in Figure 7.8 is also shown for the case described in Figures 7.6 and 7.7, showing the peak regrowth under critically and oversampled conditions. In this case the baseband clip level is set at 5dB, peak regrowth for the critically sampled case is extreme with almost 5dB peak regrowth at Pr(_>_0)=10-4. Peak regrowth is 3dB (2dB less) for the oversampled case (os=2) at Pr(_>_0)=10-4. Results not shown here indicate that the harder the clip level the more extreme the peak regrowth after filtering. Chapter 7: Reduced Complexity Clipping Algorithms 139
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER IBO=CL IBO=CL+2 IBO=CL+4 LPA

Figure 7.6: Baseband clip level vs. the BERF with varying IBO in HPA. 64 QAM symbols, 64 point IFFT (os=1), RRCF with 128 taps and _=0.15. p=3 in HPA. AWGN=0.
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER IBO=CL IBO=CL+2 IBO=CL+4 LPA

Figure 7.7: Baseband clip level vs. the BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. p=3 in HPA. AWGN=0.

Chapter 7: Reduced Complexity Clipping Algorithms 140


3 4 5 6 7 8 9 10 10-4 10-3 10-2 10-1 100 Pr(>0) 64 point IFFT 128 point IFFT After Filtering After HPA After Baseband Clipping 0 (dB)

Figure 7.8: Simulated CCDF clipped in baseband at 5dB, IBO in HPA set to 8dB for 64 and 128 IFFT.

Effect of constellation size on the BERF. Figure 7.9 shows the effect of changing the mapping constellation M with an oversampling factor of 2 in conjunction with a LPA. Although 4 QAM was simulated, it is extremely impervious to clipping with no detected errors at 0dB and above. 16 QAM also has rather robust performance in the presence of clipping with no errors being detected above 4dB clipping at 10-5 probability. 64 QAM clipping has performance around 2 magnitudes worse at equivalent clipping levels to 16 QAM. Chapter 7: Reduced Complexity Clipping Algorithms 141
01234567 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER 16 QAM 64 QAM

Figure 7.9: Baseband clip level vs. the BERF with varying M-ary constellations. 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. AWGN=0.

Effect of changing P in HPA The model used to simulate the HPA is the SSPA described in Section 3.4.1. Varying the value of p in the SSPA controls the input to output curve of the amplifier as shown in Figure 3.14. Figure 7.10 demonstrates the effect of changing p on the BERF, the saturation level of the SSPA is set equal to the baseband clipping level. Here it is seen that p=1 has an extreme effect on the BERF. P=3 causes a magnitude of degradation over the absolutely linear region of p=1000. As p=3 is a practical value used in many designs [40] it will be used in all further simulations. Chapter 7: Reduced Complexity Clipping Algorithms 142
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB)

BER p=1 p=3 p=5 p=1000

Figure 7.10: Baseband clip level vs the BERF with varying p in the SSPA. 128 point IFFT (os=2), RRCF with 128 taps and _=0.15. HPA backoff set equal to baseband clipping level. AWGN=0.

This section provided a description and simulations of an OFDM system with baseband clipping. The effects of oversampling the IFFT, filter parameters, HPA parameters, and mapping type were simulated to see their effect on the baseband CL vs. the BERF. It can be concluded that a good set of parameters for further analysis of an OFDM system are: Oversampling factor of 2 in the IFFT. RRCF with 128 filter taps and a roll off factor of 0.15. A SSPA with p=3. Chapter 7: Reduced Complexity Clipping Algorithms 143

7.2 New Sector Clipping method


A new technique for clipping developed by the author which reduces hardware complexity is Sector clipping [3] Sector clipping avoids magnitude estimates which require hardware multiplications to perform corrective scaling. The decision to clip is based on the I and Q values in conjunction with comparisons between them, and so divides the clipping region into different sectors. Figure 7.11 shows the I Q plane of Sector clipping with the clipping regions clearly identified. The new method requires only comparators and can be implemented in hardware as either an iterative (to reduce complexity) or parallel structure (to increase speed). The number of sectors can vary from 2 (square clipping) to 5 or more, although no discernable improvement in performance is seen above this number. Note that increasing the number of decisions beyond 5 Sector will clipping increase the complexity as much as multiplications. As seen in Figure 7.11, Sector clipping not only introduces extra amplitude distortion over conventional clipping, but also phase distortion as data outside the clipping regions is not reduced in line with the origin. The symmetry of Sector clipping can be exploited to further reduce hardware complexity, the data can be folded into the first octant by removing the sign bits (making it positive) and making the largest value of the complex signal the real component, the clipping operation is then performed on the new value. Sign bits and the relative size of the real and imaginary components can then be used to extrapolate the original position of the clipped sample. As there are 3 unique sectors in Figure 7.11 this structure is known as 3 Sector clipping. 7.2.1 Theoretical Analysis of Clipping Techniques In order to compare the performance of the various clipping techniques the Clip Level vs. SNR is mathematically derived for 3 cases: Conventional clipping, Sector clipping with 3 sectors, and Square clipping. Square clipping can be construed as a DAC with limited word length. Note that for the theoretical analysis, no oversampling, filtering, or amplifier is assumed. Chapter 7: Reduced Complexity Clipping Algorithms 144
Figure 7.11: I Q diagram showing different sector clipping regions and the direction of data reduction for 3 Sector Clipping.

7.2.1.1 SNR Analysis The relation between the input and output of the clipping operation is expressed pictorially in Figure 7.11 and can be used to find an expression for the SNR.
Figure 7.12: Input output relationship of clipping operation.

Mathematically, this relation between the input, x, and the output, y can be expressed as Bussgangs theorem Noise x Amp y

n
rclip i q Chapter 7: Reduced Complexity Clipping Algorithms 145 y =x + n (7.2) where x in the input signal, y is the output signal and is chosen so that the input signal and the noise, n, is uncorrelated. Finding the 2nd moment of y (7.2) gives

[ ] [( )( )]
yy xx nn

R R E xn R E yy E x n x n =++ =++ 2 2 [ ] (7.3) where Rxx is the autocorrelation of the input signal giving the input power, Ryy is the autocorrelation of the output signal giving the output power. As x is assumed to be uncorrelated to the noise the term 2 E[xn] can be removed and (7.3) can be rearranged and solved for the noise power, Rnn. nn yy xx R = R 2R (7.4) Rearranging (7.2) n = y x (7.5) Taking the expected moments of (7.5) and then the correlation functions yields

[ ] [( )( )]
nn yy xy xx

RRRR E nn E y x y x 2 2 =+ = (7.6)

Equating (7.4) and (7.6) and solving for :


xx xy xx xy yy xx yy xy xx

R R RR RRRRR = = =+ 022 2
2 22

(7.7) Chapter 7: Reduced Complexity Clipping Algorithms 146 The signal, S is equal to: S Rxx = 2 (7.8) and the noise is given in (7.4), the SNR is then (7.9).
yy xx xx

RR R N S
2 2

= (7.9) To reduce mathematical complexity substitutions are made to factor out Rxx. The new SNR is given by (7.10).
2 2

=
xx xy

R NR S

(7.10) The expressions for the correlation functions need to be defined for each of the clipping techniques, which are solved to find a closed form solution. 7.2.1.2 Conventional clipping The definitions for the autocorrelation of the input, autocorrelation of the output and cross correlation for conventional clipping are respectively given below.

_ ()
0 2 (0)

R r . f r r xx (7.11)

_ ()_ ()
clip clip

=+

r clip r yy 0 2 (0)

R r . f r r r 2. f r r

(7.12)

_ ()_ ()
clip clip

=+

r clip r xy 0 2 (0)

R r . f r r r.r . f r r

(7.13) Chapter 7: Reduced Complexity Clipping Algorithms 147 where r is the input data (i.e. the magnitude of the data at the output of the IFFT), rclip is the clipping level, and f(r) is the probability distribution of the data, r, which is assumed to be Rayleigh distributed.

()
2

2 2

e r fr

= (7.14) Note that for the Rayleigh assumption of the distribution of r to hold the number of subcarriers is assumed to be greater or equal to 64 [39]. In order to reduce the mathematical complexity of the correlation functions they are normalized by the average input voltage, 2 .

2 r R = (7.15a) 2
clip clip

r R = (7.15b) 2 r R = (7.15c) Substituting (7.15a, b, c) into (7.11), (7.12), and (7.13) yields the normalized expressions for Rxx, Rxy, and Ryy given in (7.16), (7.17), and (7.18).

_ ()
0 23 (0)

R 4 R . f R R xx (7.16)

_ ()_ ()
clip clip

=+

r clip r yy 0 23 (0)

R 4 R . f R R 4 2 R2 .R. f R R

(7.17)

_ ()_ ()
clip clip

=+

R clip R xy 0 23 (0)

R 4 R . f R R 4 2 R2 .R . f R R

(7.18) Chapter 7: Reduced Complexity Clipping Algorithms 148 where f(R) is the normalized Rayleigh probability distribution of the data, R. f (R) = eR (7.19) The normalised clip level is also required (7.20)
2

dB (R) r dB
clip clip 10 10

20log 2 20log

= __ _ __ _= 10 20
dBclip

R = (7.20) The evaluated correlation expressions of (7.16), (7.17), and (7.18) are shown in (7.21), (7.22), and (7.23). These are substituted into (7.10) to calculate the SNR at various clipping levels.
()
2 0

() ()
02 0

2 xx R = (7.21)

yy Rclip xx

R e R = + (7.22)
() ()

()
0

02

1. 22
xy Rclip clip clip clip xx

R e R R erf R R = = + + (7.23) The SNR vs. clipping level for conventional clipping reduces to (7.24)

()

()
()

()

()
2 22

2 2 0 0

1. 22 11. 22
clip clip clip

R clip clip clip conv R xy R clip clip clip xx

e R R erf R SNR R e e R R erf R R


__ _++_ __= __ +_ ==++_ __ __


(7.24)

A plot of the theoretical SNR vs. conventional clip level is shown in Figure 7.16. Chapter 7: Reduced Complexity Clipping Algorithms 149 7.2.1.3 Sector Clipping In sector clipping the reduction in hardware complexity comes at a cost of introducing extra amplitude distortion and phase distortion as the components of the complex signal are not attenuated by the same scaling factor. This leads to more complex equations for Rxx, Rxy, and Ryy as the data must be represented in terms of its Cartesian co-ordinates. A way to realize the transition from polar to Cartesian is to recognize that the joint probability of two independent, zero mean, quadrature shifted, Gaussian-distributed variables, x and y, with the same create a Rayleigh distribution (7.25).
22

2 22 2

11

(,) 22
xy

f x y e e

= (7.25)
2 22

22 1 (,)
xy

fxye

= 3 sector clipping levels are defined relative to rclip (7.26a, b, c) of the conventional clipping method. The clipping regions for 3 sector clipping are shown in Figure 7.13 showing the vectors of an unclipped signal and the resultant clipped sample. 0 sin clip l = r (7.26a) 12 clip r l = (7.26b) 2 cos clip l = r (7.26c) Chapter 7: Reduced Complexity Clipping Algorithms 150
Figure 7.13: I Q diagram of 1st quadrant of a 3 Sector clipping system showing the vector of an unclipped and clipped sample.

The new sector clipping correlation equations for Rxx, Ryy, and Rxy are listed (7.27, 7.28, 7.29) respectively below. Due to symmetry, only the first quadrant is used

( )()
22 (0) 0 0

. , xx R x y f x y x y =_ _ + (7.27) The autocorrelation of the output equations are set out in (7.28) where the symmetry of the clipping regions is exploited. The limits of the integrals of each part (7.28) dictate what will happen at the output. Hence the first 2 parts of (7.28) do nothing to the data as they are inside the clipping region. The last 3 parts of (7.28) (1, 2, and 3 in Figure 7.13)) perform the attenuation as evidenced by the limits. I Q l0 l1 l2 l0 l1 l2 12 3 1 2 Clipped vector Original vector

Chapter 7: Reduced Complexity Clipping Algorithms 151

()() ()()

( ) () ( ) () ( ) ()
11 20 1 0 2 1 10 11

22 (0) 00 22 0 22 2 0 22 1 22 11

4., 8., 8., 8., 4.,


ll yy ll l l l l ll ll

Rxyfxyxy xyfxyxy lyfxyxy lyfxyxy llfxyxy


=+ ++ ++ ++ ++

__

_ _ _ _

_ _ _ _

(7.28) The cross correlation (7.29) follows the same form as (7.28) where the first 2 parts are inside the clipping regions. The last 3 parts of (7.29) show the translation of the input sample to the output. Again the symmetry of Figure 7.13 is exploited.

()() ()() ()() ()()


()()
11 20 1 0 2 1 10 11

22 (0) 00 22 0 2 2 0 2 1 11

4., 8., 8., 8., 4.,


ll xy ll l l l l ll ll

Rxyfxyxy xyfxyxy xl y f x y x y xl y f x y x y xl yl f x y x y

=+

++ ++ ++ ++

_ _ _ _ _

_ _ _ _ _

(7.29) As in the normalized conventional clipping method, substitutions are made to remove and reduce the complexity of the correlation functions. 2 x X = (7.30a) 2 y Y = (7.30b) 2 x X = (7.30c) 2 y Y = (7.30d) Chapter 7: Reduced Complexity Clipping Algorithms 152 2
0 0

l L = (7.30e) 2
1 1

l L = (7.30f) 2
2 2

L = (7.30g) Substituting (7.30) into (7.27), (7.28), and (7.29) yields the simplified expressions for the 3 sector clipping correlation functions. The normalized equation for Rxx(0) is
()2
0

( )()
22

2 . , xx R X Y f X Y X Y

= _ _ + (7.31) The normalized equation for _ is


() ()

()() ()() ()() ()()


()()
11 20 1 0 2 1 10 11

022 00 0 22 0 2 02 2 1 11

4., 8., 8., 8., 4.,


xy L L xx LL L L L L LL LL

R XYfXYXY R XYfXYXY XL Y f X Y X Y XL Y f X Y X Y XL YL f X Y X Y

==+ ++ ++ ++ ++

_ _ _ _ _
() ()

_ _ _ _ _

(7.32) The normalized equation for Ryy(0) is

()() ()() ()() ()()


()
11 20 1 0 2 1 10 11

022 00 0 22 0 22 02 22 1 2 1

4., 8., 8., 8., 8.,


yy L L xx LL L L

L L LL LL

R XYfXYXY R XYfXYXY LYfXYXY LYfXYXY LfXYXY


=+ ++ ++ ++ +

_ _ _ _ _

_ _ _ _ _

(7.33) Where f(X,Y) is given by (7.34)

()
22

, e XY fXY

= (7.34) Chapter 7: Reduced Complexity Clipping Algorithms 153 The normalized clipping levels are dB (R) r dB
clip clip 10 10

20log 2 20log

= __ _ __ _= 10 20
dBclip

R = (7.35) The evaluated correlation expressions of (7.31), (7.32), and (7.33) are shown in (7.36), (7.38), and (7.40). These are substituted into (7.10) to calculate the SNR at various clipping levels.
()
2

2 xx R = (7.36) _ was calculated to be (7.37)


0

() ()

()() ( ( ) ( )) ( ) ()()() ( ( )) ( ) ( )
2 1 2 0 22 20 22 01

011 1 0 00 21 200 02 10110

2 2 2. 2 2 21 2 2 1. 22
xy L xx L LL LL

R L erf L erf L e

R L erf L erf L erf L e L L erf L e erf L erf L e erf L L e L e erf L erf L


_ _ __ = = _ + _ _ _ __ _ _ __ + _+_ _ _ __ _ _ + + __ __ + __ _ _ __ + __ + __ _ _ __ +(
11 1

( ) ( ) ) ( ( ))

22

1101

22 LL1 L L e erf L erf L e erf L _ _ + _ _ (7.37) Which after expansion and reduction becomes
() ()

()()()()()0
20101 0 xy xx

R erf L erf L erf L erf L erf L R = = + (7.38) Chapter 7: Reduced Complexity Clipping Algorithms

154 The normalized autocorrelation of the output, (0)


(0) yy xx

R R was derived to be (7.39)


() ()

()() ( ( ) ( )) ( ) ()()() ( )( ( )) ( ( )) ( ) ( ( )) ( )
2 1 2 0 22 21 2 0 2 1

011 1 0 00 21 2211 0 200 2022 11 1

2 2 2 2 2 22 2121 2 21 2
yy L xx L LL L L

R L erf L erf L e R L erf L erf L erf L e L erf L L erf L

erf L e e L erf L L erf L erf L erf L e L erf L L erf L e


__=_+_ __ _ _ __ + _+_ _ _ __ _ _ + + + __ __++_+_ __ +++(


2

) ( ( ))( ( ) ( )) ( ( ))
0 00 222 111011

2 2121 L erf L e L erf L erf L erf L L erf L

_ _ __ __ _ _ __ +__+______ (7.39) Which after expansion and reduction becomes (7.40)


() ()

() ()()()

()()() ()()()
22 21 2 1

021 0 0 021 21 2021 22 11010

2 2 21 2121
yy L L xx L

RLL erf L e e R erf L erf L erf L L L erf L erf L e erf L L erf L erf L L erf L

_ _ =_+_ __ + __ __ + __ __ + + __ __ + __ __ (7.40) The closed form solution is shown in (7.41). Chapter 7: Reduced Complexity Clipping Algorithms 155

( ( ) ( ) ( ) ( ) ( ))
() ()()() ()() () ()() ()

()
22 21 2 1

2 20101 3 21 0 021 22 202 1 1 2 110 2 10

2 21 2 21 21
Sec LL L

erf L erf L erf L erf L erf L SNR LL erf L e e erf L erf L erf L erf L erf L erf L erf L L e erf L L erf L erf L L erf L

+ = _ _ _ _ __+__ _ _ _+____ _ _+____ _ __ _+_ __ _+_ ____ __ + _ _ __ _

() ()() ()
2 0 10 1

L erf L erf L erf L __ __ __ ___+_ (7.41) A block diagram showing 3 Sector clipping implemented with a LUT is shown in Figure 7.14. Note that the sign bits are removed in Iin and Qin and attached back on at Iout and Qout. A flowchart detailing the decision matrix in the algorithm is shown in Figure 7.15. From this the LUT in Figure 7.14 is derived and shown in Table 7.1, where it is seen that the LUT requires a 8 bit input and a (N2)+2 bit output. A plot of the theoretical SNR vs. 3 Sector clipping is shown in Figure 7.16. The latency of the structure in Figure 7.14 is low as the level comparisons are made in parallel and fed into the LUT. Further more when reviewing the truth table in Table 7.1 the inputs for 0 and L2 can replaced with logic 0 and 1 respectively reducing the number of inputs to the LUT to 4 and by implication the size of the LUT.
Table 7.1: Truth Table for 3 Sector clipping

Input Output Iin Qin Iclip Qclip Ctrl


I Ctrl
0 L0 L1 L2 0 L0 L1 L2 Q X X 1 X X X 1 X L1 L1 1 1 X X X 1 1 0 X X L2 Qin 1 0 X X 1 X 1 1 0 X L1 Qin 1 0 1 0 X X X X X 1 Iin L2 0 1 1 1 0 X X X 1 X Iin L1 0 1 For all other combinations data is passed through Iin Qin 0 0 Chapter 7: Reduced Complexity Clipping Algorithms 156
Figure 7.14: Block diagram of 3 Sector clipping.

LUT
L0 L1 L2 0

L2 L0 L1 0 N N

Iclip
Control I Control Q N N

Iin Qin Iout Qout Qclip


N N Chapter 7: Reduced Complexity Clipping Algorithms 157
Figure 7.15: Flowchart for the LUT in Figure 7.14 (3 Sector clipping).

Abs(Q)_0 Abs(Q)_L0 Abs(I)_L2 L2+jabs(Q) Abs(Q)>L0 Abs(Q) _L1 Abs(I)_L1 L1+jabs(Q) Abs(I)_L1 Abs(Q)_L1 Abs(I)_0 Abs(I)_L0 Abs(Q)_L2 Abs(I)+jL2 Store sign bits I+jQ Abs(I)>L0 Abs(I)_L1 Abs(Q)_L1 Abs(I)+jL1 Abs(I)+jabs(Q) Reattach sign bits L1+jL1 yes yes yes yes yes yes yes yes yes yes yes yes yes yes no no no no no no

no no no no no no no no Chapter 7: Reduced Complexity Clipping Algorithms 158 7.2.1.4 Square Clipping Square clipping is the least complex form of clipping and is shown in the following theoretical results as a lower bound on the presented clipping techniques. The theoretical SNR for Square clipping can easily be found by using the same method as for 3 Sector clipping and changing the limits on (7.32) and (7.33) where appropriate. That is by setting both I0 and I2 equal to I1 in Figure 7.13. The equations are simpler to derive and can be construed as 2 sector clipping which is in fact Cartesian clipping. 7.2.1.5 Theoretical Results The theoretical results for Conventional, 3 Sector, and Square clipping SNR vs. clip level are shown in Figure 7.16 together with the equivalent simulated results. Note that for the simulated results no filtering, oversampling, or amplification is performed, the noise is measured after clipping. As expected Conventional clipping has the best performance and Square clipping the worst. 3 Sector clipping has performance in the middle, but with greatly reduced complexity requiring only a few comparators and a simple LUT, making it only marginally more complex than square clipping. Generally 3 Sector clipping suffers a 1dB penalty in SNR compared to conventional clipping levels above 0dB. In other words in order for 3 sector clipping to achieve the same SNR as Conventional clipping the clip level needs to be set 1dB higher than the Conventional case. Also of note it is seen that sector clipping performance approaches the same SNR as Square clipping at clip levels below 0dB. This makes sense as harder clip levels reduce the size of regions 1 and 2 in Figure 7.13, making the corner sector the dominant clip region, just as in Square clipping. Therefore 3 Sector and Square clipping share the same lower SNR bound of 2.439dB. Conventional clipping has a lower SNR bound of 5.634dB. Chapter 7: Reduced Complexity Clipping Algorithms 159
-10 -8 -6 -4 -2 0 2 4 6 8 10 0 10 20 30 40 50 60 70 Clip Level (dB) SNR (dB) Theoretical Simulated Conventional 3 Sector Square SNRlimit=5.64dB SNRlimit=2.44dB

Figure 7.16: Theoretical Clip Level vs. SNR for Conventional (Standard), 3 Sector, and Square Clipping. Theoretical (dashed), simulated (solid).

Comparing the simulated curves with the theoretical ones it is seen that they are well matched at clip levels below 6dB. At higher levels of clipping the simulated results have slightly better SNR, this is due to the theoretical Rayleigh distribution assumption of the signal not holding at higher amplitude levels [89]. Simulated OFDM symbols have lower PDF values in the tail of the distribution. Figure 7.17 plots the theoretical 3 sector clipping angle, (shown in Figure 7.11) vs. SNR for clip levels ranging from -10dB (at the bottom of Figure 7.16) to 10dB (at the top of Figure 7.16). The angle is varied from 20 to 40. The choice of has no impact on harder clipping levels, but it does affect SNR performance at weaker levels (CL>5dB). It is seen that the optimum angle for
2

is 27.5, i.e. 0 (

L tan 27.5 L __=__ __ , for all practical clipping levels as shown in Figure 7.17. Chapter 7: Reduced Complexity Clipping Algorithms 160
20 22 24 26 28 30 32 34 36 38 40 0 5 10 15 20 25 30 35 40 45 50 () SNR (dB) CL=10dB CL=-10dB

Figure 7.17: Clipping angle,

(of Figure 7.11) vs. the SNR for 3 sector clipping, based on (7.41).

7.2.2 Extensions of Sector Clipping 3 Sector clipping can be extended to incorporate more sectors thereby improving performance. Figure 7.18 shows the I Q plot of the first quadrant of 4 and 5 Sector clipping. Figure 7.19 depicts simulated results for Sector Clipping with 3, 4, and 5 sectors as well as Conventional and Square clipping for comparison. Increasing the numbers of sectors from 3 to 4 improves the SNR by ~2dB at a clipping level of 6dB. 5 sectors provides a further 0.5dB gain in SNR at the same clipping level. Generally, the SNR difference between the schemes increases with clipping level. The complexity increase for 4 and 5 sectors is minimal over 3 sector clipping requiring only a few extra comparators and a doubling of the LUT size. Chapter 7: Reduced Complexity Clipping Algorithms 161
(a) (b) Figure 7.18: I Q diagram showing the 1st quadrant Sector clipping regions of a) 4 Sector clipping and b) 5 sector clipping.

The degradation in SNR for Sector clipping from optimal Conventional clipping is

~3dB and the improvement in SNR over square clipping is ~4dB at a clip level of 6dB. Results not shown here indicate that increasing the number of sectors above 5 shows no discernable improvement in the SNR. The choice of angles for 4 and 5 sector clipping has been found to be optimum when the angles which determine the sectors are equally spaced, i.e. for 4 sectors: =28.13 and =39.75; and 5 sectors: =11.25, =22.5, and _=33.75. This section detailed a new method for clipping OFDM symbols called Sector Clipping. The new method is very simple to implement requiring only a few extra comparators compared to square clipping, however its performance is closer to conventional clipping where traditionally more complex circuits are required. Chapter 7: Reduced Complexity Clipping Algorithms 162
-10 -8 -6 -4 -2 0 2 4 6 8 10 0 10 20 30 40 50 60 70 Clip Level (dB) SNR (dB) Conventional 5 Sector 4 Sector 3 Sector Square

Figure 7.19: Simulated clip level vs. SNR for Conventional, Square, 3, 4, and 5 Sector Clipping.

7.2.3 Hardware Implementation 3 Sector and Square clipping was implemented in digital form via Visual Hardware Design Language (VHDL). The two algorithms were then simulated using Synopsys to ensure proper operation. Next the algorithms were compiled into Verilog (VHDL code expressed in terms of gates, flip flops, etc.) code and simulated again in Synopsys. The Verilog code was then exported into the Cadence silicon design package, Silicon Ensemble, where routing and cell placement was performed with a 0.5 standard cell and port library. After routing a gds2 file was produced which could then be sent to the foundry for production. However, this was not done, therefore the algorithm was only proved through hardware simulation. The design flow is summarized in Figure 7.20. Figure 7.21 depicts the layout of the 3 sector clipping algorithm with N=8 bit inputs. A gain table was included to allow 16 different Clipping levels (CL) set from -3dB to 12dB. Chapter 7: Reduced Complexity Clipping Algorithms 163
Figure 7.20: Design flow for silicon implementation of 3 sector clipping. Figure 7.21: Block diagram of 3 Sector clipping implemented in VHDL.

VHDL Simulate (Synopsys) Verilog Silicon Ensemble

(Cadence) Gds2 file Foundry Cmos 0.5 standard cell library testing Standard port library

8 8 7 7 7 8 8 4 I shift reg. Q shift reg. Input data Iserial in erial Qserial in erial Iin eQriin eri Qin eri Qout erial Iout eri CL eri

Sector Clipper

(Fig 7.14)
Gain table
Chapter 7: Reduced Complexity Clipping Algorithms 164 Due to the clipping operation the Most Significant Bit (MSB) on each of the I and Q channels can be removed at the output. In order to save the number of input/output pins required, the I and Q input data is fed serially (Iin serial and Qin serial) into shift registers which then feed 8 bit words (Iin and Qin) into the 3 Sector clipping algorithm. The input data word is stored and output in parallel fashion for comparison to the clipped sample. Table 7.2 shows a summary of various Synopsys reports for the power consumption, delay path, and cell area of Square and 3 Sector clipping. System with buffer refers to the whole system shown in Figure 7.21 with the input and output buffers included. The buffers are necessary when the system is implemented as a stand alone device in silicon. In practice however the buffers are not necessary as only the clipping algorithm is implemented as a block in the whole OFDM transmitter.
Table 7.2: Synopsys reports for Square and 3 sector clipping. (Refer to Figure 7.21)

Hierarchy level Report Square clipping 3 sector clipping System with buffer Total cell area 13593.52 dbu 13827.55 dbu Total dynamic power 23.268mW 35.3874mW System Total cell area 357.16 dbu 591.1800 dbu Total dynamic power 11.5896mW 17.5455mW Sector Clipper Total cell area 110.37 dbu 385.38 dbu Total dynamic power 6.8317mW 8.7933mW Gain table Total cell area 62.70 dbu 131.08 dbu Total dynamic power 4.4682mW 8.3737mW I and Q shift registers Total cell area 37.36 dbu 37.36 dbu Total dynamic power 0.1893mW 0.1893mW System again refers to Figure 7.21 without the input and output buffers. The cell area is greatly reduced in this case as the buffers take up a lot of space. The algorithm only takes up 4.27% of the total cell area for 3 Sector clipping and only 2.62% for Chapter 7: Reduced Complexity Clipping Algorithms 165 Square clipping. The clipping algorithms themselves (Sector Clipper in Table 7.2) have an area of 385.38 dbu and 110.37 dbu for 3 Sector and Square clipping respectively, the power consumption is 8.7933mW and 6.8317mW for 3 Sector and Square clipping respectively. The 3 Sector Gain Table requires 50% more cells than Square clipping and the power consumption is doubled due to the extra clip levels required for sector clipping. The shift registers are the same in both systems. Table 7.3 and 7.4 show the Cadence reports for Square and 3 Sector clipping

respectively. A 0.5 process with 3 metal layers was used. As is seen in the Application Specific Integrated Circuit (ASIC) view of 3 Sector clipping Figure 7.22 the size of the chip is determined by the input/output pads which are abutted to make the area as small as possible. This made the wire routing process easy as there was a lot of room to work with. The area of utilization (occupied chip area) is 74.78% in Square clipping and 75.62% for 3 Sector clipping, a small difference. Comparing the number of CORE Rows and CORE Cells for Square and 3 Sector clipping algorithms it is seen that Square clipping requires 22 rows and 148 cells while 3 Sector clipping requires 26 rows and 319 cells.
Table 7.3: Cadence area utilization report on Square clipping Type Number Length Area %_Row_Space CORE Rows 22 1205160 3012900000 CORE Cells 148 237380 593450000 19.70 CORNERSITE_495 Rows 4 198000 9801000000 CORNERSITE_495 Cells 4 198000 9801000000 100.00 IOPADSITE_495 Rows 4 420000 20790000000 IOPADSITE_495 Cells 28 420000 20790000000 100.00 Area of chip: 41699680000 (square DBU) Area required for all cells: 31184450000 (square DBU) Area utilization of all cells: 74.78%

Chapter 7: Reduced Complexity Clipping Algorithms 166


Table 7.4: Cadence area utilization report on 3 Sector clipping Type Number Length Area %_Row_Space CORE Rows 26 1693120 4232800000 CORE Cells 319 390060 975150000 23.04 CORNERSITE_495 Rows 4 198000 9801000000 CORNERSITE_495 Cells 4 198000 9801000000 100.00 IOPADSITE_495 Rows 4 420000 20790000000 IOPADSITE_495 Cells 28 420000 20790000000 100.00 Area of chip: 41744615000 (square DBU) Area required for all cells: 31566150000 (square DBU) Area utilization of all cells: 75.62% Figure 7.22: Application Specific Integrated Circuit (ASIC) view of 3sector clipping algorithm implemented in Silicon using Cadence Silicon Ensemble. 0.5 process, 3 metal layers.

Chapter 7: Reduced Complexity Clipping Algorithms 167

7.3 New Vector Subtraction clipping method


This section describes another new method called Vector Subtraction which is an enhancement of an existing algorithm [101] called the Lucent algorithm here, to clip samples. Vector Subtraction reduces complexity of the Lucent algorithm by removing the need for divisions, which are complex operations in hardware and add significantly to the complexity of the original algorithm. 7.3.1 Lucent Algorithm The Lucent algorithm produces good estimates of the magnitude in K iterations. The

Lucent algorithm works as follows, first the complex sample, x = xi + jxq is folded into the first octant to give
'

max

x is then rotated by a number, K, of fixed phase angles, k, towards the real axis. The phase angle which returns the largest real part gives the closest approximation to the direction (phase) of the vector x, and the magnitude of the real part, x , is the closest approximation to the actual magnitude. The K phase values are spaced in the octant as 16
k k

( , ) .min ( , )

iqiq

x = x x + j x x (7.42)

l K = (7.43)

where {1,3,5,..., 2 1} k l = K . For all K values of k the magnitude estimate is given by max Re
k

( ( . ))
'
k

xxe

= (7.44) Chapter 7: Reduced Complexity Clipping Algorithms 168 After the magnitude has been estimated, any samples exceeding the clipping threshold, clip x , are multiplied by a scaling factor (which reintroduces complexity that was mitigated by the iterative magnitude estimator), the clipped output signal is given by

( )
out

clip

xxx x x __ =___
clip clip

xx xx > (7.45) The only error in this technique is a slight under estimate of x which reduces as K increases. The scheme has many similarities to the CORDIC [102] method, but gives

better estimates of the magnitude at low values of K. The scaling operation requires a division which is a complex hardware operation and can be avoided by using the new Vector Subtraction technique described below. 7.3.2 Vector Subtraction The process of finding the magnitude estimate, x , is the same as [101] described in (7.44), but the scaling operation is replaced by a subtraction. First, the overshoot, os, is calculated s clip o = x x (7.46) and then subtracted from the signal, x. However the phase of the main signal is not known therefore the best estimate of the phase is used, this is k,max. The overshoot, os, is rotated by j k ,max e giving the correction vectors yi and yq.
j k ,m iqs

y y jy o e = + = (7.46a) The correction vectors are octant adjusted prior to subtraction from the original signal, x. Chapter 7: Reduced Complexity Clipping Algorithms 169

( ) x = x y , y (7.47) ( ) x = x y , y (7.48)
ii iq '' qqqi ''

The addition/subtraction of yi and yq can be extrapolated from the maximum and minimum values of xi, xq and the sign bits of the original data. A block diagram of Vector Subtraction is shown in Figure 7.23.
Figure 7.23: Block diagram of new Vector Subtraction scaling operation.

Figure 7.24 shows the I Q diagram for Vector Subtraction with K=2 iterations corresponding to phases of 16 and 3 16 . The vector x is closest to the 16 vector. Note that the clipped value, x, has an additional phase error compared to the Lucent method. It has both amplitude and phase error compared to the ideal clipped value. Figure 7.25 and 7.26 show the CCDF of the Lucent algorithm [101] and the new Vector Subtraction variation respectively where 1000 OFDM symbols (N=64) are clipped at 5dB. Here it is seen that a slight underestimate in the magnitude means that samples are not always clipped back to the desired level. This is a function of the number of iterations in the algorithm with 1 iteration underestimating the magnitude by 0.7dB and 1.2dB to pass through in the Lucent algorithm and Vector Subtraction respectively. This problem is exacerbated in Vector Subtraction where the new scaling operation magnifies the error in the magnitude estimate. yi yq ix q x x clip x > x s clip o = x x j (1..k ) e
(-,+)
'' i

(-,+) ''

x
,max

( )
Re jk e

( )
,max Im jk e

Chapter 7: Reduced Complexity Clipping Algorithms 170


Figure 7.24: IQ plane for the new Vector Subtraction method showing vector, x being clipped to x.

The amount of error in the magnitude estimate increases with the decrease in clipping level for Vector Subtraction with clipping under 3dB requiring a prohibitive amount of backoff. This is not an issue with the Lucent patent where the magnitude estimate error is constant irrespective of clip level. The upshot of this is that the clip level will need to be backed off to avoid saturation of the amplifier. Alternatively more than 1 iteration can be used making the error in the magnitude estimate small. This small error can normally be neglected as filtering will cause substantial peak regrowth anyway.

I Q
yi yq k,max
x

x
Chapter 7: Reduced Complexity Clipping Algorithms 171
4 5 6 7 8 9 10 11 10
-3

10
-2

10
-1

10
0

0 (dB) Pr(>0)
Unclipped 1 iteration 2 iterations 4 iterations 3 iterations

Figure 7.25: Simulated CCDF for Lucent patent [101] for various iterations clipped at 5dB showing the leakage of under clipped samples.

4 5 6 7 8 9 10 11 10
-3

10
-2

10
-1

10
0

0 (dB) Pr(
>

) 1 iteration 4 iterations 3 iterations 2 iterations Unclipped

Figure 7.26: Simulated CCDF for Vector Subtraction for various iterations clipped at 5dB showing the leakage of under clipped samples.

Chapter 7: Reduced Complexity Clipping Algorithms 172 The simulated clip level vs. SNR results are shown in Figure 7.27 and 7.28 for the Lucent algorithm and Vector Subtraction method respectively. Note that the clip level is adjusted depending on the iteration and clipping method according to the CCDF results of Figures 7.25 and 7.26, for example the Lucent patent with 1 iteration requires the clip level to be set to CL-0.7. This ensures that the samples will not saturate the HPA if it were present. When 2, 3, or 4 iterations are used, both the Lucent patent and Vector subtraction have very similar performance to the conventional clipping method. 1 iteration results in a 5dB degradation from the optimum clipping method for the Lucent algorithm while 1 iteration in Vector Subtraction results in a more serious degradation across the board, making it impractical; In fact its performance goes below that of Square clipping for clip levels below 4dB. The reason for this is the increasing amount of back off required at harder clip levels to avoid saturation of the amplifier.
0 1 2 3 4 5 6 7 8 9 10 0 10 20 30 40 50 60 70 Clip Level (dB) SNR (dB) Conventional clipping Lucent - 4 iterations Lucent - 3 iterations Lucent - 2 iterations Lucent - 1 iteration Square clipping

Figure 7.27: Simulated clip level vs. SNR for Lucent clipping technique with varying iterations, as well as Conventional and Square clipping.

Chapter 7: Reduced Complexity Clipping Algorithms 173


0 1 2 3 4 5 6 7 8 9 10 0 10 20 30 40 50 60 70 Clip Level (dB) SNR (dB) Conventional Clipping Vector Subtraction - 4 iterations Vector Subtraction - 3 iterations

Vector Subtraction - 2 iterations Vector Subtraction - 1 iteration Square Clipping

Figure 7.28: Simulated clip level vs. SNR for Vector Subtraction clipping with varying iterations, as well as Conventional and Square clipping.

7.4 Comparison of new and existing clipping methods


The previous sections described new low complexity clipping algorithms and evaluated them both theoretically and through simulation. Section 7.4.1 analyses their performance in the OFDM system described in Section 7.1. Section 7.4.2 compares them in terms of their complexity by comparing hardware operations. The mapping type used in the following simulations is 64 QAM, the number of taps in the RRCF is 128 with a roll off factor of 0.15, the oversampling rate in the IFFT is set to 2. A LPA is used initially and later simulations use a HPA (P=3) with increasing backoffs. Conventional and Square clipping are also shown as a reference. 7.4.1 BERF Figure 7.29 shows the baseband clipping vs. BERF for 3, 4, and 5 Sector clipping where it is seen that increasing the number of sectors to 4 provides an improvement of 1 and a half magnitudes over Square clipping at 4dB clipping. Increasing the number of sectors to 5 provides a more modest decrease in the BERF beyond 4 Sector Chapter 7: Reduced Complexity Clipping Algorithms 174 clipping at 4 dB clipping. 5 sector clipping is 1 magnitude worse than conventional clipping at 4 dB clipping. At 5 dB clipping Conventional, 4, and 5 Sector clipping have a BERF below 10-6. Note that Sector clipping over clips the data therefore performance after a HPA with a limited backoff may be better due to the extra regrowth allowed. This will be explored further later in this section.
012345678 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER Conventional 5 sectors 4 sectors 3 sectors Square

Figure 7.29: Simulated 3, 4, and 5 Sector, Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.

Figure 7.30 shows baseband clipping vs. BERF for Vector Subtraction with 1, 2, 3, and 4 iterations. Performance is nearly the same for all methods, however curiously 1 iteration has the best performance while conventional clipping has the worst. This can be explained by reviewing Figure 7.25 where it seen that Vector Subtraction under clips some samples which would lead to saturation of the amplifier had it not been a LPA. Chapter 7: Reduced Complexity Clipping Algorithms 175
012345678 10-5 10-4 10-3 10-2 10-1

100 Clip Level (dB) BER Conventional 4 iterations 3 iterations 2 iterations 1 iteration Square

Figure 7.30: Simulated Vector Subtraction (1, 2, 3, and 4 iterations), Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.

Figure 7.31 shows baseband clipping vs. BERF for the Lucent algorithm [101] where it seen that the performance of all iterations is almost the same as Conventional clipping. Again 1 iteration has slightly better performance than the other methods due to the under clipping of some samples as seen in Figure 7.25. Vector Subtraction has an almost identical performance to the Lucent method except for the trivial case of 1 iteration. Chapter 7: Reduced Complexity Clipping Algorithms 176
012345678 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER Conventional 4 iterations 3 iterations 2 iterations 1 iteration Square

Figure 7.31: Simulated Lucent [101] clipping (1, 2, 3, and 4 iterations), Conventional and Square clipping vs. BERF with a LPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.

This section simulated the performance of the new and existing clipping algorithms in an OFDM environment with a LPA. The rest of this section uses a HPA with different IBOs relative to the baseband clipping level. Note that Sector Clipping overclips the signal in some instances while Vector Subtraction and the Lucent algorithm under clip the signal which explains the higher BERFs of the Sector clipping when the HPA is taken into account. Changing HPA backoff Figures 7.32, 7.33, and 7.34 show baseband clipping vs. BERF for 3, 4, and 5 Sector clipping, respectively, with increasing backoff in the HPA. Increasing the IBO in the HPA improves the BERF in all cases. Chapter 7: Reduced Complexity Clipping Algorithms 177
012345678 10-7 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA

IBO=CL+2 IBO=CL+1 IBO=CL 0.8dB

Figure 7.32: Simulated 3 Sector clipping, vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and =0.15. AWGN=0.
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2 IBO=CL+1 IBO=CL 0.8dB

Figure 7.33: Simulated 4 Sector clipping, vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and =0.15. AWGN=0.

Chapter 7: Reduced Complexity Clipping Algorithms 178


012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 LPA IBO=CL+2 IBO=CL+1 IBO=CL 0.8dB Clip level (dB) BER

Figure 7.34: Simulated 5 Sector clipping, vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and =0.15. AWGN=0.

Comparing Figures 7.32 and 7.34, 5 Sector clipping is just under 1 magnitude better in terms of the BERF than 3 Sector clipping at 5dB baseband clipping with an IBO of 2dB above the baseband clip level (i.e. IBO=7dB). At an IBO=CL+2 the BERF is within 1 magnitude of the LPA at 5dB clipping for all methods and within half a magnitude at 4dB clipping. 1 dB of additional baseband backoff is required for 3 and 4 Sector and slightly more for 5 Sector clipping to maintain the same BERF as an LPA at a BERF=10-4 with no additional IBO in the HPA. The non-linear characteristics of a HPA adds another source of distortion to the transmitted signal. Figures 7.32 to 7.34 show that this additional distortion is equivalent to a 0.8dB change in the CL (at BERF=10-4) when the HPA saturation level is equal to the Chapter 7: Reduced Complexity Clipping Algorithms 179
012345678 10-7 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2

IBO=CL+1 IBO=CL 1.5dB

Figure 7.35: Simulated Vector Subtraction (1 iteration), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and =0.15. AWGN=0.
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2 IBO=CL+1 IBO=CL 1.2dB

Figure 7.36: Simulated Vector Subtraction (4 iterations), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps and =0.15. AWGN=0.

clipping level (IBO=CL). Obviously, as the amplifier is backed off, IBO=CL+2, then the additional distortion is reduced. Chapter 7: Reduced Complexity Clipping Algorithms 180 Figures 7.35 and 7.36 show baseband clipping vs. BERF for Vector Subtraction with 1 and 4 iterations, with increasing backoff in the HPA. Increasing the IBO improves the BER in all cases. There is little difference between 1 and 4 iterations when HPA=CL but this increases with the larger IBO in the HPA. However 1 iteration out performs 4 iterations for all HPA backoffs. This is due to the underestimation of the magnitude reducing the clipping distortion. At an amplifier backoff of HPA=CL+2 the BERF is within 1 magnitude of LPA performance at 5dB clipping for all methods and within half a magnitude at 4dB clip level. When no extra backoff is allowed in the HPA above the baseband clip level the performance of 1 and 4 iterations is almost the same, with an additional backoff of 2dB in the HPA (HPA=CL+2) 1 iteration outperforms 4 iterations by 0.2 dB at a BERF=10-4. Figures 7.37 and 7.38 show baseband clipping vs. BERF for the Lucent patent with 1 and 4 iterations respectively, and increasing backoff in the HPA. As with the 2 previous methods increasing the IBO improves the BERF. In Figure 7.37 (1 iteration) at a baseband clip level of 6dB there is a difference of 1.5 magnitudes between HPA=CL and a LPA. Under the same conditions the difference is under 2 magnitudes for Figure 7.38 (4 iterations). When no extra backoff is allowed in the HPA both 1 and 4 iterations have approximately the same performance. The performance of 4 iterations is slightly worse for IBO=CL+1 and IBO=CL+2. For comparison Square clipping with different backoffs in the HPA is plotted in Figure 7.39. Here it is seen that like Sector clipping Square clipping is more robust against a reduction in the amplifier backoff with only half a magnitude in difference between a LPA and a HPA with no additional IBO (IBO=CL). Comparing the 3 methods it is interesting to note that Sector clipping has the least increase in BERF from IBO=CL to LPA with a difference of 1.2, 1.5, and 1.5 magnitudes Chapter 7: Reduced Complexity Clipping Algorithms 181
012345678 10-7 10-6

10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2 IBO=CL+1 IBO=CL 1.4dB

Figure 7.37: Simulated Lucent clipping (1 iteration), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.
012345678 10-6 10-5 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2 IBO=CL+1 IBO=CL 1.2dB

Figure 7.38: Simulated Lucent clipping (4 iterations), vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.

Chapter 7: Reduced Complexity Clipping Algorithms 182 for 3, 4, and 5 Sector clipping respectively. Vector Subtraction has a difference of 2.5 and 2 magnitudes for 1 and 4 iterations respectively, while the Lucent algorithm has a difference of 2.2 and 2 magnitudes.
012345678 10-4 10-3 10-2 10-1 100 Clip Level (dB) BER LPA IBO=CL+2 IBO=CL+1 IBO=CL 0.4dB

Figure 7.39: Simulated Square clipping vs. BERF with varying IBO in HPA. 64 QAM symbols, 128 point IFFT (os=2), RRCF with 128 taps, and =0.15. AWGN=0.

Table 7.5 compares the baseband clip level for the aforementioned clipping techniques required to maintain a BER of 10-4 for different backoffs in the HPA. For 3 and 4 Sector clipping decreasing the difference from a LPA to no additional backoff in the HPA (IBO=CL) comes at a cost of around an additional 1dB increase in the baseband clip level to maintain the same BERF=10-4, and a 1.1dB increase for 5 Sector clipping. For Vector Subtraction and the Lucent clipping method the increase required is around 1.5dB for 1 iteration and 1.3 dB for 4 iterations. Conventional (Figure 7.6) clipping requires a 1.2dB increase in clipping level and Square clipping only requires an extra 0.4dB to maintain the same BERF. Reviewing the results in terms of performance at a set HPA backoff it is seen that for 3 Sector clipping there is an additional 0.4dB of extra baseband clipping backoff Chapter 7: Reduced Complexity Clipping Algorithms 183

required than for 5 Sector clipping with no additional backoff in the HPA, and an additional 0.6dB required under LPA conditions. Note that for 3 and 4 Sector clipping an extra backoff of 2dB in the HPA is sufficient to provide near optimal results (i.e. under LPA conditions), however for 5 Sector clipping an extra backoff of 3dB is required to approach the LPA results. Performance of Vector Subtraction and the Lucent method for both 1 and 4 iterations have similar performance at a set amplifier backoff, interestingly 1 iteration slightly outperforms 4 iterations. Underestimation of the amplitude must therefore dominate yje phase error in the two schemes.
Table 7.5: Baseband clip level required to maintain a BER=10-4 at varying IBO in HPA.

Baseband clip level required for BER=10-4 at relevant HPA backoff HPA backoff CL CL+1 CL+2 LPA Sector Clipping 3 sectors (Fig 7.32) 7.5 dB 7 dB 6.9 dB 6.6 dB 4 sectors (Fig 7.33) 7.3 dB 7 dB 6.7 dB 6.3 dB 5 sectors (Fig 7.34) 7.2 dB 6.9 dB 6.6 dB 6.1 dB Vector subtraction 1 iteration (Fig 7.35) 7 dB 6.4 dB 6 dB 5.5 dB 4 iterations (Fig 7.36) 7 dB 6.6 dB 6.2 dB 5.8 dB Lucent 1 iteration (Fig 7.37) 7 dB 6.4 dB 6.1 dB 5.6 dB 4 iterations (Fig 7.38) 7 dB 6.6 dB 6.2 dB 5.8 dB Conventional
Conventional (Fig 7.7)

7 dB - 6.2 dB 5.8 dB Square Square (Fig 7.39) 8.4 dB 8.2 dB 8.1 dB 8 dB Chapter 7: Reduced Complexity Clipping Algorithms 184 7.4.2 PSD results The PSD after the HPA and receiver filtering is shown in the following figures. The filter used is the same as the previous simulations, i.e. Matched RRCF, _=0.15, 128 filter taps and the HPA is a SSPA with p=3. As in Section 3.4.2 the PSD is measured for each OFDM block and then averaged over many blocks. In Figures 7.40 and 7.41 the baseband clip level is set at 0dB and 5dB respectively and the amplifier backoff is set at increasing levels above this clip level (IBO=CL, CL+1, CL+2, CL+3, CL+4, and a LPA). Baseband clipping at 0dB results in a large amount of both in band and out of band distortion (ACI). With no additional backoff in the HPA (IBO=CL) the ACI is only 18dB below the signal power and the inband distortion is 3dB below the LPA case. Increasing the IBO by 1dB (IBO=CL+1) results in spectral splatter 20dB below the signal power and an inband distortion of 2.5dB. With IBO=CL+2 the ACI is 22dB below the signal power and the inband distortion is 1.5dB. An IBO level set at CL+3 results in 23dB ACI and 1.3dB inband distortion. Even with a backoff of 4dB in the amplifier (IBO=CL+4) 24dB in ACI is present and the inband distortion is 1dB below the LPA case.

In Figure 7.41 the baseband clip level is set at 5dB and the amplifier backoff is set at increasing levels above the clip level as in Figure 7.40. Here it is seen that there is much less distortion, both in band and out of band. There is only 0.5dB difference in inband distortion between no additional IBO in the HPA and a LPA, and the ACI is 24dB for the IBO=CL case, and for higher backoffs is almost non existent. These PSD results of 7.40 and 7.41 show that harder clipping in the baseband results in greater peak regrowth which leads to heavier saturation of the amplifier, hence the greater ACI and in band distortion. The inband distortion leads to a worse BERF as the difference between signal power and noise is reduced, i.e. SNR is reduced. The out of band distortion will lead to interference with adjacent channels. Chapter 7: Reduced Complexity Clipping Algorithms 185
-4 -3 -2 -1 0 1 2 3 4 -50 -45 -40 -35 -30 -25 -20 -15 -10 -5 0 Normalized Frequency (Hz) Absolute Power (dB) LPA HPA=CL

Figure 7.40: PSD; 0dB clipping in baseband with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps.
-4 -3 -2 -1 0 1 2 3 4 -45 -40 -35 -30 -25 -20 -15 -10 -5 0 Normalized Frequency (Hz) Absolute Power (dB) LPA HPA=CL

Figure 7.41: PSD; 5dB clipping in baseband with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps.

Chapter 7: Reduced Complexity Clipping Algorithms 186


-4 -3 -2 -1 0 1 2 3 4 -50 -45 -40 -35 -30 -25 -20 -15 -10 -5 Normalized Frequency (Hz) Absolute Power (dB) HPA=CL LPA

Figure 7.42: PSD; after 0dB clipping in baseband and receiver filtering, with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps.
-4 -3 -2 -1 0 1 2 3 4 -50 -45 -40 -35 -30

-25 -20 -15 -10 -5 0 Normalized Frequency (Hz) Absolute Power (dB) HPA=CL LPA

Figure 7.43 : PSD; after 5dB clipping in baseband and receiver filtering, with increasing amplifier backoffs (HPA=CL, CL+1,CL+2,CL+3,CL+4, LPA). RRCF, alpha=0.15, 128 taps.

Chapter 7: Reduced Complexity Clipping Algorithms 187 Figures 7.42 and 7.43 show the PSD after receiver filtering for 0dB and 5dB baseband clipping respectively. Here as expected the out of band distortion is mitigated but the in band distortion remains.

7.5 New adaptive clipping method


Clipping after interpolation and filtering will ensure that the amplifier does not saturate eliminating AM to PM distortions in the amplifier, but results in ACI affecting adjacent channels. This effect can be compensated by putting the clipper before the filter as done earlier in this chapter. However, as shown clipping and filtering can regrow peaks causing saturation of the amplifier resulting in an increase in the BERF. This section introduces a new adaptive clip and filter algorithm, Level Detection Algorithm (LDA) first presented in [103] which overclips the signal at certain times avoiding the peak regrowth issue and sparing the amplifier from saturation. However LDA as presented in [103] requires the use of a conventional clipper which adds significantly to the overall complexity and latency of the algorithm. Latency is an important issue in LDA, therefore Vector Subtraction is very useful in this algorithm. In this section Vector Subtraction with 2 iterations is used in place of the conventional clipper in LDA and their performance is compared through simulation. Vector Subtraction has the advantage of providing good estimates of the error magnitude with low latency, and low complexity, both of which are important in LDA. LDA uses an extra matched filter before the standard pulse shaping filter to predict the response of the signal from which the amount of compensation required can be calculated. A block diagram showing the LDA algorithm is shown in Figure 7.44 with a more detailed view of the filtering operation shown in Figure 7.45. The input data is modeled as a complex Gaussian process which describes accurately either an OFDM (N>64) or CDMA distribution of data where the algorithm could be implemented. The input data is interpolated to form the signal, x(n), (Figure 7.46a) and then fed to 2 identical filters. The first filter is used for peak detection and for generation of the correction vector, v(n). The suitably delayed correction vector is subtracted from the signal, x, before being passed to the second filter, and on to the Chapter 7: Reduced Complexity Clipping Algorithms 188 rest of the transmission chain. The filter used is a linear phase RRCF, in order to compensate for the group delay of the filter the subtraction point is set at the centre of the first filters delay line. This reduces the potential for additional peaks by including the latter half of the smeared v(n) waveform in the output estimate y(n). The clipping processor compares the amplitude of the first filter output with the

Clipping Level (CL) threshold to detect peaks. When the filter output magnitude, |y(n)|, exceeds the CL as shown in (Fig 7.46b) the correction vector, v(n) is subtracted from the second filter input. The correction vectors (Figure 7.46c) show the positions where the correction takes place at n-1, n, and n+1. Figure 7.47 shows the I Q diagram for LDA showing how the vector, v(n), is calculated based on the filter input samples. The correction required at the filter output to stop saturation at the amplifier is _(n) which is in phase with y(n) and has amplitude

(n) = y (n) CL (7.49) When the correction vector v(n) is added to the centre of the filter delay line, _(n) will be

( ) ( ) 0 n = h v n (7.50)
where h0 is the central filter tap value. Combining (7.49) and(7.50) gives v(n) as

()() ()
0

1 CL y n vn ynh ____ = __ __ __ __ ____ (7.51) Chapter 7: Reduced Complexity Clipping Algorithms 189
Figure 7.44: Block diagram of system with Level Detection Algorithm (LDA). Figure 7.45: Detailed block diagram of the LDA.

A limitation with LDA is that the correction vector smears into other parts of the signal, which can introduce new peaks where none existed before. This problem is especially apparent when a number of peaks appear successively. To combat this problem the baseband clip level CL is set lower than the Amplifier Clip Level (ACL), the saturation level of the amplifier. The lower CL reduces the chance of regrown peaks saturating the amplifier. Clipping processor

z-1 RRCF z-1

+
CL

h0 h1

v(n) y(n) x(n) x(n-1)


delay 1st filter Clipping processor

8 2 n d filter
I/Q I/P

y x v(n) CL
I/Q O/P

1st filter 2nd filter


Chapter 7: Reduced Complexity Clipping Algorithms 190
Figure 7.46: a) Amplitude of zero padded input to filter b) amplitude of filtered output c) LDA correction vectors.

In order to compare the performance of the LDA 2 parameters are used: the Clipping Error (CE) produced by the amplifier and the Mean Square Error (MSE) introduced by the compensation vector. The CE is the noise power of the clipped part in the amplifier, and is wideband and spread over many channels causing ACI. The MSE distortion introduced by the compensation filters is filtered by the second filter so no extra ACI is generated. The MSE introduces in band distortion that interferes with the desired signal.
n+1- n n+1 n+1+ |x(n)| time

CL CL+|_(n)|
|y(n)| time n time n |v(n)|

(a) (b) (c)


n-1 n+1

Chapter 7: Reduced Complexity Clipping Algorithms

191
Figure 7.47: Vector representation of filtering with 3 active taps and the required correction vector to bring the output back to CL.

A block diagram of the simulation model used to evaluate LDA is shown in Figure 7.48. The interpolation factor is 8 and the RRCF has a roll off factor of 0.2. The HPA amplifier used in this case is just a linear limiter which saturates at ACL while the baseband clipper saturates at CL. A matched RRCF simulates the receiver filter and a decimator follows to sample the received data symbols. The measured clipped power is the average power of the difference between the amplifier input and output signals. Similarly, the MSE is the average power of the difference between the original data samples and the received ones.

x(n+1)h0 x(n+1-)h-(1-) x(n+1+)h-(1+) y(n) (n)=v(n)h0 Correction vector CL I Q


Chapter 7: Reduced Complexity Clipping Algorithms 192
Figure 7.48: Block diagram of the simulation model used to evaluate LDA.

Figures 7.49 and 7.50 plot the simulated CL versus the inband distortion at the receiver. The ACL is set at 5dB and CL is varied from 2 to 9dB in Figure 7.49, and in Figure 7.50 the ACL is set at 8dB and the CL is varied from 7 to 12dB. Curves are plotted for Vector Subtraction with 2 iterations and Conventional clipping, both of which are implemented in a 64 tap and 128 tap filter. Random data source LDA

(Figure7.45) Amp

Measure clip noise RRCF Measure noise RRCF RRCF Delay Clipping Error (CE) Transmit signal power In band distortion (MSE) Receive signal power

CL ACL

Chapter 7: Reduced Complexity Clipping Algorithms 193


23456789 -26 -24 -22 -20 -18 -16 -14 -12 -10 SCL to input power Inband distortion to Rx signal Conventional - 64 taps Vector Sub - 2 iter - 64 taps Conventional - 128 taps Vector Sub - 2 iter - 128 taps

Figure 7.49: In band distortion for Conventional and Vector Subtraction (2 iter) with 64 and 128 taps in compensation filter. ACL=5dB.
7 7.5 8 8.5 9 9.5 10 10.5 11 11.5 12 -44 -42 -40 -38 -36 -34 -32 -30 -28 SCL to input power Inband distortion to Rx signal Conventional - 64 taps Vector Sub - 2 iter - 64 taps Conventional - 128 taps Vector Sub - 2 iter - 128 taps

Figure 7.50: In band distortion for Conventional and Vector Subtraction (2 iter) with 64 and 128 taps in compensation filter. ACL=8dB.

Chapter 7: Reduced Complexity Clipping Algorithms 194 In all cases when the CL is set below the ACL more inband noise is produced as the noise is dominated by the baseband clipping (CL). When the CL is set above the ACL the clipping noise is dominated by the amplifier and levels off at CL=9dB for ACL=5dB and at CL=10dB ACL=8dB. When the ACL=CL=5dB the inband distortion is 20dB below the received signal and when ACL=CL=8dB the inband distortion is around 35dB below the receive signal. Little variation is seen in performance between Conventional and Vector Subtraction with the 64 tap filter having slightly worse performance in the conventional case. Figures 7.51 and 7.52 show the simulated results for the Clipping Error (CE) versus the inband distortion, or Mean Squared Error (MSE) for a constant ACL. The results are obtained by varying the CL and recording the two error sources (CE and MSE). When the CL is set larger than the ACL, all the clipping is performed by the amplifier and the most noise is produced as is evidenced by the top of the curves in Figures 7.51 and 7.52. The bottom part of the curves in Figures 7.51 and 7.52 show noise produced when the

CL is set smaller than the ACL, as a result clipping is mostly performed by the baseband clipper rather than the amplifier. The noise produced at the bottom of the curves is due to peak regrowth after baseband clipping saturating the amplifier. In the ACL=5dB case it is seen that at the top of the curves there is no discernable difference in performance between Conventional and Vector Subtraction, the number of taps also has a negligible effect on performance. At the bottom of the curves where the noise is due to peak regrowth there is a divergence in performance between Conventional and Vector Subtraction with Conventional clipping out performing Vector Subtraction by around 6dB from -55dB to -61dB in terms of the clipping noise. When the ACL is increased to 8dB the performance improves across the board as expected. Again Conventional clipping outperforms Vector Subtraction by at least 1dB, with Vector Subtraction displaying -64dB of noise and Conventional clipping displaying no noise below -65dB. The number of taps in the filter has a negligible effect. The amplifier CE is low enough for LDA with Vector Subtraction to meet the ACI specifications of most standards. Chapter 7: Reduced Complexity Clipping Algorithms 195
Figure 7.51: Clipping error vs. in-band distortion. Performance curves for LDA using Conventional clipping and Vector Subtraction with 2 iterations. Compensation filter has 64 and 128 taps. ACL=5dB. Figure 7.52: Clipping error vs. in-band distortion. Performance curves for LDA using Conventional clipping and Vector Subtraction with 2 iterations. Compensation filter has 64 and 128 taps. ACL=8dB.
-26 -24 -22 -20 -18 -16 -14 -12 -10 -70 -65 -60 -55 -50 -45 -40 -35 -30 -25 -20

CL > ACL
Conventional - 64 Vector Sub - 2 iter - 64 Conventional - 128 Vector Sub - 2 iter - 128 taps

CL = ACL = 5dB CL < ACL


-44 -42 -40 -38 -36 -34 -32 -30 -28 -65 -60 -55 -50 -45 -40 -35 Inband distortion to Rx signal

CL > ACL
Conventional - 64 Vector Sub - 2 iter - 64 Conventional - 128

Vector Sub - 2 iter 128 taps

CL = ACL = 8dB CL < ACL

Chapter 7: Reduced Complexity Clipping Algorithms 196

7.6 Conclusion
This chapter presented new low complexity clipping methods for OFDM. Initially a Monte Carlo OFDM transceiver simulation model was used to establish the basic parameters on which the new clipping algorithms were tested. Various parameters such as the mapping type, the amount of oversampling in the IFFT, the baseband clip level, the number of taps in the pulse shaping filter, and the HPA backoff were varied to quantify their affect on the BERF. In this chapter the BERF was plotted against the clipping level with the noise set to zero, this meant that all the errors produced were due to clipping noise alone. Good parameters for further simulation were found to be oversampling the IFFT by a factor 2, a RRCF with 128 filter taps for a rolloff factor of 0.15, and a SSPA with p=3. The effect of filtering after clipping was also studied, it was shown through simulation how peak regrowth was more severe the harder the signal is clipped in the baseband. The peak regrowth could be countered to a certain degree by oversampling the IFFT by a factor of 2, oversampling beyond this point provided little discernable improvement. QPSK mapping was shown to be extremely robust to clipping, therefore making hard clipping a viable alternative at lower data rates. In Section 7.2 Sector clipping was introduced as a low complexity alternative to conventional clipping. Theoretical analysis of Conventional, Sector, and Square clipping SNR was shown to be in good agreement with the simulated results. At higher clip levels the theoretical and simulated results diverged as the theoretical SNR used the assumption that the clipping noise was Gaussian in nature. This was shown to be untrue at higher clip levels where the clipping noise is more impulsive in nature, hence the divergence in results. Both 3 Sector and square clipping were implemented in silicon using a 0.5, 3 metal layer process. Reports generated by Synopsys and cadence proved that the complexity increase of the Sector clipping algorithm was small compared to Square clipping with the advantage of having much better SNR properties. Sector clipping was expanded to include more sectors thereby improving performance. 4 and 5 Sector clipping provided a marginal improvement over 3 sector in terms of the BERF with a minor increase in complexity. Chapter 7: Reduced Complexity Clipping Algorithms 197 Section 7.3 introduced another low complexity enhancement to an existing Lucent algorithm. An iterative method similar to the CORDIC algorithm to calculate the magnitude of a complex sample. The variation called Vector Subtraction further reduced complexity by the removal of a complex multiplication for the price of 2 extra additions and a comparison operation. Its performance was found to be as good as conventional clipping with only 2 iterations. Section 7.4 compared simulated BERF results of the Sector, Square, Lucent, Vector Subtraction, and Conventional clipping under various backoffs in the HPA. It was shown that while the Vector Subtraction and Lucent algorithms have better performance across the board, Sector clipping was more tolerant to harder backoff

levels in the HPA. The PSD results for conventional clipping at 0dB and 5dB at various HPA backoffs was shown. While the inband distortion remains in both clipping modes the ACI present at 0dB is up to 17dB below the signal power while ACI is almost non existent at 5dB baseband clipping. As most WCDMA and OFDM standards require an ACI of 20dB below the signal power it was shown that even with hard clipping the ACI specifications are met. For a baseband clipping system with 16 QAM mapping, an IFFT with an oversampling factor of 2, a pulse shaping RRCF with 128 taps and _=0.15, to maintain a BER of 10-4 clipping at 3.8 to 4dB above the average signal power with a HPA with backoff 2dB above the baseband clip level is recommended. In this situation the HPA must have a peak power some 6dB above the average signal power. Finally in Section 7.5 Vector Subtraction with 2 iterations was implemented into an existing adaptive clipping system which required the clipping operation to take place within the filter. A low latency magnitude estimate was therefore required. Simulated results showed that Vector Subtraction had similar performance to conventional clipping in this algorithm but with less latency and complexity. Chapter 8: Conclusion 198

Chapter 8 Conclusion
This thesis analysed and proposed new methods to deal with the PAPR in OFDM. Initially Chapter 2 introduced the theory and principles behind OFDM and detailed scenarios where it is used. Chapter 3 identified contributing factors to the PAPR, these were that peaks are a function of the IFFT operation, where in phase waveforms add to create a large peak. The general distribution of OFDM samples was shown to have a Rayleigh distribution. The number of subcarriers, and to a lesser degree the mapping type were also shown to contribute to the PAPR. Simulations of an OFDM system revealed how uncontrolled large peaks will saturate the HPA creating ACI and an increased BER at the receiver. Chapter 4 began the literature review with an analysis of non distorted PAPR reduction techniques which included coding techniques, PTS, SLM, and Tone Reservation /Insertion. Coding introduced the most redundancy and became extremely complex at a higher number of subcarriers (N>64), however later papers identified promising code sets such as second order Reed-Muller codes. PTS and SLM reduced the PAPR by producing a series of alternative transmit signals seeded from the same data source which are altered before the IFFT process so that they will have different PAPR properties. The waveform with the lowest PAR is chosen for transmission. These methods are complex and the amount of PAPR is not guaranteed. Tone Insertion/variation used peak reduction carriers which introduced redundancy and required some additional processing at the receiver. Chapter 5 proposed two new low complexity variations to PTS, called Cyclic Shifted Sequences (CSS) and Time Inversion (TI). CSS and TI were shown to be less complex than PTS when the number of phase rotations was greater than 4. Chapter 8: Conclusion

199 Combining shifts of CSS and TI with non complex phase rotations of standard PTS allowed a whole IFFT operation to be removed at the expense of some extra non complex operations. CSS and TI where shown to display less peak regrowth after pulse shaping filtering. Oversampling at the IFFT by a factor of 2 was shown to improve the performance of PTS, CSS, and TI bringing the discrete and filtered CCDF to within 1dB of each other. These methods provided reduction in the CCDF of between 2 and 3dB at Pr(_>_0)=10-4. Chapter 6 picked up the literature review again from Chapter 4 for distorted PAPR solutions. Distorted PAPR methods were defined here as methods which intentionally limit the excess peaks at the transmitter, usually in the baseband so that the HPA would not saturate. These papers revealed that a backoff of around 6dB is sufficient to maintain a respectable error rate for 4 and 16 QAM mapping. Other papers analysed the affect on the amplifier of saturation and the resultant in band and out of band distortion. Windowing was also examined which is the process of using a pulse shape to clip the signal and surrounding samples to give better spectrum properties. The gains in reduction in ACI were shown to be minimal when an acceptable amount of backoff in the HPA was used. Chapter 7 presented the next set of new PAPR solutions where a series of low complexity and low latency clipping algorithms were proposed. This chapter started with a detailed Monte Carlo simulation of an OFDM transceiver were different stages of the transmission chain were modified to ascertain their effect on the BERF. The motivation behind clipping as a solution to the PAPR was that with an acceptable HPA backoff of around 6dB, clipping is a very rare occurrence affecting the BER negligibly. The first method, Sector clipping uses a rule base to perform the clipping operation. It was implemented in silicon and shown to have negligible extra complexity to square clipping but with much better performance. The rule base was extended up to 5 sectors where performance was close to conventional clipping. The second method used a variation of a CORDIC like magnitude estimator and was called Vector Subtraction. The complexity was further reduced by the removal of a multiplication at the expense of 2 additions and comparisons. Vector Subtraction was shown to have identical performance to conventional clipping with only 2 iterations. Finally Vector Subtraction with 2 iterations was implemented in a previously Chapter 8: Conclusion 200 proposed clip and filter algorithm where its low latency was important as the magnitude had to be found quickly as it was operating in a feedback loop on filter samples.

8.1 Future Work


The PAPR problem in OFDM is still an ongoing issue, especially for portable devices where the need to minimise the power amplifier linear range is paramount. The PTS/CSS/TI methods developed in this thesis to reduce the PAPR can be combined with other PTS techniques such as adaptive PTS and variations of the blind SLM techniques to further reduce complexity and the peak power. A hybrid system utilizing clipping techniques could also be added as a last stage so as to have an upper bound for the PTS signal.

Further enhancements of Sector clipping can be made where the error introduced by the phase distortion could be minimised so that only the amplitude distortion remains. All of the clipping techniques detailed in this thesis could be combined with coding techniques to further improve the BER performance. Code sets for larger numbers of subcarriers are an open ended problem and there is much ongoing research in this area. Analysis of the proposed algorithms in a MIMO OFDM system is an area that has gained a lot of focus recently and analysis of the performance of the proposed techniques in such an environment would be valid. The LDA algorithm with Vector Subtraction could be implemented in a complete OFDM transceiver and then implemented in hardware via a FPGA or silicon to ascertain whether the latency requirements can be met with the wider bandwidth systems now being proposed (40 MHz for 802.11n, and up to 100 MHz for 4G LTE). Bibliography 201

Bibliography
[1] G. Hill, M. Faulkner, and J. Singh, "Reducing the peak-to-average power ratio in OFDM by cyclically shifting partial transmit sequences," Electronics Letters, vol. 36, pp. 560-561, Mar 16 2000. [2] G. Hill, M. Faulkner, and J. Singh, "Cyclic Shifting and Time Inversion of Partial Transmit Sequences to Reduce the Peak-to-Average Power Ratio in OFDM," in PIMRC, London, 2000. [3] G. Hill, "Comparison of Low complexity clipping algorithms for OFDM," in PIMRC, Portugal, 2002. [4] M. L. Doelz, E. T. Heald, and D. L. Martin, "Binary Data Transmission Techniques for Linear Systems," Proc IRE, vol. 45, pp. 656-661, May 1957. [5] R. W. Chang, "Synthesis of Band Limited Orthogonal Signals for Multichannel Data Transmission," Bell Syst. Tech. Jo., vol. 45, pp. 1,7751,796, Dec 1966. [6] B. R. Saltzberg, "Performance of an efficient parallel data transmission system," IEEE Transactions on Communication Technologies, vol. COM-15, Dec 1967 1967. [7] S. Weinstein and P. Ebert, "Data transmission by frequency division multiplexing using the discrete fourier transform," IEEE Transactions on Communication Techniques, vol. COM-19, pp. 628-634, October 1971. [8] L. J. Cimini Jr, Ye, L., "Orthogonal Frequency Division Multiplexing for Wireless Channels," in IEEE Global Telecommunications Conference GLOBECOM, Sydney, Australia, 1998, p. 82. [9] C. H. Chang, C. L. Wang, and Y. T. Chang, "Efficient VLSI architectures for fast computation of the discrete fourier transform and its inverse," IEEE Transactions on Signal Processing, vol. 48, pp. 3206-3216, Nov 2000. [10] Y. Jung, Y. Tak, J. Kim, J. Park, D. Kim, and H. Park, "Efficient FFT algorithm for OFDM modulation," in Electrical and Electronic Technology, 2001. TENCON. Proceedings of IEEE Region 10 International Conference on, 2001, pp. 676-678 vol.2.

[11] A. Sadat and W. B. Mikhael, "Fast Fourier Transform for high speed OFDM wireless multimedia system," in Circuits and Systems, 2001. MWSCAS 2001. Proceedings of the 44th IEEE 2001 Midwest Symposium on, 2001, pp. 938942 vol.2. [12] A. D. S. Jayalath, "OFDM for Wireless Broadband Communications (Peak Power Reduction, Spectrum and Coding)," in School of Computer Science and Software Engineering Melbourne: Monash University, 2002, p. 257. [13] L. Hanzo, Webb, W., Keller, T., Single and Multi-carrier Quadrature Amplitude Modulation. Chichester: Wiley, 2000. [14] O. Edfors, Sandell, M., van de Beek, J.J., Landstrm, D., Sjberg, "An Introduction to Orthogonal Frequency Division Multiplexing," Division of Bibliography 202 Signal Processing, Lule University of Technology, Research Report TULEA 1996:16 1996. [15] R. Morrison, L. J. Cimini, Jr., and S. K. Wilson, "On the use of a cyclic extension in OFDM," in Vehicular Technology Conference, 2001. VTC 2001 Fall. IEEE VTS 54th, 2001, pp. 664-668 vol.2. [16] T. S. Rappaport, Wireless Communications, Principles and Practice. New Jersey: Prentice Hall, 1996. [17] S. Coleri, M. Ergen, A. Puri, and A. Bahai, "Channel estimation techniques based on pilot arrangement in OFDM systems," Broadcasting, IEEE Transactions on, vol. 48, pp. 223-229, 2002. [18] M. Morelli and U. Mengali, "A comparison of pilot-aided channel estimation methods for OFDM systems," IEEE Transactions on Signal Processing, vol. 49, pp. 3065-3073, Dec 2001. [19] W. G. Jeon, K. H. Paik, and Y. S. Cho, "Two-dimensional pilot-symbol-aided channel estimation for OFDM systems with transmitter diversity," IEICE Transactions on Communications, vol. E85B, pp. 840-844, Apr 2002. [20] J. Shentu and J. Armstrong, "Blind frequency offset estimation for PCCOFDM with symbols overlapped in the time domain," in Circuits and Systems, 2001. ISCAS 2001. The 2001 IEEE International Symposium on, 2001, pp. 570-573 vol. 4. [21] F. Tufvesson, "Design of Wireless Communication Systems - Issues on Synchronization, Channel Estimation and Multi-Carrier Systems," in Department of Applied Electronics Lund: Lund University, 2000. [22] M. Faulkner, "OFDM: Overview," Melbourne, 2002. [23] A. G. Armada, "Understanding the effects of phase noise in orthogonal frequency division multiplexing (OFDM)," IEEE Transactions on Broadcasting, vol. 47, pp. 153-159, Jun 2001. [24] T. Pollet, M. Vanbladel, and M. Moeneclaey, "Ber sensitivity of ofdm systems to carrier frequency offset and wiener phase noise," IEEE Transactions on Communications, vol. 43, pp. 191-193, 1995. [25] ETSI, "Radio Broadcast Systems: Digital Audio Broadcasting (DAB) to mobile, portable and fixed receivers," ETSI final draft ETS 300 401, Nov 1994.

[26] P. Shelswell, "The COFDM modulation system: The heart of digital audio broadcasting," Electronic and Communication Engineering Journal, vol. 7, pp. 127-135, June 1995. [27] J. Stott, "The DVB terrestial (DVB-T) specification and its implementation in a practical modem," International Broadcasting Convention, pp. 255-260, 1996. [28] ETSI, "Digital Video Broadcasting (DVB); framing structure, channel coding, and modulation for digital terrestial television (DVB-T)," European Telecommunications Standard Institute ETSI EN 300 744, Nov 1999. [29] ISO/IEC, "Information technology - generic coding of moving pictures and associated audio information: Part 2 video," ISO/IEC 13818, 2000. [30] J. Tellado-Mourelo, "Peak to Average Power Reduction for Multicarrier Modulation," in Department of Electrical Engineering: Stanford University, 1999. [31] H. Bolckei, D. Gesbert, and A. J. Paulraj, "On the capacity of OFDM-based spatial multiplexing systems," Communications, IEEE Transactions on, vol. 50, pp. 225-234, 2002. Bibliography 203 [32] S. H. Mller, R. W. Bauml, R. F. H. Fischer, and J. B. Huber, "Ofdm with reduced peak-to-average power ratio by multiple signal representation," Annales des Telecommunications Annals of Telecommunications, vol. 52, pp. 58-67, 1997. [33] C. Annis, "Central Limit Theorem, http://www.statisticalengineering.com/central_limit_theorem.htm," 2004. [34] C. Tellambura, "Phase optimisation criterion for reducing peak-to-average power ratio in ofdm," Electronics Letters, vol. 34, pp. 169-170, 1998. [35] H. Ochiai and H. Imai, "On the distribution of the peak-to-average power ratio in OFDM signals," IEEE Transactions on Communications, vol. 49, pp. 282289, Feb 2001. [36] M. Sharif, M. Gharavi-Alkhansari, and B. H. Khalaj, "On the peak-to-average power of ofdm signals based on oversampling," Communications, IEEE Transactions on, vol. 51, pp. 72-78, 2003. [37] R. v. Nee, OFDM for wireless multimedia communications. Boston ; London: Artech House, 2000. [38] G. Wunder and H. Boche, "Upper bounds on the statistical distribution of the crest-factor in ofdm transmission," Information Theory, IEEE Transactions on, vol. 49, pp. 488-494, 2003. [39] S. Mller-Weinfurtner, "OFDM for Wireless Communications: Nyquist Windowing, Peak-Power Reduction, and Synchronization," in Reihe Kommunikations- und Informationstechnik Nrnberg: Universitt Erlangen, 2000. [40] E. Costa, M. Midrio, and S. Pupolin, "Impact of amplifier nonlinearities on OFDM transmission system performance," Communications Letters, vol. 3, pp. 37-39, 1999. [41] E. Costa and S. Pupolin, "M-QAM-OFDM system performance in the

presence of a nonlinear amplifier and phase noise," IEEE Transactions on Communications, vol. 50, pp. 462-472, Mar 2002. [42] C. L. Liu, "The effect of nonlinearity on a qpsk-ofdm-qam signal," IEEE Transactions on Consumer Electronics, vol. 43, pp. 443-447, 1997. [43] S. Merchan, A. G. Armada, and J. L. Garcia, "Ofdm performance in amplifier nonlinearity," IEEE Transactions on Broadcasting, vol. 44, pp. 106-114, 1998. [44] G. T. Zhou and J. S. Kenney, "Predicting spectral regrowth of nonlinear power amplifiers," Communications, IEEE Transactions on, vol. 50, pp. 718-722, 2002. [45] G. Santella and F. Mazzenga, "A hybrid analytical-simulation procedure for performance evaluation in m-qam-ofdm schemes in presence of nonlinear distortions," IEEE Transactions on Vehicular Technology, vol. 47, pp. 142151, 1998. [46] X. Li and L. J. Cimini, "Effects of clipping and filtering in the performance of OFDM," IEEE Communications Letters, vol. 2, pp. 131-133, May 1998. [47] C. van den Bos, M. H. L. Kouwenhoven, and W. A. Serdijn, "Effect of smooth nonlinear distortion on OFDM symbol error rate," IEEE Transactions on Communications, vol. 49, pp. 1510-1514, Sep 2001. [48] M. Friese, "On the degradation of OFDM signals due to peak clipping in optimally predistorted power amplifiers," in Globecom, Sydney, Australia, 1998. Bibliography 204 [49] A. E. Jones, T. A. Wilkinson, and S. K. Barton, "Block coding scheme for reduction of peak to mean envelope power ratio of multicarrier transmission schemes," Electronics Letters, vol. 30, pp. 2098-2099, 1994. [50] T. A. Wilkinson and A. E. Jones, "Minimisation of the Peak to Mean Envelope Power Ratio of Multicarrier Transmission Schemes by Block Coding," in VTC, Chicago, USA, 1995. [51] A. E. Jones, "Combined Coding for Error Control and Increased Robusteness to System Nonlinearities in OFDM," in IEEE 46th VTC, Atlanta, USA, 1996, pp. pp 904-908. [52] V. Tarokh and H. Jafarkhani, "An Algorithm for Reducing the Peak to Average Power Ratio in a Multicarrier Communication System," in IEEE VTC, Piscataway, NJ, USA, 1999, pp. pp 680-684. [53] ETSI, "Broadband Radio Access Networks (BRAN); HIPERLAN Type 2; Physical (PHY) layer," European Telecommunications Standards Institute ETSI TS 101 475 v1.1.1, 2000. [54] V. Tarokh and H. Jafarkhani, "On the computation and reduction of the peaktoaverage power ratio in multicarrier communications," Communications, IEEE Transactions on, vol. 48, pp. 37-44, 2000. [55] P. W. J. VanEetvelt, S. J. Shepherd, and S. K. Barton, "The Distribution of Peak Factor in QPSK Multi-Carrier Modulation," in Wireless Communications Netherlands: Kluwer Academic Publishers, 1995, pp. 87-95. [56] S. Shepherd, J. Orriss, and S. Barton, "Asymptotic limits in peak envelope

power reduction by redundant coding in orthogonal frequency-division multiplex modulation," IEEE Transactions on Communications, vol. 46, pp. 510, 1998. [57] D. Wulich, "Peak Factor in Orthogonal Multicarrier Modulation with Variable Levels," Electronic Letters, vol. 32, pp. 1859-1861, September 1996. [58] J. E. M. Nilsson, "Coding to control the signal waveform in M-ary PSK multicarrier communications," in Radio Vetenskap och Kommunikation '96 (RVK), Lule, Sweden, 1996, pp. pp 658-662. [59] K. G. Paterson, "Bounds on Envelope Power of Trace Codes for OFDM," Hewlett Packard Laboratories, Bristol HPL-98-176, October 1998. [60] C. Tellambura, "Upper bound on peak factor on n-multiple carriers," Electronics Letters, vol. 33, pp. 1608-1609, 1997. [61] C. Wulich, "Reduction of Peak to Mean Ratio of Multicarrier Modulation using Cyclic Coding," Electronic Letters, vol. 32, pp. 432-433, February 1996. [62] Y. Y. Zhang, A.; Chouinard, J.-Y.; Zhang, L., "OFDM peak power reduction by sub-block-coding and its extended versions," in Vehicular Technology Conference, 1999 IEEE 49th, 1999, pp. 695-699 vol.1. [63] S. Boyd, "Multitone signals with low crest factor," Circuits and Systems, IEEE Transactions on, vol. 33, pp. 1018-1022, 1986. [64] S. J. V. E. Shepherd, P.W.J.; Wyatt-Millington, C.W.; Barton, S.K., "Simple coding scheme to reduce peak factor in QPSK multicarrier modulation," in Electronics Letters. vol. 31, 1995, pp. 1131-1132. [65] R. D. J. van Nee, "OFDM codes for peak-to-average power reduction and error correction," in Global Telecommunications Conference, 1996. GLOBECOM '96. 'Communications: The Key to Global Prosperity, 1996, pp. 740-744 vol.1. Bibliography 205 [66] J. A. Davis and J. Jedwab, "Peak-to-mean power control and error correction for ofdm transmission using golay sequences and reed-muller codes," Electronics Letters, vol. 33, pp. 267-268, 1997. [67] J. A. Davis and J. Jedwab, "Peak-to-mean power control in OFDM, Golay complementary sequences, and Reed-Muller codes," IEEE Transactions on Information Theory, vol. 45, pp. 2397-2417, 1999. [68] A. E. Jones and T. A. Wilkinson, "Performance of Reed-Muller codes and a maximum-likelihood decoding algorithm for OFDM," IEEE Transactions on Communications, vol. 47, pp. 949-952, 1999. [69] A. G. Armada, V. Gil, and J. L. Garcia, "Improving Peak to Average power Ratio (PAR) and Probability of Error in OFDM-based WLAN," in IST Summit, United Kingdom, 2001. [70] A. J. Grant and R. van Nee, "Efficient maximum likelihood decoding of peak power limiting codes for OFDM," in Vehicular Technology Conference, 1998. VTC 98. 48th IEEE, 1998, pp. 2081-2084 vol.3. [71] I. Ochiai, M. P. C. Fossorier, and H. Imai, "On decoding of block codes with peak-power reduction in OFDM systems," IEEE Communications Letters, vol. 4, pp. 226-228, 2000.

[72] B. S. Tarokh, H.R., "Construction of ofdm m -qam sequences with low peaktoaverage power ratio," Communications, IEEE Transactions on, vol. 51, pp. 25-28, 2003. [73] K. Sathananthan and C. Tellambura, "Coding to reduce both par and picr of an ofdm signal," IEEE Communications Letters, vol. 6, pp. 316-318, 2002. [74] S. H. Mller and J. B. Huber, "A comparison of Peak Power Reduction Schemes for OFDM," in Globecom '97, Phoenix, USA, 1997. [75] S. H. Mller and J. B. Huber, "Ofdm with reduced peak-to-average power ratio by optimum combination of partial transmit sequences," Electronics Letters, vol. 33, pp. 368-369, 1997. [76] S. H. Mller and J. B. Huber, "A novel peak power reduction scheme for OFDM," in IEEE Personal Indoor Mobile Radio Communications, Helsinki, Finland, 1997, pp. 1090-1094. [77] S. H. Mller and J. B. Huber, "OFDM with Reduced Crest Factor," ETSI BRAN - European Telecommunications Standards Institute Broadband Radio Access Network, Ulm, Germany WG3 temporary document 25, 16 September 1997. [78] S. G. Kang, J. G. Kim, and E. K. Joo, "A novel subblock partition scheme for partial transmit sequence OFDM," IEEE Transactions on Broadcasting, vol. 45, pp. 333-338, 1999. [79] L. J. Cimini and N. R. Sollenberger, "Peak-to-average power ratio reduction of an OFDM signal using partial transmit sequences," in ICC'99, 1999. [80] L. J. Cimini and N. R. Sollenberger, "Peak-to-average power ratio reduction of an OFDM signal using partial transmit sequences," Communications Letters, vol. 4, pp. 86-88, Mar 2000. [81] C. Tellambura and A. D. S. Jayalath, "PAR reduction of an OFDM signal using partial transmit sequences," in Vehicular Technology Conference, 2001. VTC 2001 Fall. IEEE VTS 54th, 2001, pp. 465-469 vol.1. [82] H. Chen and G. J. Pottie, "An orthogonal projection-based approach for PAR reduction in OFDM," Communications Letters, vol. 6, pp. 169-171, May 2002. Bibliography 206 [83] R. W. Bauml, R. F. H. Fischer, and J. B. Huber, "Reducing the peak-toaverage power ratio of multicarrier modulation by selected mapping," Electronic Letters, vol. 32, pp. 2056-2057, 1996. [84] H. Breiling, S. H. Muller-Weinfurtner, and J. B. Huber, "SLM peak-power reduction without explicit side information," IEEE Communications Letters, vol. 5, pp. 239-241, 2001. [85] A. D. S. T. Jayalath, C., "A blind slm receiver for par-reduced ofdm," in Vehicular Technology Conference, 2002. Proceedings. VTC 2002-Fall. 2002 IEEE 56th, 2002, pp. 219-222. [86] J. Tellado and J. Cioffi, "Peak Power Reduction of Multicarrier Transmission," in Globecom 1998, Sydney, Australia, 1998. [87] R. Dinis and A. Gusmao, "On the performance evaluation of OFDM transmission using clipping techniques," in VTC'99, 1999. [88] H. Nikopour and S. H. Jamali, "Effects of Cartesian Clipping Noise on the

Performance of Orthogonal Frequency Division Multiplexing System: A Theoretical Approach," in IEEE GLOBECOM 2002, 2002, pp. 1254-1258. [89] A. R. S. Bahai, M. Singh, A. J. Goldsmith, and B. R. Saltzberg, "A new approach for evaluating clipping distortion in multicarrier systems," IEEE Journal on Selected Areas in Communications, vol. 20, pp. 1037-1046, Jun 2002. [90] G. Wunder and H. Boche, "The Impact of Nonlinear Devices on the Symbol Error Rate in Broadband OFDM Transmission,"?, vol.?, 2002. [91] T. May and H. Rohling, "Reducing the Peak to Average Power Ratio in OFDM Radio Transmission Systems," in VTC '98, ?, 1998, pp. 2474-2478. [92] J. Armstrong, "Peak-to-average power reduction for OFDM by repeated clipping and frequency domain filtering," Electronics Letters, vol. 38, pp. 246247, Feb 28 2002. [93] D. Kim and G. L. Stuber, "Clipping noise mitigation for OFDM by decisionaided reconstruction," Communications Letters, vol. 3, pp. 4-6, 1999. [94] D. J. G. Mestdagh, P. M. P. Spruyt, and B. Biran, "Effect of Amplitude Clipping in DMT-ADSL Transceivers," Electronic Letters, vol. 29, pp. 13541355, 22nd July 1993. [95] D. J. G. Mestdagh and P. M. P. Spruyt, "A Method to Reduce the Probability of Clipping in DMT-Based Transceivers," IEEE Transactions on Communications, vol. 44, pp. 1234-1238, October 1996. [96] B. Come, R. Ness, S. Donnay, L. V. d. Perre, W. Eberle, P. Wambacq, M. Engels, and I. Bolsens, "Impact of Front-End Non-Idealities on Bit Error Rate of WLAN-OFDM Transceivers," in Microwave Journal, 2001, pp. 126-140. [97] J. Armstrong, H. A. Suraweera, S. Brewer, and R. Slaviero, "Effect of Rouunding and Saturation in Fized-Point DSP Implementation of IFFT and FFT for OFDM Applications," in The Embedded Signal Processing Conference (GSPx 2004), Santa Clara, CA, USA, 2004. [98] S. Merchan, A. G. Armada, and J. L. Garcia, "OFDM performance in amplifier nonlinearity," Broadcasting, IEEE Transactions on, vol. 44, pp. 106114, 1998. [99] R. v. Nee and A. d. Wild, "Reducing the Peak-to-Average Power Ratio of OFDM," in Vehicular Technology Conference, Ottawa, Canada, 1998, pp. 2072-2076. [100] S. B. Slimane, "Peak-to-average power ratio reduction of OFDM signals using broadband pulse shaping," in IEEE VTC '02, Canada, 2002, pp. 889-893. Bibliography 207 [101] R. S. Creighton, "System and method to reduce the peak-to-average power ratio in DS-CDMA transmitter," in European Patent Application Great Britain: Lucent Technologies (UK), 1999. [102] J. E. Volder, "The CORDIC Trigonmetric Computing Technique," IRE Trans. Electronic Computing, vol. 8, pp. 330-334, Sept 1959. [103] R. Berangi and M. Faulkner, "Peak Power Reduction for Multi-Code CDMA and Critically Sampled Complex Gaussian Signals," in IST 2001, Iran, 2001.

Anda mungkin juga menyukai