Anda di halaman 1dari 23

Sinhgad Technical Education Societys

SINHGAD ACADEMY OF ENGINEERING


Department of Electronics and Telecommunication. PG Students 2012-13

Seminar On

Marathi Language Speech Synthesizer Using Concatenative Synthesis Strategy


Submitted By Mr. Sawane Padmakar D. Guided by PROF. Patil M.M.

Outline

Introduction Speech Synthesis Process


-- Feature of Spartan-6 -- Power Supplies -- Spartan-6 Board Power Connections -- FPGA Interfacing to PS/2 keyboard. --Electrical interface --Protocol for receiving from the keyboard --The keyboard scan-codes -- Audio (AC-97)

Project work Hardware Design

Software Xilinx 13.4 --Why Xilinx -- RTL Schematic -- Test Bench simulation

Implementation Conclusion References

data

Introduction

Speech is one of the most vital forms of communication in everyday life.

Human computer interaction but still rural communities lag behind technologies.

One good solution for this problem would be computers talking to the common man in the language he is comfortable to communicate in.

Speech Synthesis Process

Project work divided in Two Part


1. 2. Hardware Selection & Design Software Module:PS2 Keyboard interface with FPGA. Concatenation & Marathi word searching in database. Audio voice of Marathi word which is input from PS2 keyboard.

Fig 2 The speech synthesis system.

Hardware Design

Features of Spartan-6
Xilinx Spartan-6 LX45 FPGA, 324-pin BGA package 128Mbyte DDR2 with 16-bit wide data 10/100/1000 Ethernet PHY on-board USB2 ports for programming and data transfer USB-UART and USB-HID port (for mouse/keyboard) two HDMI video input ports and two HDMI output ports AC-97 Codec with line-in, line-out, mic, and headphone real-time power monitors on all power rails 16Mbyte x4 SPI Flash for configuration and data storage 100MHz CMOS oscillator 48 I/Os routed to expansion connectors GPIO includes eight LEDs, six buttons, and eight slide switches

Power Supplies
The Atlys board requires an external 5V, 4A.
Atlys Power Supplies Supply Circuits FPGA I/O, video, USB ports, clocks, ROM, audio FPGA aux, VHDC, Ethernet PHY I/O, GPIO FPGA core, Ethernet PHY core DDR & FPGA DDR I/O DDR termination voltage (VTT) Device Amps (max/typ)

3.3 V

IC16: LT3501

3A / 900mA

2.5 V 1.2 V 1.8 V 0.9 V

IC15: LTC3546 IC15: LTC3546 IC16: LT3501 IC14: LTC3413

1A / 400mA 3A / 0.8 1.8A 3A / 0.5 -- 1.2A 3A / 900mA

Spartan-6 Board Power Connections

Fig 3 Spartan-6 Board Power Connections

FPGA Interfacing to PS/2 keyboard.

Goals: Attach a keyboard to the Spartan 6 device through PS/2 type of port; Design a logic which can detect a character. Use VHDL during the development process.

Electrical interface
FPGA read PS2 Data_Out & PS2 Clk_out = 0 So tristate buffer= high impedance mode FPGA write Logic 0:PS2 Data_Out & PS2 Clk_out = 1 Which pulls line low Logic 1:PS2 Data_Out & PS2 Clk_out = 0 Which pulls line high

Figure 4 . FPGA/Keyboard Interface

Protocol for receiving data from the keyboard

Data is received from the keyboard

Clock Data

Figure 5. PS/2 Protocol

The keyboard scan-codes

The keyboard sends packets of data, scan codes, to the host indicating which key has been pressed. When a key is pressed or held down a make code is transmitted. When a key is released a break code is transmitted. Every key is assigned a unique make and break code so that the host can determine exactly what has happened.

Scan Code Set


Key A B Make 1C 32 Break F0,1C F0,32 Key N O Make 31 44 Break F0,31 FO,44

C
D E F G H I J K L M

21
23 24 2B 34 33 43 3B 42 4B 3A

F0,21
F0,23 F0,24 F0,2B F0,34 F0,33 F0,43 F0,3B F0,42 F0,4B F0,3A

P
Q R S T U V W X Y Z

4D
15 2D 1B 2C 3C 2A 1D 22 35 1A

F0,4D
F0,15 F0,2D F0,1B F0,2C F0,3C F0,2A F0,1D F0,22 F0,35 F0,1A

Audio codec (AC-97)

Fig 6 Audio codec (AC-97)

Audio Codec LM4550


The Atlys board includes a National Semiconductor LM4550 AC 97 audio codec with four audio jacks for line-out, headphone-out, line-in, and microphone-in . Audio data at up to 18 bits and 48KHz sampling is

supported, and the audio in (record) and audio out


(playback) sampling rates can be different. The microphone jack is mono, all other jacks are stereo.

The headphone jack is driven by the audio codec's internal


50mW amplifier.

Software Xilinx 13.4


XILINX TARGETED DESIGN FORMAT ISE DESIGN SUITE SPARTAN-6 & VERTEX-6 FPGA FAMILIES XILINX IP CORE SOLUTION

RTL SCHEMATIC

Test Bench simulation of ps2interface

Implementation
The hardware layout proposed earlier is implemented on a Xilinx
Spartan-6 LX45 FPGA. Both the FSM and hardware layout are synthesized using the Xilinx ISE 13.4 VHDL Compiler.

Conclusion
In this paper we discussed the issues relevant to the development of text to speech systems for Marathi language. The Atlys circuit board is a complete, ready-

to-use digital circuit development platform based on a


Xilinx Spartan-6 LX45 FPGA, speed grade -3. Simulation in Xilinx 13.4 of keyboard interface with Spartan 6 using VHDL.

References

Marathi Language Speech Synthesizer Using Concatenative Synthesis Strategy (Spoken in Maharashtra, India) 2010 IEEE; Atlys Board Reference Manual Thesis of FPGA-based implementation of concatenative speech synthesis algorithm by Praveen Kumar Bamini VHDL: Programming by Example Douglas L. Perry Fourth Edition McGraw-Hill E. Veera Raghavendra, Srinivas Desai, B. Yegnanarayana, Alan W Black, Kishore Prahallad, Experiments on Unit Size for Unit selection Speech Synthesis, Blizzard 2008. Jerneja Zganec Gros and Mario Zganec, An Efficient Unitselection Method for Concatenative Text-to-speech Synthesis Systems, Journal of Computing and Information Technology- CIT 16, 2008,1, 69-78 doi:10.2498/cit.1001049 Aniruddha Sen, Speech Synthesis in India, IETE Technical Review, Vol 24, No 5, Sep-Oct 2007, pp 343-350. S. P. Kawachale and J. S. Chitode, An Optimized Soft Cutting Approach to Derive Syllables from Words in Text to Speech Synthesizer, in proceedings Signal and Image Processing, 2006, pp 534. S. P. Kishore and A. W. Black, Unit size in Unit selection Speech Synthesis, Proceedings of EUROSPEECH, Geneva, Switzerland, 2003. P. Prathibha, A.G. Ramakrishna, R. Muralishankar, Thirukkural II- A text-to-Speech Synthesis System Edition, 2003 S. P. Kishore, Rohit Kumar, and Rajeev Sangal, A data driven synthesis approach for Indian Languages using syllable as basic unit, in Proceedings of International Conference on Listeners National Language Processing (ICON), 2002. Eric Lewis and Mark Tatham, Word and Syllable Concatenation in Text to Speech Synthesis, Proceedings of sixth European Conference of Speech Communication and Male E 77 23 Female F 84 16 Technology, pp615-618, ESCAs, 1999. T. Dutoit, An Introduction to Text-to-Speech Synthesis, Kluwer Academic Publishers, Dordrecht, 320 pp., ISBN 0-79230254498-7. 1997. Paul Taylor, Text-to-Speech Synthesis University of Cambridge

THANK YOU

Anda mungkin juga menyukai