Anda di halaman 1dari 12

Synopsis on Parllelisation of tesseract OCR

By Gaurav Parashar June, 2011

Contents
1 Present State of Knowledge 1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Denition & Framework . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3 Research Goals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Objectives 3 Preliminary Work Done 4 Proposed Research Design, Tools and Methodology 5 Tentative Detailed Chapter Plan Bibliography 3 3 5 6 7 8 9 10 11

1
1.1

Present State of Knowledge


Introduction

According to recent statistics, in a world population of around 6,845,609,960, the Internet users number 1,966,514,816 [13]. This means that only 28.7% of total population uses the Internet. Now let us see where India stands. Taking the cue from the same site there is a population of 1,173,108,018 in India of which Internet users number 81,000,000 [14]. Therefore, only around 6.09% of total population are using Internet. Here lies a question: if we are not able to provide QoS to this 6.09% of the Indian population then what will we do if the gure increases? There are some more questions that have cropped up in this researchers mind. The pertinent questions are: Is there really a lack of quality in services oered? If so, why? Specically, is it due to policy reasons or something else? What means can be devised to oer better QoS and customer satisfaction? Adequate research is needed in this area to design and develop an automated process which ensures that when there is a drop below a certain threshold, then the server automatically sets the QoS at a desired acceptable rate. Lets see some more issues with the upcoming technology. 3G has been launched and the rate at which the data is transferred is extremely fast. It means that the problem which this researcher faced earlier is now solved. In the opinion of researcher it isnt. Lets see why. In a country where we are getting 2GB in Rs 200 will a customer pay for a 3G connection of about Rs 1000 per GB? At this rate a customer will consume his limit even before knowing it. A faster connection means that one uses up data faster. 1 GB on EDGE is as good as unlimited usage. On 3G, it is 10 hours or less [22]. He will again get dissatised. Service provider must take into consideration the customers perception of technology, his needs, and his pocket as well. According to a PC Quest article [15], quality of after-sales service is still the biggest challenge in India. The article states that, on an average, 2Mbps is the bandwidth used by corporate India. The same article states that broadband speed is not enough. If bandwidth is an issue the why not go for the other alternative, i.e., content adaptation? Content adaptation is a process that is based on factors like capabilities of the displaying device on network, or the users preferences, and adapts the content that has been requested to provide an optimized user experience. [9]. Mobile video content is becoming sizeable, with resolution compatible with broadband. It is critical that a video that is for broadband must be portable to mobile [11]. Content adaptation is a problem which had fascinated the researcher because of the challenges researcher had faced in his real life. So researcher will now turn his entire focus on content adaptation so that those 6.9 % of Internet users who use Internet today get value for money. 3

Whenever there is a multimedia rich website, or a movie, or a podcast or a live sport event, we all face a problem in viewing/hearing it. We have seen the jitter problem in podcast, delay in live streaming videos [20], and delay in loading of multimedia-rich website on personal computers. If one views in his PDA or mobile phone then this becomes frustrating because one never instantly gets full information, which one seeks. Chang [16] addressed real-time content based adaptive streaming of sports videos, Hsiao [18] addressed content-aware adaptation under low bit-rate constraint and Tavanapong [24] addressed characteristics-based bandwidth reduction technique. These techniques were used to minimize the content in the videos so that the less needed information can be discarded and focus is on more important details. According to research done by Acision and YouGov [12], almost 84% of UK mobile broadband users face QoS issues. Slow speeds(67%), poor network coverage(49%), inability to get connected (45%) and connection loss(40%) are some of the issues users face despite of the popularity of the service. As per Van Zanen (Sr. Vice president Marketing), Content optimization is a second area where operators can improve user experience. While video is becoming increasingly popular, with over a third of consumers questioned (36%) accessing the data-hungry video sites via their mobile connection, the research highlighted that the quality of video services across mobile broadband performed particularly badly on a frequent basis. Research also highlighted that among those consumers accessing video services on their mobile phones, over half (63%) of the consumers experienced frequent pauses and, as many as 55% of video users experienced these problems on a regular basis. With video playback causing problems to consumers, 48% agreed that they would be happy if their mobile operator applied content adaptation policies such as reducing the size of videos being watched, provided that ensured the video would play back without stalling/buering, and they were not able to see a dierence in the video being played. Same were the issues that had been addressed by Mr. Ganesh Narayana Murthy , a researcher at IIT Bombay. He has done a study on Element based adaptation of lecture videos to mobile devices [21]. His study focused on how study material can be adapted and viewed on mobile devices and PDAs, where screen size is small and the content had to be scaled down [19]. The motivation of this researcher goes around those ideas and future work of collaborative tagging, content region identication and conversion to text. In the course of this research dierent video codec formats will also be studied, those will be used for transcoding [25] like MPEG-1 [5], MPEG-2 [8], H.264 [23] and MPEG-4 [6]. Transcoding is a technique through which a video is changed from one format to another format. During the entire course of research, researcher will also maintain his focus on user perception of the service as mentioned by Hardy [17] and to provide better QoS, to achieve user satisfaction. With the result of the research we can clearly see that content adaptation is one possible remedy for QoS under conditions of low bandwidth.

1.2

Denition & Framework

Denition:- Quality of Service(QoS):The collective eect of service performance which determine the degree of satisfaction of a user of a service. ETSI [10] QoS is also being considered as a result of service performance to the satisfaction of user. QoS encompasses network components instrumental in providing minimum level if assurance and end user satisfaction. Factors of QoS for Cellular Networks: Packet Drop Low Throughput Jitter Latency Packet Drop: It is the number of packets dropped because either routers were busy or device was busy, causing delays in reception of data. Low throughput: Due to sharing of resources by dierent users in a network, the resource crunch can happen. This results in reduction of bit rates and hence delay. Jitter: Packets may reach destination with varying delays, this delay leads to choppy sound and video. Latency: It might take long time for a packet to reach its destination. The researcher will attempt to discuss issues in cellular networks which lead to choppy video and sound and we have discussed it in this section. Researcher will attempt to discuss these issues and try to provide multimedia contents to network end user by content adaptation and improvement in QoS factors.

1.3

Research Goals

The proposed study would include the study of dierent content adaptation techniques. The study would also include design, development and testing of the content adaptation model for cellular devices on the basis of the study done during the research period. The objective of the proposed study is bring out a model that will overcome major weaknesses of the methods currently used for content adaptation and extend the work done by Murthy [21]. The proposed model will be designed on the basis of the study conducted by other researchers in the eld. Redundancy is a parameter which exists in mostly all the multimedia les. The purpose of the study will also include devising a method for exploiting this parameter and using video transcoding, that will make the video small in size. In the light of the above, researcher will attempt to focus on the following study objectives:1. To measure end-user satisfaction with regards to content adaptation. 2. To improve current tagging methodology of Collaborative Tagging1 . 3. To identify the content region in a video, which is text in video, so that subject matter in the video can be shown with greater legibility. 4. To convert the study element 2 in video to text, which is easier to display in a cellular phone. 5. To improve the factors of QoS. In order to improve QoS researcher will focus on objective 1 to 4 and the QoS will be improved as a consequence.
It is a method of allowing viewers to tag the study elements in a video. A lecture video may contain slides, video of instructor, instructor explaining on a white board or on a piece of paper. These elements in a lecture video are termed as study element.
2 1

Objectives
1. To measure end-user satisfaction with regard to content adaptation. 2. To improve current tagging methodology of Collaborative Tagging. 3. To identify the content region in a video, which is text in video, so that subject matter in the video can be shown with greater legibility. 4. To convert the study element in video to text, which is easier to display in a cellular phone. 5. To improve the factors of QoS.

As discussed in section 1.3 the objective of the research will be:-

All above objectives have to match to the end-user satisfaction. Researcher will devise a method to get feedback for calculating end-user satisfaction level.

Preliminary Work Done

Some literature and reports have been reviewed and the main ndings have been mentioned under Present State of Knowledge. It has been established that content adaptation based model have performed well in most of the cases concerning cellular devices. A brief list of relevant books, thesis, and articles consulted has been given at the end in the form of references and bibliography.

Proposed Research Design, Tools and Methodology

Several tools and techniques will be used for carrying out the research. Some of them have been listed under. Researcher did not list down all the tools since it will become evident from the research as it proceeds that what all new innovative tools and techniques will be used to carry out the research. Tools 1. Simulator tool:- appropriate tool for the simulation of a cellular network. 2. Statistical tool:- SPSS or R [7] for carrying out all the statistical calculation. 3. Graph tool:- gnuplot [2] or R for plotting graphs. 4. Documentation tool:- Latex [4] for documentation. 5. Scripting tool:- C / C++ / PHP / Perl / Python / J2ME [3] for developing scripts for automation of the process for carrying out tests. 6. Transcoding:- mpeg [1] / any other transcoder. 7. Cellular Phone or Simulator: Any cellular phone or simulator which supports GPRS and video play. 8. Any other tool which may seem relevant to the cause. Methodology The research will be carried out by studying the relevant literature concerning the problem and appropriate experimentation will be carried out to test the researchers views. Results will be compiled and will be presented as research papers.

Tentative Detailed Chapter Plan


1. Motivation. 2. Problem Statement. 3. Video adaptation and its limitations. 4. Proposed Adaptation Methodology 5. Organization of Thesis.

Chapter 01 Introduction

Chapter 02 Video Adaptation Methodologies 1. Study of dierent video adaptation methodologies. 2. Current research. 3. Comparison of dierent adaptation methodologies. Chapter 03 Tools and Techniques 1. Introduction. 2. Detailed description of tools and technologies used in research. 3. Test bed for research. Chapter 04 Design and Analysis of proposed algorithm 1. Introduction to video adaptation algorithm. 2. Design of the video adaptation algorithm. 3. Comparison of video adaptation algorithm with current algorithms. 4. Result of the comparison. Chapter 05 Implementation and experiments of the System 1. This section would discuss the implementation of the system. 2. Preparation of Testbed. 3. Analysis techniques which will nd the eectiveness of the system. Chapter 06 Conclusions and Recommendations 1. This part of the proposed research would summarize the conclusions and would deliver them into a pragmatic framework that can be applied. 2. The recommendations regarding increasing the eciency of the proposed System are expected to emerge out of this research.

10

References
[1] Ffmpeg. http://www.ffmpeg.org/. [Online; accessed 24-December-2010]. [2] Gnu plot. http://www.gnuplot.info/. [Online; accessed 24-December-2010]. [3] J2me. http://www.java.com/en/download/faq/whatis_j2me.xml. [Online; accessed 24-December-2010]. [4] Latex a document preparation system. http://www.latex-project.org/. [Online; accessed 24-December-2010]. [5] Moving picture experts group. http://en.wikipedia.org/wiki/Moving_Picture_ Experts_Group. [Online; accessed 24-December-2010]. [6] Mpeg-4. http://www.mpeg4.net/. [Online; accessed 24-December-2010]. [7] The r project for statistical computing. http://www.r-project.org/. [Online; accessed 24-December-2010]. [8] Video compression standards. http://www.cctvone.com/pdf/FAQ/ VideoCompressionStandardsJournal.pdf. [Online; accessed 24-December-2010]. [9] W3.org. http://www.w3.org/standards/webofdevices/independence. [Online; accessed 24-December-2010]. [10] Etsi technical report. http://www.etsi.org/deliver/etsi_etr/001_099/003/01_ 60/etr_003e01p.pdf, 1990. [Online; accessed 24-December-2010]. [11] Dilithiumn networks. http://www.dilithiumnetworks.com/pdfs/white_papers/ MKT_ART_MobileContentAdaptation.pdf, 2008. [Online; accessed 24-December-2010]. [12] Cellular-news. http://www.cellular-news.com/story/44138.php, 2010. [Online; accessed 24-December-2010]. [13] Internet world stats. http://www.internetworldstats.com/stats.htm, 2010. [Online; accessed 24-December-2010]. [14] Internet world stats. http://www.internetworldstats.com/asia.htm#in, 2010. [Online; accessed 24-December-2010]. [15] Pc quest. http://pcquest.ciol.com/content/ITstrategy/2010/110100805.asp, 2010. [Online; accessed 24-December-2010]. [16] Shih-Fu Chang, Di Zhong, and Raj Kumar. Real-time content-based adaptive streaming of sports videos. In CBAIVL 01: Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL01), 2001.

11

[17] William C. Hardy. QoS Measurement and Evaluation of Telecommunications Quality of Service. John Wiley and Sons, England, 2001. [18] Ming-Ho Hsiao, Yi-Wen Chen, Hua-Tsung Chen, Kuan-Hung Chou, and Suh-Yin Lee. Content-aware video adaptation under low-bitrate constraint. EURASIP Journal on Advances in Signal Processing, 2007, 2007. [19] Stephan Kopf, Fleming Lampi, Thomas King, and Wolfgang Eelsberg. Automatic scaling and cropping of videos for devices with limited screen resolution. In Proceedings of the 14th annual ACM international conference on Multimedia, MULTIMEDIA 06, pages 957958, New York, NY, USA, 2006. ACM. [20] Manzato, Marcelo G., Goularte, and Rudinei. Live video adaptation: a context-aware approach. In Proceedings of the 11th Brazilian Symposium on Multimedia and the web, WebMedia 05, pages 18, New York, NY, USA, 2005. ACM. [21] Ganesh Narayana Murthy. Study - element based adaptation of lecture videos to mobile devices. Masters thesis, IIT Bombay, 2010. [22] Prashant Rao. 3g:glass half-empty or bottle half-full? Express Computer, 21, 2010. [23] Gary J. Sullivan, Pankaj Topiwala, and Ajay Luthra. The h.264/avc advanced video coding standard: Overview and introduction to the delity range extensions. In SPIE Conference on Applications of Digital Image Processing XXVII Special Session on Advances in the New Emerging Standard: H.264/AVC, 2004. [24] Wallapak Tavanapong and Srikanth Krishnamohan. A characteristics-based bandwidth reduction technique for pre-recorded videos. In IEEE International Conference on Multimedia and Expo (III), pages 17511754, 2000. [25] A. Vetro, C. Christopoulos, and Huifang Sun. Video transcoding architectures and techniques: an overview. IEEE Signal Processing Magazine, 20:1829, 2003.

12

Anda mungkin juga menyukai