
Code No: R5411205 1

IV B.Tech I Semester(R05) Regular/Supplementary Examinations, December 2009


INFORMATION RETRIEVAL SYSTEMS
(Information Technology)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****

1. (a) Show that the statement "Language is the largest inhibitor to good communications" applies
to Information Retrieval Systems.
(b) Explain Selective Dissemination of Information. [8+8]

2. Describe the various standards used in Information Retrieval Systems. [16]

3. (a) What are the objectives of indexing?
(b) Explain multimedia indexing. [8+8]

4. (a) Explain the stemming process in detail.
(b) Describe the signature file structure. [8+8]

5. Explain the term frequency algorithm with a suitable example. [16]

6. Write short notes on the following, with suitable examples:
(a) Manual clustering.
(b) Automatic term clustering. [16]

7. Write short notes on:
(a) Hidden Markov Model techniques.
(b) Ranking algorithms. [8+8]

8. (a) How are finite state automata used in hardware and software searchers?
(b) Construct a finite state automaton for the following set of terms:
BIT, FIT, HIT, MIT, PIT, SIT.
Be sure to define the three sets I, S, P along with providing the state drawing. [8+8]

*****
Code No: R5411205 2
IV B.Tech I Semester(R05) Regular/Supplementary Examinations, December 2009
INFORMATION RETRIEVAL SYSTEMS
(Information Technology)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****

1. Describe in detail the functional overview of an Information Retrieval System. [16]

2. Explain why the use of proximity improves precision compared with using only the Boolean
functions. Discuss its effect on recall. [16]

3. (a) Discuss the different processes associated with information extraction.
(b) What are the problems with Luhn's concept of "resolving power"? [10+6]

4. (a) Explain the stemming process in detail.
(b) Describe the signature file structure. [8+8]

5. Explain signal weighting with an example. [16]

6. (a) Define clustering. What are the general guidelines for clustering?
(b) Clearly bring out the steps of the clustering process. [8+8]

7. Write short notes on:
(a) Hidden Markov Model techniques.
(b) Ranking algorithms. [8+8]

8. Compare and contrast TREC measurement examples and results. [16]

*****
Code No: R5411205 3
IV B.Tech I Semester(R05) Regular/Supplementary Examinations, December 2009
INFORMATION RETRIEVAL SYSTEMS
(Information Technology)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****

1. Define an Information Retrieval System. What are its objectives? Explain them. [16]

2. (a) Are thesauri a subclass of concept classes? Justify your answer.
(b) Compare and contrast fuzzy searches and term masking. [8+8]

3. (a) Under what circumstances is manual indexing not required to ensure finding information?
Postulate an example where this is true.
(b) Briefly explain how information extraction differs from the process of document indexing. [8+8]

4. Describe the similarities and differences between stemming algorithms and n-grams. Describe how
they affect precision and recall. [16]

5. (a) Discuss the importance of statistical indexing.
(b) Write short notes on:
i. Probabilistic weighting.
ii. Vector weighting. [8+8]

6. (a) What are the general guidelines for clustering? Discuss them.
(b) List the important decisions associated with the generation of a thesaurus. [8+8]

7. Write a note on:
(a) Similarity measures.
(b) Ranking algorithms. [8+8]

8. What algorithmic basis is used for the GE-SCAN and Fast Data Finder hardware text search
machines? Why is this approach used over others? Explain. [16]

*****
Code No: R5411205 4
IV B.Tech I Semester(R05) Regular/Supplementary Examinations, December 2009
INFORMATION RETRIEVAL SYSTEMS
(Information Technology)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****

1. (a) What is item normalization? Explain it.
(b) How is a DBMS different from an Information Retrieval System? [8+8]

2. (a) Briefly discuss proximity in Information Retrieval Systems.
(b) Explain multimedia information systems in IRS. [8+8]

3. (a) Explain the different factors involved in deciding the level of indexing.
(b) Discuss in detail the different index processing techniques. [8+8]

4. (a) Briefly describe the concepts of the signature file structure.
(b) Briefly describe the concepts of the Porter stemming algorithm. [8+8]

5. Explain the term frequency algorithm with a suitable example. [16]

6. Compare and contrast Manual Clustering and Automatic Term Clustering. [16]

7. (a) What are the two major approaches to generating user queries?
(b) Discuss natural language and Boolean query techniques. [4+12]

8. Construct a finite state automaton for each of the following sets of terms:
(a) BIT, FIT, HIT, MIT, PIT, SIT
(b) CAN, CAR, CARPET, CASE, CASK, CAKE
(c) HE, SHE, HER, HERE, THERE, SHEAR
Be sure to define the three sets I, S, P along with providing the state drawing. [5+6+5]

*****
