Anda di halaman 1dari 14

IBM EXAM 000-M91

IBM Content Analytics Product Fundamentals Technical Mastery Test v1

Total Questions:

41

Question: 1
ICA provides a software component that enables users to search multiple repositories with a single query and view only the documents that they are allowed to see without requiring a Single Sign-ON (SSO) policy. What is this component called? A. Real time natural language processing API. B. REST APIs. C. Identity management API. D. Plug-in APIs. Answer: C

Question: 2
Regarding Field filters, which of the following statements is FALSE? A. You can change field values to all lowercase or all uppercase. B. Field filters can map, split, trim or replace strings within field values. C. If multiple field filters exist for a given search field, the parser applies the newest filter first. D. Field filters require a condition to be met. Answer: C

Question: 3
LanguageWare Resources includes the following, EXCEPT for _________________ A. Parsing rules. B. Dictionaries. C. Correlation. D. Break rules. E. Character rules. Answer: C

Question: 4
Comma separated value (csv) files are supported in several different capacities. Which of the following is NOT correct? A. You can import a csv file to create or add words to a custom user dictionary. B. You can export documents as csv files. C. You can export query statistics to a csv file. D. You can import csv files and add them to a search collection. Answer: D

Question: 5
The Real-time NLP API has the following requirements, EXCEPT for: A. The collection can either be a search collection or a text analytics collection. B. The collection's search session must be running. C. The collection's parse and index sessions must be running. D. The NLP REST API accepts both text and binary content. Answer: A

Question: 6
When you configure a collection, you can enable the system to automatically detect Terms of interest. Of the following statements, which one is INCORRECT? A. Terms of interest are only valid for text analytics collections. B. Similar to the Named Entity Recognition annotator, Terms of interest is only supported by a subset (EN, ES, DE, FR, JA) of the languages supported in ICA. C. Terms of interest are the relationships between nouns and nearby verbs and adverbs in the text. D. You can enhance the content analytics by including nouns identified in the terms of interest in a custom user dictionary. Answer: B

Question: 7
In regard to Crawler plug-ins, which of the following statements is NOT true? A. Crawler plug-ins come in two flavors: Web and Non-web. B. Non-web crawler plug-ins are of two types: prefetch or postparse. C. Crawler plug-ins must be written in Java. D. Data source crawler plug-ins can change the content or metadata of crawled documents. Answer: B

Question: 8
Flexible node assignment allows you to create an additional server that will perform which two functions? A. Crawler and document processing. B. Crawler and parser/indexer. C. Crawler and search. D. Document processing and search. Answer: D

Question: 9
The overall process through which LanguageWare segments and normalizes text is referred to as _____________________? A. Lexical Analysis. B. Tokenization. C. Normalization. D. Part of Speech (POS). Answer: A

Question: 10
LanguageWare Resource Workbench (LRW) has been significantly enhanced to work with ICA . Which of the following is NOT one of these LRW improvements? A. A Type catalog has been added to simplify searching for types (and their associated resources) and renaming types (to ensure workspace consistency). B. LRW users can export a UIMA pipeline directly from the LRW and have it installed as the Custom annotator stage of an ICA collection. C. LRW users can send one or more documents to ICA to be annotated by the pipeline configured for one of the collections on the server. D. LRW Users can export a UIMA pipeline to an ICA search collection. Answer: D

Question: 11
When using the text miner application to analyze content, you use the ____________ to identify the relationship of keywords or subfacets that belong to different facets. A. Facets view. B. Time Series view. C. Connections view. D. Deviations view. Answer: C

Question: 12
IBM Content Assessment users benefit from several key features. Which of the following is NOT one of those features? A. Enabling a collection to automatically detect terms of interest. B. Automatically generating facets from document metadata. C. Flagging documents according to administrator-defined criteria. D. Showing the percentage of duplicate documents in the results. Answer: A

Question: 13
What is the simple mechanical process of breaking up white space delimited text into words called? A. Tokenization. B. Segmentation. C. Shallow parsing. D. Regular expressions. Answer: A

Question: 14
All of the following statements about annotators are true, EXCEPT for: A. A software component that performs specific linguistic analysis tasks and produces and records annotations. B. Are always independent of (i.e., do not rely on) other annotators in the document processing pipeline. C. Do the real work of extracting structured information from unstructured data. D. Annotators typically read from and write to the common analysis structure. E. May be collection specific, and are administered via the Configure document processing pipeline link on the Parse and Index tab. Answer: B

Question: 15
The REST APIs have several benefits over the SIAPI APIs. Which of the following statements is NOT correct about REST API benefits? A. Response formats include HTML, JSON and XML. B. Language independent, applications don't have to be written in Java. C. HTTP communication is the only required feature on the client application. D. All communications are in text format. Answer: A

Question: 16
In text analytics, the process of determining a single string representation for a word or term found in text is known as? A. Shallow parsing. B. Normalization. C. Segmentation. D. Character rules. Answer: B

Question: 17
Which of the following statements is TRUE regarding optional facet index? A. Optional Facet Index should be considered for small collections. B. By default, Optional facet index is enabled. C. Optional facet index can be quickly generated by adding additional index servers. D. Long building time is required for creating optional facet index. Answer: D

Question: 18
Categories enable you to group documents that share common characteristics, Which of the following terms is NOT connected with category configuration? A. Rule-based categories. B. Cluster analysis. C. Terms of interest. D. All of the above. Answer: C

Question: 19
Which preferred API would you use to build custom search applications, text miner applications, and administration applications? A. Search and Index API. B. REST API. C. Identity Management API. D. Crawler Plug-in. Answer: B

Question: 20
With regard to custom lexical dictionaries, all of the following are true statements EXCEPT: A. Custom lexical dictionaries are only used by search collections. B. Use the eslexicalbuilder tool to create a custom lexical dictionary .dic file. C. Use the administration console to modify a custom lexical dictionary once it has been added to the system. D. Custom lexical dictionaries must be assigned to one or more collection(s) in order to improve the quality and precision of search results. Answer: C

Question: 21
There are several ways to add a document to a collection. Which of the following CANNOT be used? A. Crawler. B. CSV file import. C. Real-time NLP API. D. REST API. Answer: C

Question: 22
You can export documents with text analysis results from IBM Content Analytics to IBM Cognos BI using which export format? A. XML files. B. Relational database (star-schema tables.) C. CSV files. D. Any of the above. Answer: B

Question: 23
Choose the best answer Text mining and gaining insight into your content is a/an ___________ process. A. Elementary. B. Simple. C. Iterative. D. Uncomplicated.

Answer: C

Question: 24
Misspelled words can cause problems when performing text analysis. Which of the following possible solutions is recommended? A. Create a crawler plug-in to perform spell checking. B. Modify the system dictionary or create a custom user dictionary (i.e., treat misspelled words as synonyms.) C. Create an export plug-in to perform spell checking. D. Enable spelling Correction in the Text Miner application. Answer: B

Question: 25
A Search collection may optionally use the following annotators, except for: A. Named Entity Recognition Annotator. B. Classification Module Annotator. C. Custom Annotator. D. All of the above annotators may be used. Answer: A

Question: 26
What application feature is used to provide integration to the IBM Content Collector? A. Search and Text Miner Export. B. REST API. C. Real-time NLP API. D. Crawler plug-in. Answer: A

Question: 27
Users are being discouraged from using ________________ because they may not be supported in future releases. A. Real-time NLP API. B. SIAPI Administration APIs. C. Crawler Plug-in APIs. D. Identity management component APIs. Answer: B

Question: 28
What must be installed and configured to integrate ICA to the IBM Case Manager? A. ODBC Driver. B. IBM Case Manager Crawler. C. FileNet P8 Content Engine 5.0 Client. D. Both B and C. Answer: D

Question: 29
Which ICA component generates Facets? A. Crawlers. B. Search Servers. C. Indexer. D. All of the above. Answer: C

Question: 30
What do Export plug-ins allow you to do? A. Implement custom logic when exporting crawled, analyzed or searched documents. B. Change indexed content when exporting analyzed documents. C. Use the real-time NLP API to perform ad-hoc text analytics on a document. D. Export collection configuration information. Answer: A

Question: 31
Which BI tool does ICA integrate with to populate and display reports? A. IBM Cognos BI B. Crystal Reports C. Microsoft Excel. D. SAP BusinessObjects Answer: A

Question: 32
Integration to the IBM ECM repositories can be accomplished through which crawlers? A. Content Manager B. Content Integrator C. FileNet P8 D. All of the above. Answer: D

Question: 33
Crawler plug-ins for archive files support which of the following crawler(s)? A. Agent for Windows file systems. B. FileNet P8. C. Sharepoint. D. All of the above. Answer: D

Question: 34
Integrating the IBM Classification Module in the ICA document processing pipeline allows one to: A. Reduce indexing time. B. Improve search quality. C. Categorize documents. D. B and C only. Answer: D

Question: 35
In what order do you follow these steps according to Best Practices? 1. Crawl, parse and index, and inspect the content using the Text Miner application. 2. Integrate ICM and/or develop a custom annotator, and inspect the content using the Text Miner application. 3. Create a small, representative collection of documents. 4. Define new dictionaries, and inspect the content using the Text Miner application. A. 1/2/3/4 B. 3/1/4/2 C. 3/4/2/1 D. 2/4/3/1 Answer: B

Question: 36
With regards to boost word dictionaries, all of the following are true EXCEPT for: A. The boost factor can range from -10 to +10, and the values -1, 0, and 1 have no boost effect. B. You can define specific terms or multi-word terms that raise or lower the rank value of the document in which the term appears. C. You specify boost words in an XML file, and then convert the XML file to a boost word dictionary. D. Boost word dictionaries are language dependent. Answer: D

Question: 37
What is a means of identifying strings of characters using a formal syntax? A. Shallow parsing. B. Normalization. C. Segmentation. D. Regular expressions. Answer: D

Question: 38
Which one of the following elements has the least influence on an ICA Capacity Planning Spreadsheet? A. Number of documents per collection. B. Average document size. C. Document language. D. Largest document size. Answer: C

Question: 39
Custom rule files are implemented by the ______________ annotator in the document processing pipeline. A. Dictionary lookup. B. Linguistic analysis. C. Named entity recognition. D. Pattern matcher. Answer: D

Question: 40
The process of analyzing a sentence to identify its structure and components results in what linguistic category? A. Shallow parsing. B. Part of Speech (POS). C. Tokenization. D. Character rules. Answer: B

Question: 41
Assuming you want to export documents to IBM Content Collector, which one of the following options to export crawled or analyzed documents is NOT useful? A. Enable the CAS as XMI format option. B. Specify whether you want to export information about documents that were deleted from the crawl space since the crawler last checked for new, changed, and deleted documents. C. Specify whether you want to use the field name or facet path for mapping metadata when the documents are exported. D. Specify URI patterns to identify the documents that you want to export. Answer: A

Anda mungkin juga menyukai