Anda di halaman 1dari 22

NITANSHI MAHAJAN

What if we could gain relevant insights from our news


feed so that we only read articles or news about
something/someone we are concerned about ?
Aim of the Project :
1) Focus on efficient reading.
2) Capture insights from unstructured data.

How to deal with this problem ?


Using the Natural Language Processing.
An IBM supercomputer that
combines artificial intelligence
(AI) and sophisticated
analytical software for optimal
performance as a “question
answering” machine.
Watson was created as a question
answering (QA) computing system that
IBM built to apply -

Advanced natural language processing


 Information retrieval
 Knowledge representation
 Automated reasoning
Machine learning technologies
 Cloud – based application.
 Helps in building custom made models.
 Combines AI and ML with data analysis.
Mention detection
Relation Detection
Co reference Resolution
Rule Definition
Regex Definition
Other functionality
PRE - ANNOTATION

TRAINING AND ANALYSIS


TRIAN IBM
MODEL DEPLOY WATSON
ON IBM TO NLU
WKS SERVICE
Uses Natural Language Understanding(NLU) techniques
that help computer understand the complexities of
human language to derive fast and meaningful insights
from large unstructured data.

It helps in drawing various parameters such as :


Concepts
Emotions
Categories
Relations
Entities
Keywords
Semantic Roles
Sentiment
There was a small test model built by me to understand the
working of IBM WKS and it’s NLU service to get a gist of how
ML works.
Created workspaces of domains for which the
models were to be built.
Uploaded documents in the formats specified.
Creating entity types and relation types for
annotation purpose. In order to accelerate the
process, we could use pre- annotators.
After defining entity types , relation types and other
things, we annotate our document.
To help Watson understand nuances of the language of
our domain, we train Watson using defined rules.
After the training is done, we test and evaluate our
model and iterate the annotation process with more
accuracy until we obtain optimum results.
After training and evaluation, we deploy our model
to one of the Watsons’ Services to find solution to our
business problem.
Natural Language Processing (NLP) is the art and
science which helps us extract information from
text and use it in our computations and algorithms.

Named Entity Sentiment


Stemming
Recognition Analysis

Named Entity Semantic Text


Lemmatization
Disambiguation Similarity

Word Language
Part of Speech
Embedding Identification
Autocorrect
Spellchecker
Grammar Checker
Spam Detection
Email classification in Gmail
Duplicate detection
And many more…

Anda mungkin juga menyukai