Abstract:
abound. First, short texts do not always observe the syntax of a written language.
Second, short texts usually do not contain sufficient statistical signals to support many
state-of-the-art approaches for text mining such as topic modeling. Third, short texts are
more ambiguous and noisy, and are generated in an enormous volume, which
further increases the difficulty of handling them. We argue that semantic knowledge
is needed in order to better understand short texts.
prototype system for short text understanding which exploits semantic knowledge
short texts.
Architecture diagram:
EXISTING SYSTEM:
generates the k most likely output strings corresponding to the input string. This
is both accurate and efficient. The approach includes the use of a log linear model,
a method for training the model, and an algorithm for generating the top k
candidates, whether there is or is not a predefined dictionary. The log linear model
large scale data show that the proposed approach is very accurate and efficient
settings.
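The top-k candidate generation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the existing system's actual method: the feature names, the hand-set weights, and the helper functions `features`, `score`, and `top_k` are all hypothetical, and the sketch covers only the case where a predefined dictionary is available. A log-linear model scores each candidate as a weighted sum of features and normalizes the exponentiated scores into probabilities.

```python
import heapq
import math

# Hypothetical hand-set weights; a real system would learn them from training data.
WEIGHTS = {"same_first_char": 1.0, "char_overlap": 2.0, "length_diff": -0.5}


def features(source, candidate):
    """Compute illustrative (assumed) features for a source/candidate pair."""
    union = set(source) | set(candidate)
    overlap = len(set(source) & set(candidate)) / len(union) if union else 0.0
    return {
        "same_first_char": 1.0 if source[:1] == candidate[:1] else 0.0,
        "char_overlap": overlap,
        "length_diff": float(abs(len(source) - len(candidate))),
    }


def score(source, candidate):
    """Log-linear score: weighted sum of the feature values."""
    return sum(WEIGHTS[name] * value
               for name, value in features(source, candidate).items())


def top_k(source, dictionary, k):
    """Return the k most likely dictionary entries with their probabilities."""
    scores = {c: score(source, c) for c in dictionary}
    # Normalizer over all candidates, so the probabilities sum to 1.
    z = sum(math.exp(s) for s in scores.values())
    best = heapq.nlargest(k, scores, key=scores.get)
    return [(c, math.exp(scores[c]) / z) for c in best]


result = top_k("microsfot", ["microsoft", "microscope", "apple"], k=2)
for candidate, probability in result:
    print(candidate, round(probability, 3))
```

Scoring every dictionary entry is only practical for small dictionaries; it is used here to keep the sketch short.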
PROPOSED SYSTEM:
challenges abound. First, short texts do not always observe the syntax of a written
easily applied. Second, short texts usually do not contain sufficient statistical
topic modeling. Third, short texts are usually more ambiguous. We argue that
knowledge is needed in order to better understand short texts. In this work, we use
for tasks such as text segmentation, part-of-speech tagging, and concept labeling,
ADVANTAGES:
Module description:
Number of Modules:
After careful analysis the system has been identified to have the following:
Modules:
1. User module
2. Owner module
3. Admin module
4. Chart module
User module:
In the user module, a new user must register through the application form before
entering the site. After login, the user should create a profile for that
login. The user can search for any word and view related words like
Owner module:
before entering the site. After login, the user should create a profile for that
login. The owner can add new words and related words based on
that; if a user searches for a particular word, the owner can add it as soon as possible. The owner
can view the chart based on the most searched words.
Ambiguity level 0 refers to instances that most people regard as unambiguous. These
instances contain only one sense, such as “dog” (animal) and “california” (state).
Ambiguity level 1 instances usually contain more than one sense, but all of these senses
are related to some extent, such as “google” (company & search engine) and “nike”
(brand & company).
Admin module:
The admin is a super user who can view all user and owner
details. The admin can view the chart based on the most searched words and can
add related words, so that users can easily map related words. For example,
ambiguity level 2 refers to instances that most people regard as ambiguous. These
instances contain two or more unrelated senses, such as “apple” (fruit & company)
and “jaguar” (animal & company). In this work, we only focus on disambiguation
of instances.
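The three ambiguity levels described in the owner and admin modules can be illustrated with a small lookup. This is only a sketch: the `SENSES` table is a hypothetical, hand-labeled sense inventory built from the examples in the text, and the `related` flag is an assumption standing in for whatever relatedness judgment a real system would derive from its knowledge base.

```python
def ambiguity_level(senses, related):
    """Map a list of senses to ambiguity level 0, 1, or 2.

    senses:  sense labels for one instance.
    related: True if the senses are related to some extent (hand-labeled here).
    """
    if len(senses) <= 1:
        return 0                    # one sense: unambiguous
    return 1 if related else 2      # related senses: level 1, unrelated: level 2


# Hypothetical sense inventory taken from the examples in the text.
SENSES = {
    "dog": (["animal"], True),
    "california": (["state"], True),
    "google": (["company", "search engine"], True),
    "nike": (["brand", "company"], True),
    "apple": (["fruit", "company"], False),
    "jaguar": (["animal", "company"], False),
}


def level_of(instance):
    """Look up an instance (case-insensitive) and return its ambiguity level."""
    senses, related = SENSES[instance.lower()]
    return ambiguity_level(senses, related)


for word in SENSES:
    print(word, level_of(word))
```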
In the word search module, the user can search for any word and
easily map related words. The chart in this project is created from these searches alone.
The user can view string manipulation words like string, substring, shortcut
Chart module:
of word searching or related word mapping. The admin and owner can view which words
users search for most, so the owner can easily add the mapping words.
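The data behind such a chart can be sketched as a simple frequency count over the search log. The `search_log` list and the `most_searched` helper are hypothetical names used only for illustration; the project's actual storage and charting code is not shown in this document.

```python
from collections import Counter


def most_searched(search_log, top_n=3):
    """Return the top_n (word, count) pairs, most searched first."""
    # Normalize case and whitespace so "Apple" and " apple" count as one word.
    counts = Counter(word.strip().lower() for word in search_log if word.strip())
    return counts.most_common(top_n)


# Hypothetical search log; in the project this would come from stored user searches.
search_log = ["apple", "jaguar", "Apple", "dog", "apple", "jaguar"]

# Print a plain-text bar chart of the most searched words.
for word, count in most_searched(search_log, top_n=2):
    print(f"{word:<8} {'#' * count} ({count})")
```

The owner or admin could read the same counts to decide which related words to add first.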
System Requirements
Hard Disk - 20 GB
Monitor - SVGA
Scripts - JavaScript, jQuery, Ajax
effectively and efficiently. More specifically, we divide the task of short text
understanding into three subtasks: text segmentation, type detection, and concept
improve efficiency at the same time. We introduce a Chain Model and a Pairwise
They achieve better accuracy than traditional POS taggers on the labeled
incorporate the impact of spatial-temporal feature into our framework for short text
understanding
Future enhancement:
In the future, we will further develop our algorithm in the
following aspects:
effectively and efficiently. More specifically, we divide the task of short text
concept labeling
detection. They achieve better accuracy than traditional POS taggers on the
labeled benchmark.