RESEARCH ARTICLE
OPEN ACCESS
ABSTRACT
The world now contains an unimaginably vast amount of electronic brain information wh ich is getting ever infin ite ever more
rapidly. There are many reasons for the informat ion exp losio n. The most obvious one is technology. Despite the affluence of
tools to apprehand, process and distribute all th is information sensors, computers, mobile phones it already overreaches the
available storage area. The quantity of data is growing very fast and we need to reflect how data should be processed and what
kind o f information to ext ract. The consequence is being felt worldwide, fro m industry to science, fro m government to the arts.
Scientists and computer engineers have introduced a new word for th e wonder: Big Datain s mall terms big data relates to
informat ion that cant be processed or examined using conventional processes or tools. Classification with b ig data has become
one of the most recent vogue when talking about learning from the available information.
Keywords :- Big Data, classification ,Fuzzy Logic, Fuzzy Rule Based Classification System, Hadoop, key-value, Map Reduce
I. INTRODUCTION
Now a days, data has become a very important part of our
life. The data fro m various sources is collected and stored
somewhere in the data warehouse. For identify ing the datasets
that are of large size and have greater co mplexity we use
concept of big data. Big data is a group of large amount of
assembled and unassembled data fro m different sources
comprises of Big data. Data coming fro m social network,
mach ine generated data, and traditional enterprise are various
sources of big data[1].Storing the data, managing the stored
data and analyzing, using traditional techniques of data
mining is not possible. Therefore it is necessary that for
predicting the future trends, useful info rmation has to be
extracted fro m these data sets. The concept of Hadoop is used
for processing large vo lu mes of data [2]. There are various
effective and accepted tools for pattern recognition and
classification which are used now which work under the
framework of Map Reduce which is a programming model
and the tools are called as Fuzzy Rule Based Classificat ion
Systems [3]. They are able to obtain a good precision using
these tools and they provide the end user with a model by
making use of labels called as linguistic labels. Also
management of various traits such as uncertainty, amb iguity
or vagueness can be done in a very effectual way. When it
comes to dealing with big data it beco mes interesting, as this
situation comes with built-in ability. The big data contains
ISSN: 2347-8578
www.ijcstjournal.org
Page 32
International Journal of Computer Science Trends and Technology (IJCS T) Volume 4 Issue 2 , Mar - Apr 2016
operation through which results will be co mpared and
analyzed.
ISSN: 2347-8578
www.ijcstjournal.org
Page 33
International Journal of Computer Science Trends and Technology (IJCS T) Volume 4 Issue 2 , Mar - Apr 2016
ISSN: 2347-8578
1) Linguistic fu zzy partit ions are build: This step builds the
fuzzy DB fro m the domain associated to each attribute A i
using equally distributed triangular membership functions.
www.ijcstjournal.org
Page 34
International Journal of Computer Science Trends and Technology (IJCS T) Volume 4 Issue 2 , Mar - Apr 2016
2) A new fu zzy ru le associated to each examp le xp = (xp1 ; : : : ;
xp n ,Cp ) is generated.
a) Matching degree xp of example is calculated with respect to
the fuzzy labels of each attribute using a conjunction operator.
b) The maximu m membership degree obtaining a fu zzy reg ion
is selected which is corresponding to the example.
c) Building of a new fuzzy rule whose antecedent is calculated
corresponding to the previous fuzzy reg ion and whose
successive is the class label of the example Cp.
d) Rule weight is computed finally.
Now, several ru les with the same antecedent can be built. If
they have the same class in the successive, then, same rules
are deleted if having the same class in the successive.
However, rule with highest weight is maintained in the RB if
the class in the successive is not same if the class in the
successive is different, only the rule with the highest weight is
maintained in the RB.
ISSN: 2347-8578
www.ijcstjournal.org
Page 35
International Journal of Computer Science Trends and Technology (IJCS T) Volume 4 Issue 2 , Mar - Apr 2016
antecedent and will be computed in different mapper
processes with different training sets.
The class estimat ion process followed is exactly the same for
both approaches.
IV.
V. CONCLUSION
In this work, we presented the importance of Big Data and
how fuzzy ru le-based classification algorith m with linguistic
variables will be implemented. Using this algorithm we will
try to obtain an interpretable model that will be ab le to handle
big collections of data. We will use Map Reduce
Programming model. In th is way, our model d istributes the
computation using the map function and then, combines the
outputs through the reduce function. The Chi-FRBCS-Big
Data algorith m is developed in two different versions: Ch iFRBCSBigData- Max and Chi-FRBCS-BigData-Ave and
compared for their performance.
REFERENCES
[1]
[2]
[3]
[4]
[5]
ISSN: 2347-8578
www.ijcstjournal.org
Page 36
International Journal of Computer Science Trends and Technology (IJCS T) Volume 4 Issue 2 , Mar - Apr 2016
Based Classificat ion Systems for Big
Data,International Journal of Computational
Intelligence Systems, Vol. 8, No. 3, July 6-11,
2014.
[6]
[7]
[8]
[9]
[10]
[11]
ISSN: 2347-8578
www.ijcstjournal.org
Page 37