MSE
0.1
reproduce at a higher rate is to use a selection
method based on the roulette wheel selection
technique.
There is several ways to test the networks 0.01
Epoch
performance. Usually, MSE "mean squared
Training MSE Cross Validation MSE
error" is used. It is two times the average cost
which is computed as follows: Figure 3. The graphs of MSE for SVM
∑∑ (d )
P N
ij −
2
yi j
j =o i =0
MSE = .
NP
MSE versus Epoch
1
Cross Validation MSE
Where, P is the number of output processing 0.9
Training MSE
elements. N is the number of exemplars in the 0.8
0.7
data set. yij is the network output for exemplar i 0.6
at processing element j . dij is the desired output
MSE
0.5
0.3
The best network results for training data of 0.2
the proposed taggers are reports a minimum 0.1
Min. MSE 0.0390025 0.0001036 0.0206866 The comparison study has to be done
carefully because the features used here do not
Final MSE 0.0390053 0.0001034 0.0206866 match those in the previous studies. On the other
hand, the comparison of proposed tagger with
other existing taggers is difficult matter, because
the tagger accuracy relies on numerous
MSE versus Epoch
1 parameters such as language complication
1 100 199 298 397 496 595 694 793 892 991
(ambiguous words, ambiguous phrases), the
0.1 language type (English, Arabic, Chinese, etc),
the training data magnitude, the tag-set size and
MSE
0.01
the evaluation measurement criteria. Tag-set size
has a great impact on the tagging process.
0.001
The proposed taggers are assessed using the
0.0001
measurement of Accuracy, besides MSE aspects.
Epoch In comparison study of proposed taggers with
Training MSE Cross Validation MSE
the results of several taggers of other researchers,
the accuracy of proposed taggers gets a super
Figure 2. The graphs of MSE for MLP rate when using GA optimization techniques to
improve the values of the momentum rate and
the step size. The proposed taggers (MLP, SVM
and FRNN) achieved accuracy of 99.99% at last
experiments when using GA optimization
process. Table 2 summarized the comparison Taggers Accurcy Without Using GA Optimizatiion
information and illustrated in Figures 5 and 6. 105
100
6.2 Conclusion
SVM
Pérez-Ortiz 2001
95
Khoja 2001b
Schmid 1994
Accuracy
Diab 2004
FRNN
This research was mainly aimed to implement 90
Diab 2004
Pérez-Ortiz 2001
Ahmed 2002
Schmid 1994
and utilize an automatic POS tagging system
Khoja 2001b
Ahmed 2002
85
based on neural network methods, which has the
MLP
SVM
ability to tag the Arabic texts mechanically. We 80 MLP
FRNN
have demonstrated variant kinds of tagger which
75
can solve the problem of Arabic part of speech. Tagger Name
The new approaches are highly accurate with
low processing time, and high speed words
tagging. The results are greatly encouraging, Figure 6. Comparison result (Proposed
with correct assignments between 86% and taggers without using GA optimization)
99.99% depending on either using genetic
Algorithm optimization or not, in optimizing the 7. Future Work
values of network variables like the momentum The research has demonstrated the
rate and step size. applicability of Neural-based tagging techniques
to Arabic tagger systems. Still, there are some
modules that can be added to improve the
Table 2. The comparison results of preprocess phase like the affixes extraction and
proposed tagger & other taggers segmentation.
REFERENCES
proposed
Diab [8]
Ahmed[
Schmid
Khoja
Pérez
[17]
[15]
[11]
1]
98 Pérez-Ortiz 2001
96 Khoja 2001b conference, Seattle, WA, 2000.
Accuracy
Diab 2004
94
Schmid 1994
[6] Brill, E. "Unsupervised learning of
Pérez-Ortiz 2001
Schmid 1994
92
Ahmed 2002 disambiguation rules for part of speech
Diab 2004
Khoja 2001b
Ahmed 2002
90
SVM
88 MLP
tagging". Proceedings of third ACL
86 FRNN Workshop on Very Large Corpora, 1995.
84
Tagger Name
[7] Brill, E. "A simple rule-based part-of-speech
tagger", proceedings of ANLP-92, 3rd
Conference on Applied Natural Language
Figure 5. Comparison result (Proposed Processing, pp 152–155, Trento, IT, 1992.
[8] Diab, M., Kadri H. & Daniel J. "Automatic
taggers using GA optimization) tagging of Arabic text: from raw text to base
phrase chunks", proceedings of HLT- machines, International Symposium on
NAACL-04, 2004. Information Technology, Kuala Lumpur
[9] Gimenez, J. & Llu´ıs M. "Fast and accurate Convention Centre, 200808, ISBN 978-1-
part-of-speech tagging: The SVM approach 4244-2328-6©IEEE, Malaysia, August 26-
revisited", proceedingsof the International 29,2008.
conference on recent advances on natural [21] Yousif, J. H. & Sembok T. M. T. "Design
language processing, Borovets, Bulgaria, and Implement an Automatic Neural Tagger
2003. Based Arabic Language for NLP
[10] Khoja S, Garside, R. & Gerry, K. "An Applications", Asian Journal of Information
Arabic tagset for the morphosyntactic Technology 5(7): PP 784-789, ISSN 1682-
tagging of Arabic", corpus linguistics, 3915,2006.
Lancaster University, Lancaster, UK, 2001. [22] Yousif, J. H. & Sembok T. M. T.
[11] Khoja, S. "APT: Arabic part-of-speech "Recurrent Neural Approach based Arabic
tagger", proceedings of the student Part-of-Speech Tagging", International
workshop at the second meeting of the north Conference on Computer and
American chapter of the association for Communication Engineering (ICCCE'06),
computational linguistics (NAACL2001), VOL 2, ISBN 983-43090-1-5© IEEE. KL-
Carnegie Mellon University, Pennsylvania, Malaysia May 9-11,2006.
2001. [23] Yousif, J. H. & Sembok T. M. T. "Arabic
[12] Ma, Q., Uchimoto, K., Murata,M. & Part-of-Speech Tagger Based neural
Isahara,H. "Elastic neural networks for part networks", International Arab Conference
of speech tagging", proceedings of on Information Technology
IJCNN’99, pp 2991–2996, Washington, DC, (ACIT2005),ISSN 1812/0857. Jordan-
1999. Amman-2005.
[13] Mahtab, N. & Choukri, K. "Survey on
Industrial needs for Language Resources",
2005.Online
"http://www.nemlar.org/Publications/Nemla
r-report-ind-needs_web.pdf".
[14] Marques, N. C. & Gabriel. P. L. "Using
neural nets for Portuguese part-of-speech
tagging", proceedings of the 5th international
conference on the cognitive science of
natural language processing, Dublin City
University, Ireland, 1996.
[15] Persz-ortz A. J. & Forcada M. L. "Part-of-
speech tagging with recurrent neural
networks", proceedings of the International
Joint Conference on Neural Networks,
IJCNN- IEEE2001:1588-1592, 2001.
[16] Principe, J.C. ,Euliano,N.R. & Lefebvre,
W.C. "Neural and adaptive systems,
fundamentals through simulations", John
Wiley & Sons, NY, 2000.
[17] Schmid, H. "Part-of-speech tagging with
neural networks", proceedings of COLING-
94, Kyoto, Japan, pp 172–176, 1994.
[18] Srinivas, M. & Patnaik, L. M. Genetic
algorithms: a survey", IEEE Computer
27(6), pp17-26, 1994.
[19] Weischedel, R., et al. "Coping with
ambiguity and unknown words through
probabilistic models", Computational
Linguistics. 19(2), pp359-382,1993.
[20] Yousif, J. H. & Sembok T. M. T. "Arabic
part-of-speech tagger based support vectors