7 Phrase disambiguation

7.1 Analysis of Kazakh phrase structure ambiguity

Structural ambiguity has been one of the hard problems in the computer analysis of language since
the earliest work in the field. Effective strategies for eliminating structural ambiguity have been
studied in computational linguistics by Hindle and Rooth, and Brill's rule-based approach has been
used to eliminate ambiguity in phrase matching.
This article studies phrase structure ambiguity from two aspects: delimitation (boundary) ambiguity
and structural-relation ambiguity.
One of the difficulties in Kazakh phrase research is the phrase disambiguation problem.
The causes of ambiguity are the POS ambiguity of words, phrase boundaries that are not easy to
determine, and identical POS sequences that can form different phrase types. There are five such
ambiguous forms.
(1) VD form (v + adv)
Eg. 1a:
is a verb phrase.
Eg. 1b:
is an adverb phrase.
(2) ND form (n + adv, pron + adv)
Eg. 2a:
is a verb phrase.
Eg. 2b:
is an adverb phrase.
(3) NPV form (n + prep + v, pron + prep + v)
Eg. 3a:
is a verb phrase.
Eg. 3b:
is a noun phrase.
(4) VPV form (v + prep + v)
Eg. 4a:
is a verb phrase.
Eg. 4b:
is an adverb phrase.
(5) VP form (v + prep)
Eg. 5a:
is a verb phrase.
Eg. 5b:
is a verb phrase.
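The five forms listed above can be treated as POS patterns to look for in a tagged sentence. The following is a minimal sketch of that idea, not the authors' implementation: it assumes the input is already POS-tagged and that the tag names are exactly the lowercase labels used in the form definitions (v, adv, n, pron, prep). The names AMBIGUOUS_FORMS and find_ambiguous_spans are hypothetical.

```python
# Hypothetical tag inventory, copied from the form labels in the text.
AMBIGUOUS_FORMS = {
    "VD": [("v", "adv")],
    "ND": [("n", "adv"), ("pron", "adv")],
    "NPV": [("n", "prep", "v"), ("pron", "prep", "v")],
    "VPV": [("v", "prep", "v")],
    "VP": [("v", "prep")],
}


def find_ambiguous_spans(tags):
    """Return sorted (start, end, form) triples for every window of `tags`
    whose POS sequence matches one of the five ambiguous forms.
    `end` is exclusive, as in Python slices."""
    spans = []
    for form, patterns in AMBIGUOUS_FORMS.items():
        for pattern in patterns:
            width = len(pattern)
            for start in range(len(tags) - width + 1):
                if tuple(tags[start:start + width]) == pattern:
                    spans.append((start, start + width, form))
    return sorted(spans)
```

For the tag sequence n, prep, v, adv, the function reports an NPV span over positions 0 to 3 and a VD span over positions 2 to 4. Matching only flags candidate spans; it does not decide which phrase type each span actually is, which is the disambiguation step itself.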
These ambiguities cannot be eliminated by simple rule matching; instead, we use a maximum entropy
model to resolve them.
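To make the idea concrete, a conditional maximum entropy model scores each candidate phrase type y for a context x as p(y|x) = exp(sum of weights of the active (feature, y) pairs) / Z(x). The sketch below is only an illustration under stated assumptions: the source does not give its features or training procedure, so the feature strings (form=..., next_pos=...), the labels, the toy samples, and the use of plain gradient ascent are all invented for the example.

```python
import math
from collections import defaultdict


def predict_proba(weights, feats, labels):
    """p(y | x): softmax over the summed weights of active (feature, label) pairs."""
    scores = {y: sum(weights.get((f, y), 0.0) for f in feats) for y in labels}
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {y: math.exp(s - top) for y, s in scores.items()}
    z = sum(exps.values())
    return {y: e / z for y, e in exps.items()}


def train_maxent(samples, labels, epochs=200, lr=0.5):
    """Fit weights by batch gradient ascent on the conditional log-likelihood.
    The gradient for each (feature, label) pair is the empirical feature count
    minus the count expected under the current model."""
    weights = defaultdict(float)
    for _ in range(epochs):
        grad = defaultdict(float)
        for feats, gold in samples:
            probs = predict_proba(weights, feats, labels)
            for f in feats:
                grad[(f, gold)] += 1.0
                for y in labels:
                    grad[(f, y)] -= probs[y]
        for key, g in grad.items():
            weights[key] += lr * g / len(samples)
    return weights


def classify(weights, feats, labels):
    """Pick the most probable label for the given active features."""
    probs = predict_proba(weights, feats, labels)
    return max(probs, key=probs.get)


# Toy, made-up training data: NOT taken from any Kazakh corpus.
LABELS = ["VP", "ADVP"]
TOY_SAMPLES = [
    (["form=VD", "next_pos=n"], "VP"),
    (["form=VD", "next_pos=v"], "ADVP"),
    (["form=VD", "next_pos=n"], "VP"),
    (["form=VD", "next_pos=v"], "ADVP"),
]
```

A call such as classify(train_maxent(TOY_SAMPLES, LABELS), ["form=VD", "next_pos=v"], LABELS) then selects the phrase type for an ambiguous VD span from its context. The point of the design is that the ambiguous form alone (form=VD) carries no preference here, while contextual features shift the probability toward one reading, which is what rule matching on the POS sequence alone cannot do.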