Zhao-Ming Gao - Academia.edu
Papers by Zhao-Ming Gao
We present an English miscollocation identification system based on dependency relations drawn from the Stanford parser. We test our system against a subset of the error-tagged Chinese Learner English Corpus (CLEC) and obtain an overall precision of 0.75. We describe some applications and limitations of our system and suggest directions for future research.
ATALA Workshop …, 1999
This paper presents the methodology and guidelines for annotation in the CKIP Chinese Treebank. Under the framework of Information-based Case Grammar (ICG), a lexical feature-based grammar formalism which stipulates that each lexical item contains both syntactic and semantic information, the potential phrasal heads of the input are located and the semantic relations between words are identified. Thus, both phrasal categories and thematic roles are annotated. Incorporating the Head-Driven Principle, we also implement some guidelines for more consistent annotation of such grammatical phenomena as coordinate constructions, topicalization, and constructions with nominal predicates. In addition, we tag the CKIP Treebank with semantic categories to extract useful collocations among the semantic classes of the bracketed constituents, which is also expected to further enhance the performance of our parsing model.
This paper describes the design criteria and annotation guidelines of the Sinica Treebank. The three design criteria are: Maximal Resource Sharing, Minimal Structural Complexity, and Optimal Semantic Information. One of the important design decisions guided by these criteria is the encoding of thematic role information. We discuss the representational and methodological issues based on our design criteria.