Andy Chin | Education University of Hong Kong
Papers by Andy Chin
Dissertation Information. Title: The Verb GIVE and the Double-object Construction in Cantonese in Synchronic, Diachronic and Typological Perspectives. Author: Andy Chin.
Journal of Chinese Linguistics, 2010
This paper proposes that there are two types of indirect object markers in the Chinese language: the go-type and the give-type. The chronological development of these two types of indirect object markers will be discussed. Moreover, with reference to the Cantonese dialects, this paper will examine the factors contributing to the replacement of the go-type marker by the give-type marker. Finally, this typology of the indirect object markers is discussed from an areal linguistic perspective.
We used a production segmentation system that draws heavily on a large dictionary derived from processing a large amount (over 150 million Chinese characters) of synchronous textual data gathered from various Chinese speech communities, including Beijing, Hong Kong, Taipei, and others. We ran this system in two tracks of the Second International Chinese Word Segmentation Bakeoff, with Backward Maximal Matching (right-to-left) as the primary mechanism. We also explored the use of a number of supplementary features offered by the large dictionary in postprocessing, in an attempt to resolve ambiguities and detect unknown words. While the results might not have reached their fullest potential, they nevertheless reinforced the importance and usefulness of a large dictionary as a basis for segmentation, and showed how following a uniform standard affects segmentation performance on data from various sources.
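The Backward Maximal Matching step mentioned above can be illustrated with a minimal sketch. This is not the authors' production system: the function name, the `max_len` window, and the four-word toy dictionary are all assumptions made for illustration, and the postprocessing for ambiguity resolution and unknown-word detection is left out entirely.

```python
def backward_maximal_match(text, dictionary, max_len=4):
    """Segment `text` right-to-left, taking the longest dictionary word at each step.

    `dictionary` is any container supporting `in` (a set here);
    `max_len` is a hypothetical cap on word length in characters.
    """
    tokens = []
    end = len(text)
    while end > 0:
        # Try the longest candidate ending at `end` first, then shrink it.
        for size in range(min(max_len, end), 0, -1):
            candidate = text[end - size:end]
            # A single character is always accepted as a fallback,
            # so characters missing from the dictionary still get segmented.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                end -= size
                break
    # Tokens were collected from the end of the string, so restore reading order.
    tokens.reverse()
    return tokens


# Toy dictionary (hypothetical); a real system would load a large lexicon.
toy_dictionary = {"研究", "研究生", "生命", "起源"}
print(backward_maximal_match("研究生命起源", toy_dictionary))
```

With this toy dictionary, scanning from the right yields 研究 / 生命 / 起源, whereas a left-to-right longest match would first consume 研究生 and leave 命 stranded. This kind of ambiguity is one reason to choose a matching direction deliberately.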
Very large corpora of properly processed textual materials are uncommon, but they can provide important resources for language modeling in natural language processing, ranging from speech processing and text input to automatic IR and patent translation. Moreover, when properly cultivated in spatial-temporal terms, they can foster innovative knowledge discovery in database applications by functioning as a monitoring corpus, and enhance the human-centered communication environment by allowing more substantive introspection and comparison of linguistic and social-cultural developments of the relevant speech communities. This paper discusses how the gigantic synchronous and homothematic corpus of Chinese, LIVAC, can contribute to monitoring linguistic homogeneity and heterogeneity diachronically and synchronically. After processing media texts of more than 400 million Chinese characters over 16 years, LIVAC has yielded a lexical corpus of 1.5 million words. This paper examines some a...
Bulletin of Chinese Linguistics
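Abstract: This paper is a preliminary report on fieldwork conducted in 2011 and 2012 on Gelong (哥隆話), spoken in western Hainan Island. The aim of the survey was to compare present-day Gelong with the language as it was described more than twenty years earlier (e.g. 符鎮南 (1996) and 歐陽覺亞 (1998)). In addition, we compare one hundred basic vocabulary items in Gelong and Hlai (黎語) and discuss the genetic affiliation of Gelong.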
This paper reports on a corpus-based sociolinguistic study of terms of address, with a special focus on kinship terms found in The Corpus of Mid-20th Century Hong Kong Cantonese (http://hkcc.eduhk.hk/), which has a size of about one million Chinese character tokens. The corpus data was collected by transcribing the speech dialogues of 80 black-and-white movies produced in Hong Kong between 1940 and 1970. The kinship terms extracted from the corpus can tell us about the family structure and marital life of Hong Kong six decades ago.
Digital Humanities and New Ways of Teaching
Linguistics and Education
Abstract: Early readers can play a significant role in the intergenerational transmission of gender roles. The present study examines how females and males are represented in selected early readers recommended by the Education Bureau of Hong Kong for the promotion of ‘Reading to Learn’ and ‘Reading across the Curriculum’. The study used both manual and computational methods to examine how experiential and relational values are expressed through variables such as the ratio of female-to-male character types, the roles and activities depicted, character identification and the order of mention of males and females. The findings show that although the number of female human character types was similar to that of their male counterparts, there were substantially more male than female animal character types. The study also reveals gender stereotypes including confining females to a limited range of traditional roles and activities, addressing females more informally than males, and a stronger tendency to identify females by their relationships with others. The paper ends with some recommendations for education authorities, teachers and parents on how to help children interpret gender and redress unfair practices.
Journal of Chinese Linguistics, 2016
2010 4th International Universal Communication Symposium, 2010