WORKING PAPER 73 ANOTHER APPROACH TO ENGLISH by (original) (raw)
Related papers
1974
A new approach to building descriptions of English is 6utlined and programs implementing the ideas for sentencesized fragments are demonstrated.
Using a parser as a heuristic tool for the description of New Englishes
2009
We propose a novel use of an automatic parser as a tool for descriptive linguistics. This use has two advantages. First, quantitative data on very large amounts of texts are available instantly, a process which would take years of work with manual annotation. Second, it allows variational linguists to use a partly corpus-driven approach, where results emerge from the data. The disadvantage of the parser-based approach is that the level of precision and recall is lower. We give evaluations of precision and recall of the parser we use. We then show the application of the parser-based approach to a selection of New Englishes. using several subparts of the International Corpus of English (ICE). We employ two methods to discover potential features of New Englishes: (a) by exploring quantitative differences in the use of established syntactic patterns (b) by evaluating the potential correlation between parsing breakdowns and regional syntactic innovations.
Sentence Fragments Regular Structures
1988
This paper describes an analysis of telegraphic fragments as regular structures (not errors) handled by rn~n~nal extensions to a system designed for processing the standard language. The modular approach which has been implemented in the Unlsys natural language processing system PUNDIT is based on a division of labor in which syntax regulates the occurrence and distribution of elided elements, and semantics and pragumtics use the system's standard mechankms to interpret them.
A System for Automatic English Text Expansion
IEEE Access
We present an automatic text expansion system to generate English sentences, which performs automatic Natural Language Generation (NLG) by combining linguistic rules with statistical approaches. Here, "automatic" means that the system can generate coherent and correct sentences from a minimum set of words. From its inception, the design is modular and adaptable to other languages. This adaptability is one of its greatest advantages. For English, we have created the highly precise aLexiE lexicon with wide coverage, which represents a contribution on its own. We have evaluated the resulting NLG library in an Augmentative and Alternative Communication (AAC) proof of concept, both directly (by regenerating corpus sentences) and manually (from annotations) using a popular corpus in the NLG field. We performed a second analysis by comparing the quality of text expansion in English to Spanish, using an ad-hoc Spanish-English parallel corpus. The system might also be applied to other domains such as report and news generation.
Description-directed Natural Language Generation
1985
We report here on a significant new set of capabilities that we have incorporated into our language generation system MUMBLE. Their impact will be to greatly simplify the work of any text planner that uses MUMBLE as ita linguistics component since MUMBLE can now take on many of the planner's text organization and decision-making problems with markedly less hand-tailoring of algorithms in either component.
Handbook of Natural Language Processing, 2000
We report here on a significant new set of capabilities that we have incorporated into our language generation system MUMBLE. Their impact will be to greatly simplify the work of any text planner that uses MUMBLE as ita linguistics component since MUMBLE can now take on many of the planner's text organization and decision-making problems with markedly less hand-tailoring of algorithms in either component.
Preliminary report on a program for generating natural language
International Joint Conference on Artificial Intelligence, 1975
A program framework has bben designed in which the lingLuistic facts and heuristics necessary for generating fluent natural language can be encoded. The linguistic data is represented in annotated procedures and data structures which are designed to make English translations of already formulated messages given:in a primary program's internal representation. The messages must include the program's intentions in saying them, in order to adequately specify the grammatical operations required for a translatibn. The pertinant questions in this research have been,: what structure does natural language have that allows it to encode mutifaceted messages; and how must that structure be taken into account in the design of a generation facility for a computer.program. This paper describes the control and data structures of the design and and their motivation. It is a condensation of my Master's Thesis <1>, to which the reader is refered for further information. Work is presently underway on implementing the design in LISP and developing a grammar for use in one or more'of the domains given below.