Creating a Corpus and Chained Bigrams for Spanish Keyboard Development and Evaluation (original) (raw)

Keyboard Layout Analysis: Creating the Corpus, Bigram Chains, and Shakespeare's Monkeys

The process to create a corpus suitable for evaluating computer keyboard layouts optimised for typing English and computer program code. After sourcing, sampling and cleaning suitable texts, the texts are processed to extract bigrams, which are then used to create sample input texts of a desired length. These texts have a character distribution, and letter sequence, closely matching either English or computer programs, even though they look random. The resulting texts are excellent for evaluating keyboard layouts. Corpus analysis is included. Keywords: English text corpus, computer code corpus, English letter frequency, computer program character frequency, bigram frequency, letter follows letter probability, letter precedes letter probability, keyboard layout, keyboard layout evaluation. Best viewed and printed in colour.

AN OPTIMISED QWERTY KEYBOARD FOR THE TYPING OF ORAL ENGLISH DOCUMENTS

Journal of Language, Literature and Cultural Studies , 2017

Due to the number of phonetic symbols needed in Oral English texts, authors usually depend on the use of symbols from the insert menu of a word processing program like Microsoft Word or switching between a phonetic keyboard and the conventional English keyboard. The present paper seeks to avoid this problem by designing a single keyboard incorporating the Oral English symbols in the conventional Qwerty keyboard. The study employed the Microsoft Keyboard Layout Creator (MSKLC) 1.4 application and Charis SIL and Doulos SIL Unicode fonts packages to implement the typing of IPA symbols and diacritics on a single Qwerty ANSI keyboard layout. This paper reveals that a single functional keyboard layout can be optimized for easy and accurate typing of symbols in Oral English documents. Therefore, shifting among several keyboards will no longer be necessary after the basic qwerty has been optimised.

Letter Frequency Analysis of Languages Using Latin Alphabet

International Linguistics Research, 2018

The evaluation of the peculiarities of alphabets, particularly the frequency of letters is essential when designing keyboards, analysing texts, designing alphabet-based games, and doing some text mining. Thus, it is important to determine what might be useful for designers of text input tools, and of other technologies related to sets of letters. Knowledge of common features among different languages gives an opportunity to take advantage of the experience of other languages. Nowadays an increasing amount of texts is published on the Internet. In order to adequately compare the frequencies of letters in different languages used in the online space, Wikipedia texts have been selected as a source material for investigation. This paper presents the Method of the Adjacent Letter Frequency Differences in the frequency line, which helps to evaluate frequency breakpoints. This is a uniform evaluation criterion for 25 main languages using Latin script in order to highlight the similarities ...

Granada: A typographic study on the visual features of written Spanish

Language and writing are two of mankind’s most important cultural manifestations, is it possible to establish a connection between them from a typographic perspective and assert that languages themselves —among those written with the Latin script— have a particular look? This research project intends to answer this question through the comparative description of ten European languages from a visual point of view which includes a characterization of their texture and rhythm. Subsequently, and regarding the project’s creative component, the features of written Spanish —the author’s native language— will be used as a type design criteria.

Improving text entry performance for Spanish-speaking non-expert and impaired users

2005

In this paper, an efficient method is described which is able to improve the efficiency of typing texts for non-expert and impaired users. Specifically, we propose an innovative keyboard configuration that improves typing performance in languages with transparent orthographies. By adopting a novel orthogonal framework, the configuration of the keyboard is defined as a 2-D regular array of keys. According to this scheme it is possible to input, in a direct and intuitive way, any possible combination of pseudo-syllables (which are text entry units with simpler consonant-vowel phonemic structure), being also possible to introduce single characters in the classical, letter by letter, way. The orthogonal keyboard scheme has been applied and tested with the Spanish language, for it is the most spoken transparent language in the world. The performed tests show a significant improvement of the efficiency in alphanumeric text typing.

Optimal character arrangements for ambiguous keyboards

1998

| Many persons with disabilities lack the ne motor coordination necessary to operate traditional keyboards. For these individuals, ambiguous or reduced keyboards offer an alternative access method. By placing multiple characters on each k ey, the size and accessibility of the individual keys can be enhanced without requiring a larger keyboard. Using statistical disambiguation algorithms to automatically interpret each keystroke, these systems can approach the keystroke e ciency keystrokes per character of conventional keyboards. Since the placement o f c haracters on each key determines the e ectiveness of these algorithms, several methods of optimizing keyboard arrangements have previously been proposed. This paper presents a new method for optimizing an arbitrary set of N characters over a collection of M keys. While earlier e orts relied upon approximations of keystroke eciency, the proposed approach optimizes the arrangement under this exact performance measure. Applied to the canonical 26 characters on 9 key telephone keypad" problem, this method provides an improvement in e ciency of 2.5 percentage points over previously established layouts. Using only a minimum of calculations, the proposed technique operates quickly and e ciently, deriving optimal arrangements in a matter of seconds using a personal computer. The exible method is applicable to arbitrary disambiguation algorithms, character sets, and languages. Keywords| A m biguous keyboard, reduced keyboard, disambiguation, character prediction

Frequency Analysis of Languages Using Latin Alphabet

2018

The evaluation of the peculiarities of alphabets, particularly the frequency of letters is essential when designing keyboards, analysing texts, designing alphabet-based games, and doing some text mining. Thus, it is important to determine what might be useful for designers of text input tools, and of other technologies related to sets of letters. Knowledge of common features among different languages gives an opportunity to take advantage of the experience of other languages. Nowadays an increasing amount of texts is published on the Internet. In order to adequately compare the frequencies of letters in different languages used in the online space, Wikipedia texts have been selected as a source material for investigation. This paper presents the Method of the Adjacent Letter Frequency Differences in the frequency line, which helps to evaluate frequency breakpoints. This is a uniform evaluation criterion for 25 main languages using Latin script in order to highlight the similarities ...

Using Sequential Data Analyses to Determine the Optimum Layout for an Alternative Keyboard

Proceedings of the Human Factors and …, 2001

The standard QWERTY keyboard was developed over a hundred years ago. It is suspected to be involved in repetitive strain injuries, such as carpal tunnel syndrome (CTS). To reduce and eliminate many of the movements that are suspected to contribute to CTS, a new type of alphanumeric input based on the chording concept was designed. This AID-CTS keyboard is an alphanumeric input system that uses a pair of devices each comprised of an inverted dome upon which the hands rest. As a chordal device, the AID-CTS keyboard typing methodology entails creating a keystroke via a combination of positions of the two domes. The purpose of the current study was to determine a new character layout that would reduce the ergonomic impact of typing further. Two studies were conducted. In Study 1, we analyzed two-letter sequences using sequential, multi-way frequency analyses and established a listing of the most important two-letter transitions. In Study 2, we created a number of competing character layouts and analyzed them regarding their ergonomic impact. The studies resulted in an optimum layout that minimizes arm and wrist movements.