Text Indexing Research Papers - Academia.edu

We consider the range mode problem: given a sequence and a query range within it, find the items with maximum frequency in that range. We give time- and space-efficient algorithms for this problem. Our algorithms are efficient when the maximum frequency is small. We also consider a natural generalization, the range mode enumeration problem, for which no efficient algorithms were previously known. The query time of our algorithms is linear in the output size plus small terms.
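To make the problem statement concrete, here is a minimal brute-force sketch in Python. It is not the algorithm from the paper: it simply counts every item in the query range, so each query costs time proportional to the range length. The function name `range_modes` and the inclusive index convention are assumptions made for illustration.

```python
from collections import Counter


def range_modes(seq, lo, hi):
    """Naive baseline: return (max_frequency, modes) for seq[lo..hi], inclusive.

    Hypothetical helper for illustration only; it scans the whole range
    on every query instead of using a precomputed index.
    """
    if not 0 <= lo <= hi < len(seq):
        raise ValueError("query range must satisfy 0 <= lo <= hi < len(seq)")
    counts = Counter(seq[lo:hi + 1])       # frequency of each item in the range
    best = max(counts.values())            # the maximum frequency
    # Range mode enumeration: report every item that attains the maximum.
    modes = [item for item, c in counts.items() if c == best]
    return best, modes


if __name__ == "__main__":
    data = [1, 2, 2, 3, 3, 1, 2]
    print(range_modes(data, 1, 4))  # range covers 2, 2, 3, 3
    print(range_modes(data, 0, 6))  # whole sequence
```

The enumeration variant returns all items tied for the maximum frequency, which is why the result is a list rather than a single item.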

The Propel Distributed Services Platform (PDSP) is the core software product of Propel, a new Internet infrastructure software company. The PDSP product was created to enable Java developers to architect, implement, deploy, and maintain Internet applications and services much ...

The keyboard is a technology that helps people type on computers. Although keyboards have helped a great deal, typographical errors, that is, mistakes made while typing, remain likely. Earlier research examined the ease and accuracy of using particular keyboard layout designs to write Arabic text. However, in that research it was difficult to find a tool that could help identify errors in written Arabic text, so that process was carried out manually. A system is therefore needed that can find the errors in typed Arabic text and at the same time count the errors made during typing, to support similar research. This research aims to develop such a system. The modeling approach used in this research is incr...

In the unit-cost comparison model, a black box takes two items as input and outputs the result of the comparison. Problems like sorting and searching have been studied in this model, and it has been generalized to include the concept of priced information, where different pairs of items (say database records) have different comparison costs. These comparison costs can be arbitrary (in which case no algorithm can be close to optimal (Charikar et al. STOC 2000)), structured (for example, the comparison cost may depend on the length of the databases (Gupta et al. FOCS 2001)), or stochastic (Angelov et al. LATIN 2008). Motivated by the database setting where the cost depends on the sizes of the items, we consider the problems of sorting and batched predecessor where two non-uniform sets of items A and B are given as input. (1) In the RAM setting, we consider the scenario where both sets have n keys each. The cost to compare two items in A is a, to compare an item of A to an ...
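The priced-information comparison model described in this abstract can be illustrated with a small Python sketch that sorts through a black-box comparator and totals the price of every comparison made. This is only an illustration of the cost model, not the paper's algorithm. The function name `priced_sort` and the size-based cost `len(x) + len(y)` are assumptions chosen for the example; the abstract's actual cost parameters are not reproduced here.

```python
from functools import cmp_to_key


def priced_sort(items, cost):
    """Sort items and return (sorted_list, total_comparison_cost).

    `cost(x, y)` is a caller-supplied (hypothetical) price for comparing
    x with y; with cost = lambda x, y: 1 this is the unit-cost model.
    """
    total = 0

    def compare(x, y):
        nonlocal total
        total += cost(x, y)          # pay for this use of the black box
        return (x > y) - (x < y)     # -1, 0, or 1

    ordered = sorted(items, key=cmp_to_key(compare))
    return ordered, total


if __name__ == "__main__":
    records = ["bb", "a", "ccc", "dddd"]
    # Assumed cost: comparing two records costs the sum of their sizes.
    print(priced_sort(records, lambda x, y: len(x) + len(y)))
    # Unit-cost model: the total is just the number of comparisons.
    print(priced_sort(records, lambda x, y: 1))
```

Keeping the cost function as a parameter lets the same harness model unit, structured, or randomized prices without changing the sorting code.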