Tapas Kanungo's Software Page (original) (raw)
This page has pointer to software packages that I have been involved with in some way. Either I wrote it, or was involved in its design, or I used it a paper that I co-authored. If you have questions or comments regarding my software, please feel free to email me -- see contacts page.
- UMDHMM -- Hidden Markov Model Toolkit
- TRUEVIZ -- Groudtruth and Visualization for Document Images (Chang Ha Lee)
- PSET -- Software for Performance Evaluation of Document Segmentation Algorithms. (Song Mao)
- MVNTEST -- Statistical Hypothesis Testing Software
- KMEANS -- K-Means Clustering (David Mount)
UMDHMM
Hidden Markov Model (HMM) Software: Implementation of Forward-Backward, Viterbi, and Baum-Welch algorithms. The software has been compiled and tested on UNIX platforms (sun solaris, dec osf and linux) and PC NT running the GNU package from Cygnus (has gcc, sh, etc.). A tar file can be found at: (tar file). If you need a zip file: zip file . The README file. Postscript slides for tutorial talks that I gave on HMM. The PDF version of the tutorial.
If for some reason you need the older verison of the code, the files are here -- zip file and shar file
To refer:
- Tapas Kanungo, "UMDHMM: Hidden Markov Model Toolkit," in "Extended Finite State Models of Language," A. Kornai (editor), Cambridge University Press, 1999. http://www.kanungo.com/software/software.html. Also:
- Philip Resnik implemented a neat part-of-speech tagger (tutorial and scripts) using my HMM package.
- Here is a system that uses my HMM package for predicting the toplogy of trans membrane helical protiens:[system] [paper]
- If you want to use the software for bioinformatics see: link.
- If you want to use the software for speech recognition work see (prob. 2): link.
TRUEVIZ
TRUEVIZ: Groundtruth Editing and Visulization Software.
Software was co-developed by my student Chang Ha Lee and Jeff Czorapinski.
Software: trueviz-1_02.tar.gz
Readme: README
Papers:
- C. H. Lee and T. Kanungo, "The Architecture of TRUEVIZ: A GroundTRUth/Metadata Editing and VISualiZing Toolkit," Pattern Recognition, vol. 36, no. 3, pp. 811-825, 2003. Download pdf
- Kanungo, Lee, Czorapinsky, Bella, "TRUEVIZ: A groundtruth/metadata Editing and Visualizing Toolkit for OCR," in Proc. of SPIE Conf. on Document Recognition and Retrieval," San Jose, January 2001. Download gzipped ps
- Tech Report: Lee, Kanungo, gzipped ps . TrueViz was used at National Library of Medicine, to create Rover, which is used in the MEDLINE database creation process.
MVNTEST
My multivariate hypothesis testing software for Gaussian Data. A compressed shar file available from Statlib. A tech report describing the tests and software.
PSET
PSET: A Page Segmentation Evaluation Toolkit. Software written by my student Mao Song. (software/tar file) The README file.
Papers:
- S. Mao and T. Kanungo, "Software Architecture of PSET: A Page Segmentation Evaluation Toolkit, Int. Journal on Document Analysis and Recognition," vol. 4, no. 3, pp. 205-217, 2002. Download pdf
- S. Mao and T. Kanungo, "PSET: A Page Segmentation Evaluation Toolkit," Proc. of Document Analysis Systems, Rio de Janeiro, Brazil, 2000. Download ps.
- A technical report describing the software System. (The journal version above has more current information.)
- A journal paper that uses the PSET package to evaluate various segmentation algorithms
S. Mao and T. Kanungo, "Empirical Performance Evaluation Methodology and its Application to Page Segmentation Algorithms," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 242-256, March 2001. Download pdf - Here is a technical report that describes our experimental results. (The journal paper above is more current.)
K-Means
The K-Means Custering Software was written by David Mount. You can download the code from his software page . Papers associated with this work are:
- T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu, An Efficient k-Means Clustering Algorithm: Analysis and Implementation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 881-892, 2002. Download pdf
- T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu, "A Local Search Approximation Algorithm for k-Means Clustering," Computational Geometry: Theory and Aplications, vol. 28, pp. 89-112, 2004, Download pdf
More coming up. (DDM, DVI2TIF, ESTIM, VALID, etc.)
Publications
My OCR Page
Multilingual OCR Workshop
Back to my home page
Back to my IBM home page