Creating probe packages (original) (raw)

Contents

Overview

This document describes how to create a Bioconductor probe package from the reporter sequence information of a particular chip. Probe packages are a convenient way for distributing and storing the probe sequences and related information.

First, let us load the AnnotationForge package.

library("AnnotationForge")

For Affymetrix genechips

In this section we see how a probe package can be created for Affymetrix genechips from the tabulator-separated sequence files that can be obtained from the vendor (at https://www.thermofisher.com/us/en/home/support.html). As an example, the file HG-U95Av2_probe_tab.gz is provided in the extdatasubdirectory of the AnnotationForge package.

filename <- system.file("extdata", "HG-U95Av2_probe_tab.gz", 
                        package="AnnotationForge")
outdir   <- tempdir()
me       <- "Wolfgang Huber <w.huber@dkfz.de>"
species  <- "Homo_sapiens"
makeProbePackage("HG-U95Av2",
                 datafile   = gzfile(filename, open="r"),
                 outdir     = outdir,
                 maintainer = me,
                 species    = species,
                 version    = "0.0.1")
## Importing the data.
## Creating package in /tmp/Rtmpe37xSn/hgu95av2probe 
## Writing the data.
## Checking the package.
## *** WARNINGS ***
##  * checking data for ASCII and uncompressed saves ... WARNING Status: 1 WARNING, 1 NOTEBuilding the package.
## [1] "hgu95av2probe"
dir(outdir)
## [1] "BiocStyle"     "hgu95av2probe"

For other chiptypes

To deal with different file formats and additional types of probe annotation data from public or in-house databases, the function makeProbePackage offers a great deal of flexibility. The user can specify her own import function through the importfun argument. By default, its value is getProbeDataAffy, a function that reads tabular Affymetrix genechip sequence files. Import functions for other types of arrays can be adapted from this prototype.

The help pages and R code contained in the produced packages are derived from a template directory that obeys the usual R package conventions(1999). The input parameters of an import function are A prototype for such a directory is provided within the package_AnnotationForge_. To facilitate the automated production of large numbers of similar packages, we provide a text substitution mechanism similar to the one used in the GNU configure system.

The output of an import function is a named list with elements

For more details, please refer to the help files for the functionsmakeProbePackage and getProbeDataAffy. For an example, refer to the source code of getProbeDataAffy.

References