Local usage:Input format (original) (raw)

CLARION format overview

CLARION (generiC fiLe formAt foR quantItative cOmparsions of high throughput screeNs) is a data format especially developed to be used with WIlsON, which relies on a tab-delimited table with some metadata to describe the columns. It is based on the Summarized Experiment format and supports all types of data which can be reduced to features (e.g. genes, transcripts, proteins, probes) and their annotation with assigned numerical values (e.g. count, score, log2foldchange, z-score, p-value). The feature annotations (e.g. symbol, GO category, KEGG pathways, etc.) and numerical values can later be used for filtering and plotting. Minimally, a row in such a table has to contain a unique identifier for the feature (e.g. gene accession) and one numerical value. Most result tables derived from RNA-Seq, ChIP/ATAC-Seq, Proteomics, Microarrays, and many other analyses can thus be easily reformatted to become compatible without having to modify the code of WIlsON for each specific experiment.

It is suggested to use a spreadsheet software (e.g. Excel) when performing a manual reformat of the original tab-delimited table (shown in green in the following figure) in order to avoid errors due to shifted columns. In order to become CLARION, two blocks of descriptive information have to be inserted above the original table called header (shown in blue) and metadata (shown in red).

Clarion format example

The format of the three blocks follows distinct structures:

CLARION header

Header parameters

Mandatory columns are marked with an asterisk (*) in the following description.

CLARION header example

CLARION metadata

Information considering the nature of columns of the following original data table. Columns that are not described in the metadata section are ignored by WIlsON.

Metadata columns

Mandatory columns are marked with an asterisk (*) in the following description.

CLARION metadata example

CLARION data

The original tab-delimited data matrix (in this example a modified DeSEQ output format).

CLARION data example

A minimum example

See this image for a cropped version of the example above. This example shows the minimum requirements needed for a valid CLARION file (metadata in red & data in green).

CLARION minimal example