GFF2PS MAIN PAGE: Converting GFF to PostScript. (original) (raw)

Contents

PROGRAM DESCRIPTION

gff2ps - Produces PostScript graphical output from GFF-files.

gff2ps

is a program for visualizing annotations of genomic sequences. The program takes as input the annotated features on a genomic sequence in

GFF

format, and produces a visual output in PostScript. It can be used in a very simple way, because it assumes that the

GFF

file itself carries enough formatting information, but it also allows through a number of options and/or a configuration file, for a great degree of customization.

"General Feature Format" (GFF) is described on the Sanger Centre gff definition page.

We will appreciate if you can cite

gff2ps

paper and/or the URL, as follows:

EXAMPLES
Science cover 291(5507) gff2ps has achieved another genome landmark. The mosquito genome annotation for five chromosome arms (2L, 2R, 3L, 3R and X) has been summarized into a two-sided five-pages foldout included as the figure 1 of "The Genome Sequence of the Malaria Mosquito Anopheles gambiae" [Holt et al. Science 298(5591):129-149 (2002)]. Both foldout sides can be downloaded as PDF files from Science web site at the "Annotation of the Anopheles gambiae genome sequence" poster page. You can also download a printable version of this figure from the following links: 1.5Mb gzipped PostScript file or 2.9Mb gzipped PDF file (±196x114cm).
Science cover 291(5507) We have performed the map of the Human Genome with gff2ps. 22 autosomic, X and Y chromosomes were displayed in a big poster appearing as the figure 1 of "The Sequence of the Human Genome" [Venter et al. Science 291(5507):1304-1351 (2001)]. The single chromosome pictures can be accessed from here to visualize the web version of the "Annotation of the Celera Human Genome Assembly" poster. A printable version of this figure is also available from the following links: 2.7Mb gzipped PostScript file or 10Mb gzipped PDF file (±107x150cm).
Genome Research cover 10(4) We have participated on GASP, which results appeared in Genome Research on 24, April. gff2ps generated the poster summarizing all the submitted predictions of each participating group made on the Adh region 2.9Mbp genomic sequence. All those results are analyzed in "Genome Annotation Assesment in Drosophila melanogaster" [Reese et al. Genome Research 10(4):483-501(2000)]. You can also download a printable version of this figure from the following links: 466Kb gzipped PostScript file or 2.2Mb gzipped PDF file (±99x59cm).
Science cover 287(5461) gff2ps was used to obtain the plots for each chromosome arm of Drosophila melanogaster (X, 2L, 2R, 3L, 3R and 4) appearing in the "Coding content of the fly genome" figure, included as a poster in "The Genome Sequence of Drosophila melanogaster" [Adams et al. Science 287(5461):2185-2195(2000)]. A single-page one-side printable version of this figure is also available from the following links: 1.1Kb gzipped PostScript file or 1.7Mb gzipped PDF file (±100x55cm).
ISMB'99 GASP1 tutorial A previous version of gff2ps (0.90) was used to generate the three B0 pages poster for ISMB'99 tutorial #3 on "Drosophila melanogaster ADH region annotation experiment (GASP1)". Each page shows 1 Mb of sequence, splited on four blocks containing all the genomic predictions submitted to that experiment. You can download here the three panelsposter or view some of our pictures from ISMB'99 meeting held in Heidelberg in August, 1999. From this link you can see the last version of the Adh Poster, and you can learn how to get posters with gff2ps.
GFF2PS Snapshots

You can find several examples of the flexibility of gff2ps in those of the following references which include the figure numbers where the program was used.

Do you think we have forgotten any citation ? Do not hesitate to send an email with the corresponding citation to authors and we will include it here...

HOWTOs

In this section you can find usefull tutorials on how to use gff2ps. It will be regularly updated with new documents.

You are welcome to provide more examples on how did you use gff2ps in your projects, by sending your report files or a link to your own html report/howto. Your experience will be valuable for other users, mostly for newer ones. Send an email to authors, we will try to include here your contribution as soon as possible.

NEWS

| | 27 May 2003 | v0.98 | Minor changes (upgrading to v0.98d): + A bug when setting zoom area from command-line has been fixed. Thanks to Massimo Vergassola, who noticed us that issue. + Rainbow fill for shapes now depends on feature scores, so color is computed from that GFF field in a continuous color gradient (violet-blue-cyan-green-yellow-orange-red, from lower up to higher scores respectively). + PostScript variables defining offsets for axes coordinates have been made global. | | | -------------- | -------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | | | 08 Nov 2002 | v0.98 | Minor changes (upgrading to v0.98c): + Custom file "zoom" option now is not overridden by command-line zoom. + Sorting overlapping features for PS layers has been reviewed. | | | | 04 Oct 2002 | v0.98 | gff2ps was used on the visualization of the malaria mosquito genome annotation appearing in Science. Up to 278Mbp length, five chromosome arms (2L, 2R, 3L, 3R and X), scaffolds mapped on those chromosomes, a genic set close to 14000 genes, chromosome level polymorphisms, gene expression levels data, homology to Drosophila, SNPs density and G+C content, were integrated into this figure. | | | | 17 Jul 2002 | v0.98 | Minor changes (upgrading to v0.98b): + Fixing a silly mistake on landscape margins definition in PS code for v0.98, that was making the PS files to crash in that mode. + Group color filling gawk function was also checked to be able to set group shape color properly for the PS code. | | | | 04 Jul 2002 | v0.98 | v0.98 has been released, it includes few small fixes. Thanks to Gengxin Chen, who pointed that the regular expresion we were using for parsing the strand field from the GFF records will fail on some systems. Nature is publishing the paper describing the sequence and analysis of the chromosome 2 of Dictyostelium discoideum. gff2ps was used by the Genome Sequencing Center at IMB-Jena to produce the set of figures (each of them showing 100Kb of annotated sequence) for the initial for the chromosome 2 map web page, which were provided as supplementary material. | | | | 25 Sep 2001 | v0.97 | Two howto documents has been released for gff2ps. v0.97b is now available in our download section, but newest features are not yet documented. | | | | 16 Feb 2001 | v0.97 | gff2ps was used to plot the Human Genome annotation appearing in Science. The total sequence length was of 2.90Gb, distributed among the 24 chromosomes (22 autosomic, X and Y). v0.97 will be available in our download section soon. | | | | 24 Mar 2000 | v0.94 | gff2ps was used to plot the Drosophila melanogaster whole genome annotation appearing in Science. The total sequence length was of 120Mb, distributed among the six chromosome arms of the four D. melanogaster chromosomes. | | | | 25 Jan 2000 | v0.94 | New variable switch to fit feature drawings in the full width source tracks or to split track width, half for drawings and half for feature labels. Solved a problem with tickmark scale when defining less nucleotides per page than 1000. Source labels can now be disabled individually. | | | | 17 Jan 2000 | v0.94 | A bug for input filename checking fixed. Vertical pagination had an error on a function counter which is also checked. | | | | 10 Jan 2000 | v0.94 | gff2ps WEB SERVER is now available at Institut Pasteur. A warning for an old variable not used does not appear. | | | | 05 Jan 2000 | v0.94 | Fixes a bug on multiple sequences/sources sorting. Complete "User's Manual" comes with the program (Appendix figures were improved). | | | | 03 Jan 2000 | v0.93 | Official ANNOUNCEMENT of first fully operative version of gff2ps. A previous version bug on overlapping groups distribution on multiple lines is fixed. Page layout for vertical pagination is also adjusted. Standard error reports structure improved. | | | | 23 Dec 1999 | v0.92 | Finishing the web-pages for the program (this one, the examples page and the ISMB'99 page). Main GNUawk script is now included onto Main Shell script. Program can handle input from standard input. Timing section added to standard error reports. Some enhancements on PostScript prolog section. This code also ends with empty stack. | | | | 27 Sep 1999 | v0.91 | Two main pagination functions: one to fit all the source tracks in one physical page, the other one to fix track size and split output in many vertical pages as needed. PostScript header and prolog variables now are defined within the Main GNUawk script. First draft of "User's Manual". | | | | 03 Aug 1999 | v0.90 | This gff2ps version was used to produce the ADH poster for the ISMB'99 meeting. The program is based on two scripts: Shell (gff2ps) and GNUawk (gff2ps.awk). Defined environment variables to set gff2ps.awk directory, the default custom file, and the path for custom files. | |

DOWNLOADING

Download from here the latest versions of the user's manual (v0.96) and of the gff2ps program (v0.98l). You must replace from the script the paths for bash and gawk to the ones defined in your system on the following two lines:

| | #!/your/bin/dir/bash GAWK="/your/bin/dir/gawk"; | | | -------------------------------------------------- | |

Due to the intensive usage of associative arrays by our program, we recommend to work with GNU awk version greater than 3.0. Also ensure that you have bourne shell in the "/bin/sh", although we recommend to use bash, version 2 or greater.

You can find in our ftp server a gzipped tarball containing the Scripts for

gff2ps version 0.94

, a README text file, a PostScript Manual and few example drawings. Download this file only if you are interested on the whole example set, the newest versions of the program and the manual are being updated separately and provided as gziped files.

Once you have downloaded files, you can extract them with the following commands, depending on its file extension:

`*.gz' gunzip *.gz
`*.tar.gz' gunzip -c *.tar.gz | tar xvf - On Linux you can try with: tar zxvf *.tar.gz
REPORTING BUGS

If you find any bug or something is not plotted properly, you can send a bug report. To easily find what's wrong, you should attach to that e-mail a tarball containing the custom file you were using when the bug ocurred, an example of your input GFF files, the PostScript file generated and a report file that you can get with the "-V" command-line option (type "gff2ps -h" for further info on that verbose mode switch). We will try to answer as soon as possible.

WEB SERVER

Thanks to Catherine Letondal from Institut Pasteur for providing us of a web server for the gff2ps.

That server was made with PISE, a program developed by Catherine to generate web interfaces for molecular biology programs. We should install a mirror in our server as soon as possible.

FEATURE LIST

The following list shows many features of gff2ps.

WISH LIST

Although we have implemented many features, there are some ideas to be added before releasing version 1.0 of gff2ps.
Here is a short list:

We are open to any helpful suggestion for improving our program. Do not hesitate to get in touch with us.

AUTHORS

Josep Francesc ABRIL FERRANDO
Roderic GUIGÓ SERRA

CopyRight © 1999 - 2003

gff2ps is under GNU General Public License.