Documentation read from 04/17/2019 22:07:26 version of /vol/public-pseed/FIGdisk/FIG/bin/svr_genome_statistics.

svr_genome_statistics

svr_genome_statistics

    svr_genome_statistics fld1 fld2 ... fldN <genome_ids.tbl >genome_data.tbl

Get one or more pieces of data about each specified genome.

This script takes as input a tab-delimited file with genome IDs at the end of each line. For each genome ID, one or more selected data items are appended to each line.

This is a pipe command: the input is taken from the standard input and the output is to the standard output.

The data items are specified as positional parameters on the command line, and are appended in the order specified to the output lines. The permissible data items are as follows.

complete

1 if the genome is more or less complete, else 0.

contigs

The number of contigs for the genome

dna-size

The number of base pairs in the genome

domain

The domain of the genome (Archaea, Bacteria, ...).

genetic-code

The genetic code used by this genome.

pegs

The number of protein encoding genes in the genome.

rnas

The number of RNAs in the genome.

name

The scientific name of the genome.

Command-Line Options

url

The URL for the Sapling server, if it is to be different from the default.

c

Column index. If specified, indicates that the input IDs should be taken from the indicated column instead of the last column. The first column is column 1.