Documentation read from 04/17/2019 22:07:26 version of /vol/public-pseed/FIGdisk/FIG/bin/svr_get_coupling_data.

svr_get_coupling_data [-g genome] -d CouplingDirectory

svr_get_coupling_data [-g genome] -d CouplingDirectory

Get functional coupling data for genes in a genome

------

Example:

    svr_get_coupling_data -d 83333.1.Coupling -g 83333.1

Would build a directory containing the following files:

    functionally.coupled                   - a 3-column table giving basic fc scores
    functionally.coupled.hypos             - a 4-column table [hypo-PEG,fc-score,nonHypo-PEG,nonHypo-function]
    functionally.coupled.hypos.by.peg      - functionally.coupled.hypos sorted by PEG

    expression.coupled                     - a 3-column table giving basic fc scores (Pearson Correlation coefficients)
    expression.coupled.hypos               - a 4-column table [hypo-PEG,fc-score,nonHypo-PEG,nonHypo-function]
    expression.coupled.hypos.by.peg        - expression.coupled.hypos sorted by PEG

------

The genome is normally taken from the command line. If it is not, it is read from STDIN and a separate output directory is constructed for each genome ID.

Normally, the last field in each line would contain the genome ID. If some other column contains the genome IDs, use

    -c N

where N is the column (from 1).

This is a pipe command. The input is taken from the standard input, and the output is to the standard output.

Command-Line Options

-g GenomeID

This is normally used to get the genome ID, and in this case output for a single genome is constructed. If it is not used, genome IDs are read from STDIN.

-c Column

This is used only if there is no -g parameter and the column containing genome IDs is not the last.

-d CouplingDirectory

If -g Genome is used, this directory gets built and will contain the data files for the genome. If not, then this directory gets built. Subdirectories (named the genome ID) will get the output files for each genome named in the input.

Output Format

If -g Genome is used, the output directory will contain the files for the single genome. If not, the output directory will have a subdirectory for each genome specified in STDIN (named by the genome ID).