Optimizing Substitution Matrix by Separating Score Distributions

Paper

[Download PDF from OUP] (Personal research/study purposes only.)

Download Source

The program package used for our experiments is available from here.
The previous version of this package is here(Jul 11 2002).
This software requires only a standard unix platform.

Compile

First, extract the source.
$ gzip -cd optssd-3.0.1.tar.gz | tar xvf -
To compile this program, go to the extracted directory
$ cd optssd-3.0.1
and type as follows:
$ ./configure
Then, edit src/Makefile.common if required.
To compile the source, type
$ make
Load modules will be in 'bin' directory.

Usage

To execute optssd, one must prepare

Optssd Setting File1

Default filename is input.prj.
An example is here. The format will be clear.

Optssd Setting File2

Second file must be pointed by setting UDOptions in the file above.
An example is here.

Some score parameter files (substitution matrices) are in 'scr' directory.
And some databases are also available from here.

Then, type the command,

$ bin/optssd [input.prj]

Output Files

optssd.log Experimental environment.
itnnn.scr A score function on nnnth optimization step. It can be converted to the popular format by "scutil -B" command.
scr.log The log of objective function and the minimum number of errors
*.as scores of alignments
cgdata.*.log EVD parameters for each group

Databases and Optimized Matrices

DatabaseClassified Sequence FilesGroup FileOptimized Matrix
COGCOGs.tar.gzCOG.grpCOGOPT.mat
PfamPfam.tar.gzPfam.grpPFAMOPT.mat
SCOP 40%IDscop1.61-40.tar.gzscop1.61-40.grpSCOP40OPT.mat
SCOP 95%IDscop1.61-95.tar.gzscop1.61-95.grpSCOP95OPT.mat

Links


hourai@is.s.u-tokyo.ac.jp