Dear GCGers:
I recently downloaded the FASTA-format flat file TB.dbs from the
Sanger Centre database. While running FROMFASTA on it, I noticed that there were
several occurrences of the same name given to two or more adjacent
sequences of different lengths, presumably non-assembled sequences from
the same cosmid. On Unix systems this results in all but the last
sequence being lost, presumably because each output file overwrites
the previous one of the same name. Does anyone have a workaround for
this other than hand-editing the titles in the 4 MB flat file?
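
For what it is worth, the renaming itself could probably be scripted rather than done by hand. Below is a minimal sketch in Python, not a tested GCG utility. It assumes that the sequence name is the first whitespace-delimited word after the ">" on each title line, and that appending _2, _3, ... to repeated names is acceptable. The function name uniquify_fasta_names is only illustrative.

```python
def uniquify_fasta_names(lines):
    """Yield FASTA lines, renaming repeated sequence names.

    Assumption: the name is the first word after '>' on a title line.
    The first occurrence keeps its name; repeats become name_2, name_3, ...
    """
    counts = {}   # base name -> highest suffix tried so far
    used = set()  # every name already written out
    for line in lines:
        if line.startswith(">"):
            parts = line[1:].split(None, 1)
            name = parts[0] if parts else "unnamed"
            rest = parts[1].rstrip("\r\n") if len(parts) > 1 else ""
            new_name = name
            # Keep bumping the suffix until the name has not been used,
            # so a renamed entry cannot collide with an existing one.
            while new_name in used:
                counts[name] = counts.get(name, 1) + 1
                new_name = f"{name}_{counts[name]}"
            used.add(new_name)
            line = ">" + new_name + (" " + rest if rest else "") + "\n"
        yield line
```

Reading TB.dbs line by line through this function and writing the result to a new file should leave the sequence data untouched and change only the repeated titles, so FROMFASTA would then see a distinct name for every entry.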
Thanks,
******************************************************************************
Paul H. Roy                      Phone:  +1 418 654 2705
Departement de biochimie, FSG    FAX:    +1 418 654 2715
Universite Laval                 E-mail: proy at rsvs.ulaval.ca
Quebec, QC G1K 7P4
CANADA
******************************************************************************