NCBI UniGene files

Francis Durst francis.durst at bota-ulpnospam.u-strasbg.fr
Wed Jan 31 13:13:34 EST 2001

In article <3A776EF3.642CDBDC at staff.usyd.edu.au> Bill Blackhall <b.blackhall at staff.usyd.edu.au> writes:
>From: Bill Blackhall <b.blackhall at staff.usyd.edu.au>
>Subject: Re: NCBI UniGene files
>Date: Wed, 31 Jan 2001 12:48:36 +1100

>The NCBI files have a file extension of .cgi (what that means, I have no
>idea). They appear to be simple text files with each EST within them in
>fasta format. Each EST begins on a new line with the > symbol, then some
>text, and then the sequence starting on a new line. There is no trace
>data associated with them. Some of the files contain 100 or more ESTs,
>so copying and pasting into separate files could get tedious.

You may use Seqverter (www.genestudio.com/seqverter.htm) to
split multi-sequence fasta files into individual files (also fasta).

All the best

PS mine runs on Winnt 4, don't know about other OS

More information about the Staden mailing list

Send comments to us at biosci-help [At] net.bio.net