Sequin 2.45 now available

francis at NCBI.NLM.NIH.GOV francis at NCBI.NLM.NIH.GOV
Wed Mar 11 12:54:04 EST 1998

Dear Sequin users,

We have recently released a new version of Sequin, the sequence
submission/editing tool from NCBI, for all platforms except Linux
and SunOS.  These versions will be available shortly.

The current version of Sequin is now 2.45

Please refer to the Sequin home page at:


for the latest developments, new questions in the Frequently Asked
Questions section, a more detailed view of the Sequin Revision History,
and the most recent version of the help documentation.

A new version of the Sequin Quick Guide is available at:


Major changes for Sequin version 2.45

Summary of new features and changes:

1.  Easier sequence annotations
2.  New PowerBLAST features
3.  Replacing or updating your sequence
4.  Contact information for future submissions
5.  Formatting segmented population/phylogenetic sets
6.  Creating automatic definition lines
7.  Encoding new information in definition line
8.  Direct submission information
9.  Minor changes

1. Easier sequence annotations

    You can now use the Sequence Editor Feature menu, as well as the
    main Sequin Annotate menu, to annotate features on the sequence.
    The features listed are identical, and the instructions for adding
    them are the same, with one exception. If you annotate them in the
    Annotate menu, you must provide the nucleotide sequence location of
    the feature.  However, if you add features from the Sequence
    Editor, you do not need to enter the nucleotide coordinates
    manually. Simply highlight the sequence which the feature covers,
    and the location of the sequence will be automatically entered in
    the feature location box.

2. New PowerBLAST features

    PowerBLAST capabilities have been enhanced.  When you do a
    PowerBLAST from within Sequin, you can limit a search either for or
    against an organism or taxonomic group. Under Organism Filter,
    click on "Restrict to" to limit your search to a particular
    organism. Or, conversely, click on "Filter against" to search
    against all organisms except one.  Type the scientific name of the
    organism (e.g., Homo Sapiens) or taxonomic group (e.g., Mammalia)
    in the "Name" box.  After you do a PowerBLAST search, additional
    controls will be added to the bottom of the record viewer window.
    These controls allow you to retrieve the PowerBLAST hits from
    Entrez, and then look for Entrez neighbors. Use the alignment
    pop-up to select the type of alignment (search) that was performed.
    If multiple blast search types were run in one PowerBLAST search,
    this allows you to get one type at a time.  Then click on the
    Retrieve button to retrieve the records in a document summary
    window, where you can view Medline, Protein, Nucleotide, Structure,
    and Genome neighbors of the sequence(s). Click on the Refine button
    to open a query refinement window in which you can further refine
    the PowerBLAST hits by selecting other Entrez terms, such as Author
    name to view sequences belonging to a specific author.

3. Replacing or updating your sequence

    We had previously explained that you can now replace or merge the
    sequence in the record with a new sequence without going into the
    Sequence Editor. This option is available in the Update Sequence
    submenu of the main Sequin Edit menu.  In addition to being able import
    a sequence in FASTA format (Read FASTA File), import a sequence record
    in ASN.1 format (Read Sequence Record), or download a sequence record
    from Entrez (Download Accession), you can now import a sequence from a
    Sequin PowerBLAST alignment (Selected Alignment). Note that in all
    cases, both the target and the imported sequence must be nucleotide
    sequences.  The alignment between your original sequence and the
    imported sequence can be viewed in a separate window.  You can then
    choose to merge the 5' or 3' end of the imported sequence with the
    target sequence in the record, or replace the target sequence with the
    imported sequence. The features on the imported sequence  will be
    automatically copied to the original sequence. You can also choose only
    to propagate features from the imported sequence record to the target.

4. Contact information for future submissions

    The contact, authors, and affiliation information you provide on
    the Submitting Authors form can now be saved as a block and used
    for subsequent submissions.  For your first Sequin submission, fill
    in the requested information.  Then, in the record viewer, click on
    "Edit Submitter Info" under the Edit menu, and then on Export
    Submitter Info under the File menu in the resulting Submission
    Instructions form.  For subsequent Sequin submissions, click on
    Import Submitter Info on the first page of the Submitting Authors
    form.  You must still fill in the manuscript title on the this
    page, though.

5. Formatting segmented population/phylogenetic sets

    Sequin can now read segmented sets of sequences which are parts of
    phylogenetic, population, or mutation studies.  A segmented set is a
    colllection of non-overlapping sequences which cover a specified
    genetic region, such as a set of exons along with fragments of flanking
    introns.  The sequences must be in FASTA or FASTA+GAP format.  Each
    segment should have its own sequence identifier (the term immediately
    following the ">", but organism name and source modifiers should only
    be indicated for the first segment from each sequence.  Square brackets
    are used to delimit the members of a set.  For example:

    >bioseq1part1 [org=Mus musculus] [strain=BALB/c]
    >bioseq2part1 [org=Rattus norvegicus] [strain=Sprague-Dawley]

6. Creating automatic definition lines

    Sequin can now create definition lines (sequence titles)
    automatically based on information provided in the record.  This
    option works for single sequences as well as sets of sequences, and
    can handle complex annotations with multiple features.  The
    definition lines will follow standard GenBank conventions.  Use the
    function "Generate Definition Line" under the Sequin Annotate

7. Encoding new information in definition line

    If you are submitting the sequence for an organism which is not
    present in the NCBI taxonomy database, you can indicate the lineage
    of the organism on the first line (definition line) of your
    FASTA-formatted nucleotide sequence.  Use the modifier
    [lineage=lineage] on the line where other modifiers are indicated.
    For example,
    >dna1 [org=Neworganism] [strain=A] [lineage=Newlineage]

    For information about the organisms presently in GenBank, see the
    NCBI taxonomy browser at http://www.ncbi.nlm.nih.gov/Taxonomy/

    Additional source information can also be encoded directly in the
    definition line.  You can now indicate
    extrachromosomal,plasmid,transposon,insertion sequence,cyanelle,proviral,
    virion}] and [molecule={dna,rna}].  In each case you can pick one item from
    the list in {}, so a sample definition line could be:
    >dna1 [org=Homo sapiens] [location=genomic] [molecule=dna]

8. Direct submission information

    The DIRECT SUBMISSION reference for your new submission will now
    appear as it will once the record is released to the public.  In
    the past, this information was stored by Sequin, but not
    displayed.  This citation lists the authors who should recieve
    scientific credit for the sequence, and may have as many authors as
    you see fit.  It should have, at the very least, one author.
    Author names are initially entered on the Submitting Authors form.
    You can modify the list by double clicking on the reference.

9.  Minor changes

    Reference publication types now include Proceedings (meetings) and
    Proceedings Chapter (meeting abstracts).

    When you highlight a range of sequence in the Sequence Editor, the
    selected sequence is shown as a box in the Graphic view of the

    An option called "Select Target" was added to the Sequin Search
    menu.  This option changes the sequence which is selected in the
    Target Sequence pop-up.

Please do not hesitate to let us know  (info at ncbi.nlm.nih.gov)
if you have any questions or comments.

francis, for the sequin development team.

| B.F. Francis Ouellette
| GenBank Coordinator
| francis at ncbi.nlm.nih.gov

More information about the Bio-soft mailing list

Send comments to us at biosci-help [At] net.bio.net