IUBio

fetch foul-up

Chris Botka botka at CGL.UCSF.EDU
Wed Jul 16 23:52:39 EST 1997


I fetched(gcg9) the following accession from the genbank database: =
AA390074.  The sequence in the record fetch returns is incorrect, =
however the rest of the record IS correct.  Below are two copies =
of the genbank record, one from retrieve at ncbi and one fetched from =
the gb_est11 GCG formatted database.  I looked at the flat file =
(gb_est11.seq - created when formatting the GCG database) and the =
sequence there is identical to the one sent back by retrieve.  I =
grep'd flat file (gb_est11.seq) with the first 30 nt of the =
sequence returned by fetch and did not get a string match.  I also =
grep'd the other est flat files for the 1st 20 nt and did not get =
a match.   Also, when I search the dbest with the sequence =
returned by fetch there is no exact match, but there is an exact =
match to the sequence returned by retrieve.  Has anyone heard of =
this happening before?  I do not think that I have made any errors =
converting the GenBank db to GCG.  Any ideas??

Thanks,

Chris

>Subject: Results-RETRIEVE Server:
>Reply-To: Retrieve Server <retrieve at ncbi.nlm.nih.gov>
>From: RETRIEVE E-Mail Server <retrieve at ncbi.nlm.nih.gov>
>Status: RO
>
=
>=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D
>To Obtain Help Documentation:  send e-mail to =
'retrieve at ncbi.nlm.nih.gov'
>   with the word 'help' in the body of the mail message.
>
>Note:  GenBank retrieval and submission tools are available =
through
>   the World Wide Web at the URL:  <http://www.ncbi.nlm.nih.gov/  =
For more
>   information contact User Services at:  info at ncbi.nlm.nih.gov
=
>=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D
>
>
>
>Database: GenBank Updates (101.0+, 07/16/97)
>Query:  "aa390074" [acc] OR "aa390074" [loc]
>Parse status: OK: 0 documents retrieved.
>//
>Database: GenBank (101.0, 6/14/97)
>Query:  "aa390074" [acc] OR "aa390074" [loc]
>Parse status: OK: 1 document retrieved.
>Documents selected: 1-1  (up to 1000 lines)
>
>LOCUS       AA390074      393 bp    mRNA            EST       =
23-APR-1997
>DEFINITION  vb44f12.r1 Soares mouse lymph node NbMLN Mus musculus =
cDNA clone
>            751823 5'.
>ACCESSION   AA390074
>NID         g2043028
>KEYWORDS    EST.
>SOURCE      house mouse.
>  ORGANISM  Mus musculus
>            Eukaryotae; mitochondrial eukaryotes; Metazoa; =
Chordata;
>            Vertebrata; Mammalia; Eutheria; Rodentia; =
Sciurognathi; Muridae;
>            Murinae; Mus.
>REFERENCE   1  (bases 1 to 393)
>  AUTHORS   Marra,M., Hillier,L., Allen,M., Bowles,M., =
Dietrich,N., Dubuque,T.,
>            Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., =
Morris,M.,
>            Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., =
Moore,B.,
>            Theising,B., Wylie,T., Lennon,G., Soares,B., =
Wilson,R. and
>            Waterston,R.
>  TITLE     The WashU-HHMI Mouse EST Project
>  JOURNAL   Unpublished (1996)
>COMMENT
>
>            Contact: Marra M/Mouse EST Project
>            WashU-HHMI Mouse EST Project
>            Washington University School of MedicineP
>            4444 Forest Park Parkway, Box 8501, St. Louis, MO =
63108
>            Tel: 314 286 1800
>            Fax: 314 286 1810
>            Email: mouseest at watson.wustl.edu
>            This clone is available royalty-free through LLNL ; =
contact the
>            IMAGE Consortium (info at image.llnl.gov) for further =
information.
>            MGI:460807
>            Seq primer: -28m13 rev2 ET from Amersham
>            High quality sequence stop: 324.
>FEATURES               Location/Qualifiers
>     source          1..393
>                     /organism=3D"Mus musculus"
>                     /strain=3D"C57BL/6J"
>                     /note=3D"Vector: pT7T3D-Pac (Pharmacia) with =
a modified
>                     polylinker; Site_1: Not I; Site_2: Eco RI; =
1st strand cDNA
>                     was primed with a Not I - oligo(dT) primer =
[5'
>                     =
TGTTACCAATCTGAAGTGGGAGCGGCCGCGATACTTTTTTTTTTTTTTTTTTTTTTTT
>                     3']; double-stranded cDNA was ligated to Eco =
RI adaptors
>                     (Pharmacia), digested with Not I and cloned =
into the Not I
>                     and Eco RI sites of the modified pT7T3 =
vector. RNA
>                     provided by Dr. Bertrand Jordan. Library =
constructed and
>                     normalized by Bento Soares and M.Fatima =
Bonaldo."
>                     /clone=3D"751823"
>                     /clone_lib=3D"Soares mouse lymph node NbMLN"
>                     /sex=3D"male"
>                     /dev_stage=3D"4 weeks"
>                     /lab_host=3D"DH10B"
>     mRNA            <1..>393
>BASE COUNT      144 a     63 c     70 g    116 t
>ORIGIN
>        1 acattgaatt catatgttct caccttttta aaaatttcca tacaacttga =
gaattaggta
>       61 agttcctttg tatacaaatt tcagcagcta ggatatgtca cccttcagta =
ctacaaagta
>      121 cagaaattcc tagaatataa tcattctcac agaatagata ccaagaaaaa =
gctatctatt
>      181 aggtttctaa ttccaggacc tagatagaca agtacacttt taattgaggc =
caaatgatga
>      241 gccacaatgg taacccataa ctaaatagag gacaaagaac ataagtatga =
tgcaaaatga
>      301 gaggtaggat gtgtagcagg attccaaatt ttagcttaat tgtctttgac =
aatggactat
>      361 ttggagaaat gaatgaatag gttgtggctg tca
>//
>
botka at socrates.8% more aa390074.gb_est11
LOCUS       AA390074      393 bp    mRNA            EST       =
23-APR-1997
DEFINITION  vb44f12.r1 Soares mouse lymph node NbMLN Mus musculus =
cDNA clone
            751823 5'.
ACCESSION   AA390074
NID         g2043028
KEYWORDS    EST.
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; =
Chordata;
            Vertebrata; Mammalia; Eutheria; Rodentia; =
Sciurognathi; Muridae;
            Murinae; Mus.
REFERENCE   1  (bases 1 to 393)
  AUTHORS   Marra,M., Hillier,L., Allen,M., Bowles,M., =
Dietrich,N., Dubuque,T.,
            Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., =
Morris,M.,
            Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., =
Moore,B.,
            Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. =
and
            Waterston,R.
  TITLE     The WashU-HHMI Mouse EST Project
  JOURNAL   Unpublished (1996)
COMMENT
            Contact: Marra M/Mouse EST Project
            WashU-HHMI Mouse EST Project
            Washington University School of MedicineP
            4444 Forest Park Parkway, Box 8501, St. Louis, MO =
63108
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest at watson.wustl.edu
            This clone is available royalty-free through LLNL ; =
contact the
            IMAGE Consortium (info at image.llnl.gov) for further =
information.
            MGI:460807
            Seq primer: -28m13 rev2 ET from Amersham
            High quality sequence stop: 324.
FEATURES             Location/Qualifiers
     source          1. .393
                     /organism=3D"Mus musculus"
                     /strain=3D"C57BL/6J"
                     /note=3D"Vector: pT7T3D-Pac (Pharmacia) with =
a modified
                     polylinker; Site_1: Not I; Site_2: Eco RI; =
1st strand cDNA
                     was primed with a Not I - oligo(dT) primer =
[5'
                     =
TGTTACCAATCTGAAGTGGGAGCGGCCGCGATACTTTTTTTTTTTTTTTTTTTTTTTT
                     3']; double-stranded cDNA was ligated to Eco =
RI adaptors
                     (Pharmacia), digested with Not I and cloned =
into the Not I
                     and Eco RI sites of the modified pT7T3 =
vector. RNA
                     provided by Dr. Bertrand Jordan. Library =
constructed and
                     normalized by Bento Soares and M.Fatima =
Bonaldo."
                     /clone=3D"751823"
                     /clone_lib=3D"Soares mouse lymph node NbMLN"
                     /sex=3D"male"
                     /dev_stage=3D"4 weeks"
                     /lab_host=3D"DH10B"
     mRNA            <1. .>393
BASE COUNT      144 a     63 c     70 g    116 t
ORIGIN

  AA390074  Length: 393  July 16, 1997 17:19  Type: N  Check: 6583 =
 ..

       1  TCCTTCCTTC CTTCTGTCTG TCCTTCTGTC CTTCCTTCCT TCTGTCTGTT=20=


      51  TCTCTGTCTG TCCGTCTGTC CTTCTGTTTC TTTCTTTCTC CGTTTCTCCG=20=


     101  TCCGTTTCTC TGTTTCTCCT TCCGTTTCTC CTTCCGTTTC TCTGTCTGTT=20=


     151  TCTCCGTCCG TCCTTTTCTC CTTTTCTTTC TTTCTTTCTT TCTCCTTCCT=20=


     201  TCCTTTTCTC TGTCCTTTTC TCTGTTTCTT TCTTTCTCTG CCAATTTCTT=20=


     251  TCTCCGTCCT TCCTTTTCTC CTTTTCTCCT CCAACGGATA TGTAATTGGC=20=


     301  CGCACGCCCG CGCGTGCGTC CGATCGTGTG GCTATGTACA TGGCTCCTTC=20=


     351  CTCGCGCGAC CGTTCGCTCG TTCGACTGGC TCCTTCCTCG CGC
______
Christopher W Botka					Phone: (415)476-5379
Sequence Analysis Consulting Service	Fax: (415)502-1755
UCSF Computer Graphics Lab			Office: Lib-111
Internet: botka at cgl.ucsf.edu				Mail Drop: Bx 0446
http://www.cgl.ucsf.edu/sacs/			NeXT/MIME mail OK



More information about the Info-gcg mailing list

Send comments to us at biosci-help [At] net.bio.net