I found another entry, X62695, which DOES have nice joins to describe the
mRNA. Perhaps this means that particular entries (including X16277) simply
haven't been updated to newer standards. If so, the question is: are the
GenBank folks going through these older entries and reformatting them to
the new standard?
X62695 looks pretty good in general. There is only one
note:
     unsure          18054
                     /note="replace(18054,'g')"
which shouldn't have been a free-text note, because it could be expressed
with a data type that a machine can process.
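
To illustrate the point, here is a minimal Python sketch of what a program
has to do today to recover that information from the note text. The function
name parse_replace_note and the regular expression are assumptions made for
this example, based only on the single note shown above; they are not part
of any GenBank specification or tool.

```python
import re

# Assumed pattern, inferred from the one example note: replace(<position>,'<bases>')
REPLACE_NOTE = re.compile(r"replace\(\s*(\d+)\s*,\s*'([A-Za-z]*)'\s*\)")

def parse_replace_note(note):
    """Return (position, bases) if the note holds a replace(...) expression, else None."""
    match = REPLACE_NOTE.search(note)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)

# The qualifier line as it appears in X62695.
sample = '/note="replace(18054,\'g\')"'
print(parse_replace_note(sample))  # (18054, 'g')
```

Because the note is free text, any variation in spelling or spacing that the
pattern does not anticipate would be silently missed, which is the argument
for a proper machine-readable field.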
The other odd thing is the labels for the introns:
     intron          14286..15708
                     /label=IntronK
Is it a convention now to smash the K against the word Intron? Can a label
have spaces in it (in which case it needs quote marks)? Is this documented?
Also, why are the exons given like this:
     exon            15709..15855
                     /number=12
In other words, introns get labels, but exons get numbers! Why? Code to get
at this information has to be smart enough to look for either one. Why not
simply label the exon? What happens if someone discovers a new intron
somewhere in the middle and labels it 12a?
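
As a rough sketch of the extra work involved, the Python below shows the kind
of fallback logic a program needs in order to name a feature that may carry
either qualifier. The function feature_name, the dictionary representation of
the qualifiers, and the "Exon12" naming style are all assumptions made for
this example; they do not come from GenBank documentation.

```python
def feature_name(key, qualifiers):
    """Return a display name for a feature, or None if it has neither qualifier.

    key        -- the feature key, e.g. "intron" or "exon"
    qualifiers -- dict of qualifier name to value, e.g. {"number": "12"}
    """
    # Prefer an explicit label, as on the introns in X62695.
    if "label" in qualifiers:
        return qualifiers["label"]
    # Otherwise fall back to the number, as on the exons. The value is kept
    # as a string so that something like "12a" would still work.
    if "number" in qualifiers:
        return key.capitalize() + str(qualifiers["number"])
    return None

print(feature_name("intron", {"label": "IntronK"}))  # IntronK
print(feature_name("exon", {"number": "12"}))        # Exon12
```

If both feature types used the same qualifier, the second branch would not be
needed at all.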
Tom Schneider
National Cancer Institute
Laboratory of Mathematical Biology
Frederick, Maryland 21702-1201
toms at ncifcrf.gov