In article <6vecvf$r3p$1 at mark.ucdavis.edu>,
Erica Seitz <emseitz at ucdavis.edu> wrote:
>Does anyone know the base composition (AT/GC content) of the S. cerevisiae
>genome, or where to find this information?
For the published genome sequence, the G+C content (GC%) is
about 38.3%. More precisely:
Length of sequence is 12069303 bp:
Nucleotide count:
A = 3729537
T = 3717515
C = 2313352
G = 2308899
GC fraction = 0.382975802330922 (i.e. 38.30%)
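
For anyone who wants to check the arithmetic, here is a minimal Python
sketch that recomputes the total length and the G+C fraction from the
four counts above. The function name gc_fraction and the dict layout are
my own choices for illustration, not the script that produced the
figures in this post.

```python
def gc_fraction(counts):
    """Return (G + C) / (A + C + G + T) for a dict of base counts."""
    total = sum(counts[base] for base in "ACGT")
    return (counts["G"] + counts["C"]) / total


# Whole-genome counts as quoted above.
genome = {"A": 3729537, "T": 3717515, "C": 2313352, "G": 2308899}

genome_length = sum(genome.values())
print(genome_length)                         # 12069303
print(f"{100 * gc_fraction(genome):.2f}%")   # 38.30%
```

The same function can be applied unchanged to the ORF and non-coding
counts.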
If you consider coding and non-coding regions separately:
For ORFs...
Length of combined ORF sequences is 17210970 bp:
Nucleotide count:
A = 5603270
T = 4779634
C = 3289938
G = 3538128
GC fraction = 0.396727552253011 (i.e. 39.67%)
For non-coding regions...
Length of combined non-coding sequences is 3085113 bp:
Nucleotide count:
A = 1004675
T = 1004448
C = 538077
G = 537913
GC fraction = 0.348768424365655 (i.e. 34.88%)
Here, non-coding regions are taken to be any sequence that is not
in an ORF, Ty element, LTR sequence, or RNA sequence.
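
One way to apply a definition like this is to mask every annotated
position on a chromosome and count only what is left. The sketch below
is a rough illustration under my own assumptions: noncoding_counts is a
hypothetical helper, the features are (start, end) pairs in 0-based,
end-exclusive coordinates, and the toy sequence is made up. It is not
the procedure actually used for the numbers above.

```python
from collections import Counter


def noncoding_counts(seq, features):
    """Count bases at positions not covered by any annotated feature.

    seq      -- chromosome sequence as a string
    features -- iterable of (start, end) pairs, 0-based, end-exclusive
                (hypothetical format; ORFs, Ty elements, LTRs, RNAs, ...)
    """
    masked = bytearray(len(seq))  # 0 = unannotated, 1 = inside a feature
    for start, end in features:
        # Clip to the sequence so slice assignment never resizes the mask.
        start = max(start, 0)
        end = min(end, len(seq))
        if start < end:
            masked[start:end] = b"\x01" * (end - start)
    return Counter(b for b, m in zip(seq.upper(), masked) if not m)


# Toy example with two overlapping features covering positions 0..7.
toy = noncoding_counts("ATGCGCTTAAGG", [(0, 6), (4, 8)])
print(toy["A"], toy["G"])  # 2 2
```

Because the mask is per position, a base covered by two overlapping
features is still excluded only once.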
Hope this helps.
Keith B.
Department of Genetics,
University of Nottingham
--
-_-_-_-_-_-_- tggaagggct aattcactcc caacgaagac aagatatcct tgatctgtgg
-_-_-_-_- atctaccaca cacaaggcta cttccctgat tagcagaact acacaccagg
-_-_-
- http://evol.nott.ac.uk/~pdxkrb/