WHITEHEAD INSTITUTE/MIT CENTER FOR GENOME RESEARCH
HUMAN GENOMIC MAPPING PROJECT
DATA RELEASE 3 (JULY 1994)
The third quarterly release of data from the Human Physical Mapping
Project at the Whitehead Institute/MIT Genome Center, covering data
generated through the end of June, 1994, is now available.
This data release contains YAC screening data for 3419 sequence
tagged sites (STSs) screened on the CEPH mega-YAC library. For
each STS, we report addresses for the YACs
found to contain the STS. From the data obtained so far, there are
over 400 contigs assembled using double linkage between STSs.
The data is available electronically in two ways.
ANONYMOUS FTP: The entire data release is available as a set of
Microsoft Excel files and tab-delimited ascii files on our ftp
server. Using an ftp client (such as "Fetch" on the Macintosh),
connect to "genome.wi.mit.edu". Use "anonymous" as your user name,
and give your e-mail address as your password. The data files are
present in the directory /distribution/human_STS_releases/july94.
The contents are as follows:
07-94.INTRO.hqx - Description of the data release, Macintosh format.
07-94.INTRO.txt - The same as ascii text
07-94.INTRO.ps - The same in Postscript format.
07-94.STS.YAC.hqx - STS & YAC screening data, in MS-Excel format.
07-94.STS.YAC.txt - The same as tab-delimited text.
07-94.sequence - Full sequences of previously unpublished STSs.
THE WORLD-WIDE WEB: You will need a World Wide Web client such as
Mosaic (Unix, MS-Windows and Macintosh) or MacWeb (Macintosh).
Instruct your client to connect to "http://www-genome.wi.mit.edu/".
>From there, follow the "Human STS Data Release" link. You will be
able to browse and download the raw data set as well as to view
A subset of the STSs (those for which we have chromosomal assignments)
are also available through the Genome Database (GDB).
QUESTIONS AND PROBLEMS. If users have any questions or problems,
please contact us at human_STS_help at genome.wi.mit.edu We invite
suggestions about how to make these data release most useful.
DATA RELEASE POLICY AND CITATION. Data releases are scheduled every
90 days. At the end of each calendar quarter, all genomic mapping
data are reviewed and prepared for distribution via CGR's electronic
databases. Data releases typically occur within two weeks of the
close of the quarter (i.e., in mid-January, mid-April, mid-July and
mid-October). Releases are announced by electronic messages posted
to the following two newsgroups: "bionet.genome.chromosomes" and
CGR's data release policy aims to ensure that scientific colleagues
have immediate access to information that may assist them in the
search for genes. Data releases do not constitute scientific
publication of CGR's work, but rather provide scientists with a
regular look into our lab notebooks. For projects aimed at the
analysis of particular genes or subchromosomal regions, permission is
hereby granted to use our data without the need for a formal
collaboration, subject only to appropriate acknowledgment. For
projects aimed at large-scale mapping of entire chromosomes or entire
genomes, use of the data and markers should be on a collaborative
The information for the human genome mapping project should be cited
as: Whitehead Institute/MIT Center for Genome Research, Human Genetic
Mapping Project, Data Release 3 (July 1994).
Lincoln D. Stein Whitehead Institute/MIT Genome Center
lstein at genome.wi.mit.edu Cambridge, MA 02142