Notes from ACEDB Workshop, part I

John L. McCarthy jlmccarthy at lbl.gov
Wed Sep 8 11:42:17 EST 1993


By popular request, here are some very rough outline notes I took at the
ACEDB Workshop in Boston last month. I cannot vouch for their accuracy, but
hope I have not misquoted anyone too badly. Please post corrections and
clarifications to this newsgroup.

The full set of notes is about 20 pages long. I have broken them into two
parts to accommodate my news reader and mail software.

If you have questions about particular points, you probably should send
them directly to the people who made those points at the workshop.  A list
of workshop attendees with Email addresses was posted to this newsgroup a
few weeks ago.

A copy of this document is available via anonymous ftp
on weeds.mgh.harvard.edu  under pub/acedb-dev/doc

-John McCarthy, Lawrence Berkeley Laboratory


==================================================================
ACEDB Workshop  Wed 28-31 July 1993 Boston, MA 
Highlight notes from John McCarthy (LBL) jlmccarthy at lbl.gov 

more or less in chronological order, with TOPICS IN CAPS
see list of attendees for full names, addresses, etc.

Wed 28 July 
=================================
COMMUNICATIONS, GOPHER
Mike Cherry is providing ACEDB archive on his gopher server at
Stanford -- genome.stanford.edu.

Mike asks if people could please send notes to bionet.software.acedb.
Another thing that would be nice to have is a FAQ - Brad will do this.

weeds.mgh.harvard.edu has anonymous gopher+  login/server
Also experimenting with remote anonymous AATDB login aatdb
user name c-aatdb used for MacX
Mike used Richard's acedump to create gopher files.
WAIS paragraph format / from acedump    also textace

Brad notes that his gopher archive includes USDA sysad archive.

Brad Sherman did research for secure anonymous ftp

Mike: weeds is a separate machine, not cross mounted to anything, and
not used for anything else

stty script --  Otto points out that you can trap interrupts with the Bourne
shell
Brad points out that most vulnerable spot is within application.

anonymous FTP setup instructions,  
thor.ece.uc.edu
/pub/sun-faq/anon-ftp.script
guidelines:  . cert.org   /pub/tech.tips/anonymous.ftp

=========================================================
PRINTING FOR REMOTE USERS
Richard: set ACEDB_LPR to null -- code checks for this.
maybe could do a script that turns print script into Email.
Michelle: also may want to print to a file
(Richard:  currently always goes into /PS directory).
Stan: might be nice to have switch to disable all writing
could do changeroot to keep people restricted to specified area.
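
A minimal sketch of the ACEDB_LPR check mentioned above, written in Python
rather than ACEDB's C. It assumes "null" means the variable is set to an
empty string; the function name and the "lpr" fallback for an unset variable
are hypothetical, not taken from the ACEDB source.

```python
import os
from typing import Mapping, Optional


def print_command(env: Mapping[str, str] = os.environ,
                  default: str = "lpr") -> Optional[str]:
    """Return the print command to use, or None when printing is disabled."""
    value = env.get("ACEDB_LPR")
    if value is None:
        # Assumption: an unset variable falls back to a default command.
        return default
    if value.strip() == "":
        # Set but empty ("null"): treat printing as switched off.
        return None
    return value
```

A site serving remote anonymous users would then export ACEDB_LPR as an empty
string in the login script, so that print requests are ignored rather than
sent to a local printer.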

=================================
ACEDB NEW DEVELOPMENTS - Richard Durbin: 
ORGANIZATIONAL STATUS, what Richard and Jean are doing:
RD  -worm sequencing (2,5,10 Mb/yr)
      -Sanger Centre  (RD joint head of informatics)
      -CGC map -> Cambridge  (worm cytogenetics center)
            Mary O'Callaghan doing this (moc at mrc-lmb.cam.ac.uk)
      -in house use: sequence (worm, yeast, human)
                             worm genetic map
   so RD will have less time for kernel development

Jean:  -Human mapping - Frensal (Genethon)
          -IGD
          -ports to various unix systems
          - management of kernel  (want to keep unified version)
             incorporate changes from others & redistribute
             also would like to make others' tools & modules available

RELEASES:
        1-10  now  (still backwards compatible with old databases)
        2-0 September? (model changes for maps, sequences)
                             running this in Cambridge now
                      but want to make other changes before release
              would like to get some feedback from this workshop first
     developers can get 2-0 code but should not use for databases
          that will be distributed, etc.
Will have AWK scripts for database restructuring from 1-10 to 2-0
Extra things in specifications directories, PS fonts, Xfonts.wrm,


NEW ARCHITECTURE FOR MAP DISPLAY-- RD
  columnar architecture/format will help
  would also be nice to have a developers' style guide

Stan: would be nice to have another level of indirection, so people
could map different names to std display code names -- intermediate
layer to make it easier to change models and yet maintain
compatibility with display modules.

RD: not convinced that another layer of indirection is good idea
may slow things down

RD: people should not expect that all the good ideas that get
suggested here will get implemented.  Commitment to maintain kernel.

------------------- break-----------------------
NEW DEVELOPMENTS (mostly in 1-10)
SUBCLASSES 
  -can appear on main window
  -can use in queries, .ace files
  -no code change required
specification
  -wspec/subclasses.wrm
  -read after models.wrm
  -ace file
      class genome_Sequence
      Subclass-of Sequence  //can be a subclass  (can have 2 parents?)
      Visible            //indicates whether displayed on main window
      Filter     "Genomic_canonical"  query
  How it works:
       Have 8 bits free per key in memory 
       -uses 1 bit per filter
       checks on object save
       when filter changes -- rescans entire class with query
       can't change entries in class class directly, reads from subclasses.wrm
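
The filter-bit scheme above can be sketched roughly as follows. This is a toy
model in Python, not ACEDB's C implementation: the class and method names are
hypothetical, objects are plain dictionaries, and a filter is a Python
predicate standing in for an ACEDB query.

```python
from typing import Callable, Dict, List

MAX_FILTERS = 8  # the notes mention 8 free bits per key


class SubclassIndex:
    """Toy model of per-key filter bits (hypothetical, for illustration)."""

    def __init__(self) -> None:
        self.filters: Dict[str, Callable[[dict], bool]] = {}
        self.bit_of: Dict[str, int] = {}    # subclass name -> bit number
        self.flags: Dict[str, int] = {}     # object key -> bitmask
        self.objects: Dict[str, dict] = {}  # object key -> object

    def _apply(self, key: str, subclass: str) -> None:
        # Set or clear this subclass's bit for one object.
        bit = 1 << self.bit_of[subclass]
        if self.filters[subclass](self.objects[key]):
            self.flags[key] |= bit
        else:
            self.flags[key] &= ~bit

    def set_filter(self, subclass: str,
                   predicate: Callable[[dict], bool]) -> None:
        # One bit per filter; a new subclass takes the next free bit.
        if subclass not in self.bit_of:
            if len(self.bit_of) >= MAX_FILTERS:
                raise ValueError("no free filter bits left")
            self.bit_of[subclass] = len(self.bit_of)
        self.filters[subclass] = predicate
        # When a filter changes, rescan the entire class.
        for key in self.objects:
            self._apply(key, subclass)

    def save(self, key: str, obj: dict) -> None:
        # On object save, re-check every filter for this one object.
        self.objects[key] = obj
        self.flags.setdefault(key, 0)
        for subclass in self.filters:
            self._apply(key, subclass)

    def members(self, subclass: str) -> List[str]:
        # Membership is a bit test; no query is run here.
        bit = 1 << self.bit_of[subclass]
        return sorted(k for k, f in self.flags.items() if f & bit)
```

The point of the design is that listing a subclass only tests one bit per
key; the query itself runs on object save and on filter change.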
        
May be possible in future:
      to have different default display type for different subclasses
      
Used in 2.0
?Map   -  Chromosome
?Locus - gene, RFLP, etc.
?Interval - rearrangement, Chrom_Band, Hybrid cell line, ...
?Df_dup_defic     Inside
                           Outside
?Probe

2-0 has a models class 
so won't have ?class as part of keyset
you will get this type of info in other ways

2 possible ways to do sub-maps
  1 scale with some linear transformation
  2 interpolate between shared loci (now done in 1-10 physical map)
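
The two approaches above can be sketched as follows, in Python rather than
ACEDB's C. The function names and the anchor-list representation are
assumptions for illustration; in particular, extrapolating from the nearest
pair of shared loci for positions outside the anchored range is a choice made
here, not something stated at the workshop.

```python
from bisect import bisect_right
from typing import List, Tuple


def linear_transform(pos: float, scale: float, offset: float) -> float:
    """Way 1: map a sub-map position with a single linear transformation."""
    return scale * pos + offset


def interpolate(pos: float, anchors: List[Tuple[float, float]]) -> float:
    """Way 2: piecewise-linear interpolation between shared loci.

    anchors holds (submap_position, parent_position) pairs for loci present
    on both maps, with distinct sub-map positions.
    """
    if len(anchors) < 2:
        raise ValueError("need at least two shared loci")
    anchors = sorted(anchors)
    xs = [a[0] for a in anchors]
    # Pick the segment containing pos, clamped to the first/last segment.
    i = bisect_right(xs, pos) - 1
    i = max(0, min(i, len(anchors) - 2))
    (x0, y0), (x1, y1) = anchors[i], anchors[i + 1]
    return y0 + (pos - x0) * (y1 - y0) / (x1 - x0)
```

With a single scale factor the whole sub-map stretches uniformly, whereas
interpolation lets each interval between shared loci stretch by its own
factor, which is why it copes better with maps built from different data.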

Otto: IGD has used clone grid as general model for 2-D maps

RD: map display now automatically transforms contig info in this way
 maybe could do some graphic to indicate where there are conflicts
  between different versions of maps

GRAPH PACKAGE additions
   filled arcs
   filled polygons (from Suzanna)

IMAGES (grey level: 256-->128)
      (a) in subwindow    graphPixels()
      (b) PIXEL_SCROLL graph type   (coordinates are pixels) 
            graph Pixel Background()

Possible Development
      -ASCII dump format for draw stack
      -stand alone drawing editor
 full color images will require more thought!
now easier to attach sub-menus to boxes

Otto: can pick up X colors and coordinates to use from XV
Suzanna: has done some things to display TIFF images

MAP PACKAGE
map.h     w7/mapcontrol.c
A. drawing coordinates  (zoom, centre, scroll, etc.)
B. columns of data (user toggleable)
C. Cursor

MAP  *map
Look -> map -> mapCreate(column, ...         )
{float   min, max
Coordinate Functions
Locator (small green bar)

Stan: is this the way you would draw multiple (synteny) maps?
RD: not yet..., in the meantime can use TableMaker multi-map

Planned Enhancements
1. add a help line per column for display control
2. allow >32 columns
3. give names to sets of columns
4. ?  Multiple ?MAP's per window

Code:
     separate columns further
             ==>library of columns
     separate SEG's array per column    (convert (store),  destroy)
   put PICK control in MAP package
          -select   /  follow
          - neighbor       on/off  (key)  [e.g., to light up related objects]

Sequence Fetch/Align
--------------------------
a. summon up SwissProt/PIR entry(s)
b. show multiple alignment based on BLAST segments  =BLIXEM
uses unix command
   fetch  -Bdbname (SP,PIR,EMBL,GB)
             -a accession number
             -P Pearson (FASTA) format
             -s  just  sequence ... no new line
   our version uses the EMBL CDROM index structu


