gene_association.cgd.gz

Contains all GO annotations for C. albicans genes (protein and RNA).

The gene_association.cgd.gz file uses the standard file format for gene_association files of the Gene Ontology (GO) Consortium. A more complete description of the file format is found here:

http://www.geneontology.org/doc/GO.annotation.html#file

Columns and their contents:

1) DB - database contributing the file (always "CGD" for this file)
2) DB_Object_ID - CGDID
3) DB_Object_Symbol - see below
4) NOT (optional) - 'NOT' qualifier for a GO annotation, when needed
5) GO ID - unique numeric identifier for the GO term
6) DB:Reference(|DB:Reference) - the reference associated with the GO annotation
7) Evidence - the evidence code for the GO annotation
8) With (or) From (optional) - any With or From qualifier for the GO annotation
9) Aspect - which ontology the GO term belongs in
10) DB_Object_Name(|Name) (optional) - a name for the gene product in words, e.g. 'acid phosphatase'
11) DB_Object_Synonym(|Synonym) (optional) - see below
12) DB_Object_Type - type of object annotated, e.g. gene, protein, etc.
13) taxon(|taxon) - taxonomic identifier of species encoding gene product
14) Date - date GO annotation was made
15) Assigned_by - source of the annotation (always "CGD" for this file)

Note on CGD nomenclature (pertaining to columns 3 and 11):

Column 3 - When a Standard Gene Name (e.g. CDC28, COX2) has been conferred, it will be present in Column 3. When no Gene Name has been conferred, the ORF Name (e.g., orf19.6632) will be present in Column 3.

Column 11 - The ORF Name (e.g., orf19.6632) will be the first name present in Column 11. Any other names (except the Standard Name, which will be in Column 3 if one exists), including Aliases used for the gene, will also be present in this column.
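
As an illustration of the column layout and the nomenclature notes above, here is a minimal Python sketch that splits one annotation line into named fields and picks out the ORF Name. The field names, the helper functions, and the sample record are all hypothetical and made up for this example; they are not taken from the real file.

```python
# Field names mirror the 15 columns listed above; the Python identifiers
# themselves are illustrative, not part of any official specification.
GAF_COLUMNS = [
    "db", "db_object_id", "db_object_symbol", "qualifier", "go_id",
    "db_reference", "evidence", "with_or_from", "aspect", "db_object_name",
    "db_object_synonym", "db_object_type", "taxon", "date", "assigned_by",
]

# Columns whose description above allows several '|'-separated values.
PIPE_SEPARATED = ("db_reference", "db_object_name", "db_object_synonym", "taxon")


def parse_gaf_line(line):
    """Split one tab-delimited annotation line into a dict keyed by column."""
    fields = line.rstrip("\n").split("\t")
    # Pad short lines so that optional trailing columns become empty strings.
    fields += [""] * (len(GAF_COLUMNS) - len(fields))
    record = dict(zip(GAF_COLUMNS, fields))
    for key in PIPE_SEPARATED:
        record[key] = record[key].split("|") if record[key] else []
    return record


def orf_name(record):
    """Return the ORF Name, following the nomenclature note above.

    The first entry of column 11 is the ORF Name. Column 11 is optional,
    so fall back to column 3 when it is empty (an assumption of this sketch).
    """
    synonyms = record["db_object_synonym"]
    return synonyms[0] if synonyms else record["db_object_symbol"]


# A made-up record for demonstration only; every value is a placeholder.
SAMPLE_LINE = "\t".join([
    "CGD", "CAL0000000", "GENE1", "", "GO:0000000", "PMID:0000000",
    "IDA", "", "F", "hypothetical protein", "orf19.0000|ALIAS1",
    "gene", "taxon:0000", "20100101", "CGD",
])

if __name__ == "__main__":
    rec = parse_gaf_line(SAMPLE_LINE)
    print(rec["db_object_symbol"], orf_name(rec), rec["go_id"])
```

Returning lists for the pipe-separated columns keeps the single-value and multi-value cases uniform, at the cost of one extra indexing step when only one value is present.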
Note: This file contains ALL of the GO curation at CGD, whereas the gene_association file that is available on the GO consortium (GOC) web site, http://www.geneontology.org/, has been filtered according to GOC guidelines, which are discussed in more detail at http://www.geneontology.org/GO.annotation.shtml.

Note: The files are gzip-compressed, tab-delimited text files. Several freely available software options exist for decompressing gzipped files on Windows. The software and other useful information are available on these web sites:

- WinZip (http://www.winzip.com/)
- Stuffit (http://www.stuffit.com/)
- Gzip (http://www.gzip.org/ and the gzip user's manual: http://www.math.utah.edu/docs/info/gzip_toc.html)
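
As an alternative to a desktop tool, a gzip-compressed, tab-delimited file can also be read directly from a script. The following is a minimal sketch using the gzip module from the Python standard library. The function name is hypothetical, the UTF-8 encoding is an assumption, and skipping lines that begin with "!" assumes the file carries GO-style comment lines.

```python
import gzip


def read_annotation_lines(path):
    """Yield annotation lines from a gzip-compressed text file.

    Assumptions of this sketch: the text is UTF-8 (or plain ASCII), and
    lines starting with '!' are header or comment lines to be skipped.
    """
    # Mode "rt" decompresses and decodes to text in one step.
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            if line.startswith("!") or not line.strip():
                continue
            yield line.rstrip("\n")


if __name__ == "__main__":
    # Assumes the file has been downloaded into the current directory.
    for count, line in enumerate(read_annotation_lines("gene_association.cgd.gz")):
        print(line.split("\t")[:5])
        if count >= 4:  # show only the first five annotation lines
            break
```

Reading through gzip.open avoids writing an uncompressed copy to disk, which can be convenient when the file is only scanned once.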