Index of /download/homology/orthologs/C_albicans_SC5314_A_nidulans_by_inparanoid

Icon  Name                                                          Last modified      Size  Description
[PARENTDIR] Parent Directory - [TXT] C_albicans_SC5314_A_nidulans_orthologs.txt 2019-08-22 13:44 135K [TXT] inparanoid_output.08-22-2019.txt 2023-06-29 09:53 1.2M [   ] orf_trans_all_Aspergillus_nidulans.08-22-2019.fasta.gz 2023-06-29 09:53 3.9M [   ] orf_trans_all_Candida_Assembly_21_haploid.09-09-2013.fasta.gz 2023-06-29 09:53 2.3M [   ] orf_trans_all_Candida_Assembly_22_haploid.08-22-2019.fasta.gz 2023-06-29 09:53 3.0M [   ] pompep.08-22-2019.fasta.gz 2023-06-29 09:53 1.5M [   ] rejected_sequences.pompep.08-22-2019.fasta.gz 2023-06-29 09:53 25K
This directory contains the orthology assignments between C. albicans SC5314 and A. nidulans

The assignments were made using InParanoid version 3.0 (http://inparanoid.sbc.su.se/).

To run InParanoid, the current set of C. albicans SC5314 protein sequences from CGD  
were compared to the latest set of A. nidulans proteins from the Aspergillus Genome Database (AspGD); 
the set of S. pombe proteins from the Sanger Institute was used as an outgroup.
Stringent cutoffs were set: BLOSUM80 (instead of the default BLOSUM62),
and an InParanoid score of 100%. 

Please note, the ortholog pairings were automatically generated, with no
curator intervention. Thus, there will occasionally be pairings that
may not occur with a different scoring matrix. In the interests of
automating the process, we do not intend to hand-curate the ortholog
pairs at this time.

Files are provided containing the input sequences that were used by InParanoid, 
and the raw output file that was generated by InParanoid. In addition,
a file containing the processed output, listing orthology assigments
is also provided.


The following files are available:

C_albicans_SC5314_A_nidulans_orthologs.txt                     - the processed output.  
							       The format of the pairwise ortholog and Best Hit files (updated
							       September 2012) contains three tab-delimited columns for each
							       organism.
							       For each organism, the columns display:
							       - the systematic name of the gene
							       - the standard/genetic name of the gene (if one exists)
							       - the database identifier for the gene.

orf_trans_all_Candida_Assembly_21_haploid.MM-DD-YYYY.fasta.gz  - the C. albicans SC5314 protein file used as input
orf_trans_all_Aspergillus_nidulans.MM-DD-YYYY.fasta.gz         - the A. nidulans protein file used as input
pompep.MM-DD-YYYY.fasta.gz                                     - the S. pombe protein set used as an outgroup

inparanoid_output.MM-DD-YYYY.txt                               - the raw output from InParanoid
rejected_sequences.pompep.MM-DD-YYYY.fasta.gz                  - the sequences rejected due to the S. pombe outgroup



The dates (indicated by MM-DD-YYYY) in the above file names represent the date when 
the input files were downloaded and latest set of ortholog predictions generated.