In genomic analyses, especially when working with non-model species, we often need to infer gene function based on similarity to other species. This means that lots of people are running the same, or similar, analyses on the same genomes and these can take a really long time to run. Everyone running the same analyses seems a bit silly so I’ve shared some of these results here. If others want to add to the list I’d be happy to include them.
Here I have uploaded the gene ontology analyses of the whole European honeybee (Apis mellifera 4.5) and buff-tailed bumblebee (Bombus terrestris 1.0) transcriptomes which was produced using Annocript. To run the analysis I shortened the names of the sequences to just the NCBI accession number. There is a control file which is the same between the two analyses although point to different fasta files. I have sanitized the control file to remove personal information and if people would like to analyze their data in a similar way they would need to point to the paths for the relevant programs.
The output file for each is a really big table with the NCBI accession number the inferred gene ontology and the sequence. For more info on the output file take a look here.
A quick note about usage. I’m happy for people to use these but please give credit and acknowledge the source of these files.
Downloads again:
Here are two more bumblebee ontology files shared by Michael Lattorff which came from the now defunct B2G-FAR repository.
Bombus impatiens annot file
Bombus terrestris annot file
These analyses were run at East Carolina University by Seth Barribeau with Dr. Michael Brewer