The retroGene track shows processed mRNAs that have been inserted back into the genome since the mouse/human split. RetroGenes can be either functional genes that have acquired a promoter from a neighboring gene, non-functional pseudogenes, or transcribed pseudogenes.
All mRNAs of a species from GenBank were aligned to the genome using blastz. mRNAs that aligned twice in the genome (once with introns and once without introns) were initially screened. Next, a series of features were scored to determine candidates for retrotranspostion events. These features include position and length of the polyA tail, degree of synteny with mouse, coverage of repetitive elements, number of exons that can still be aligned to the retroGene and degree of divergence from the parent gene. Retrogenes are classified using a threshold score function that is a linear combination of this set of features. RetroGenes in the final set have a score threshold greater than 425 based on a ROC plot against the Vega annotated pseudogenes.
The "type" field has four possible values:
The RetroFinder program and browser track were developed by Robert Baertsch at UCSC.
Kent, W.J., Baertsch, R., Hinrichs, A., Miller, W., and Haussler, D. Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci USA 100(20), 11484-11489 (2003).
Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R., Haussler, D., and Miller, W. Human-Mouse Alignments with BLASTZ. Genome Res. 13(1), 103-7 (2003).