![]() |
Gorilla gorilla gorilla Public Domain Photograph |
The Oct. 2008 Gorilla gorilla gorilla draft assembly (Sanger Institute genome_31265) was produced by the Sanger Institute sequencing center.
A genome position can be specified by the accession number of a sequenced genomic region, an mRNA or EST, a chromosomal coordinate range, or keywords from the GenBank description of an mRNA. The following list shows examples of valid position queries for the Gorilla genome. See the User's Guide for more information.
Request: | Genome Browser Response: | |
---|---|---|
chrZ | Displays all of chromosome Z | |
chr3:1-1000000 | Displays first million bases of chr 3 | |
chr3:1000000+2000 | Displays a region of chr 3 that spans 2000 bases, starting with position 1000000 | |
| ||
BX950328 | Displays region of mRNA with GenBank accession number BX950328 | |
BU335953 | Displays region of EST with GenBank accession BU335953 on chr Z | |
| ||
pseudogene mRNA | Lists transcribed pseudogenes, but not cDNAs | |
homeobox caudal | Lists mRNAs for caudal homeobox genes | |
zinc finger | Lists many zinc finger mRNAs | |
kruppel zinc finger | Lists only kruppel-like zinc fingers | |
zhang | Lists mRNAs deposited by scientist named Zhang | |
Taguchi,T. | Lists mRNAs deposited by co-author T. Taguchi | |
Use this last format for author queries. Although GenBank requires the search format Taguchi T, internally it uses the format Taguchi,T.. |
This is NCBI Genome Project 31265 with accession numbers specified by LOCUS accession record CABD01000000
This draft assembly is a 2x ABI based Phusion assembly which has been improved with 20x Solexa short read sequence. The short reads were aligned to the ABI assembly using Maq and where evidence exists from the Solexa reads that the ABI base call is wrong it has been corrected. Regions of the ABI assembly that had no Solexa coverage were assumed to be non-Gorilla sequence and removed. Large gaps in the Supercontigs are inferred by Phusion from the distance between placed fosmid end reads.
In this initial assembly, there are 217,735 identified gaps, comprising a total of 265,263,406 bases. There is an additional 16,910,764 bases marked with Ns outside of identified gaps. The total base count of AGCT nucleotides is 2,043,974,660 bases, plus the identified gaps and additional Ns add up to 2,326,148,830 bases of sequence in 609,861 Supercontigs. The Supercontig N50 size is 11,283 bases, with 45,589 Supercontigs larger than that comprising one half of the sequence (1,163,074,415 bases)
The Gorilla mitochondrial sequence is also available as the virtual chromosome "chrM". (NCBI accession NC_011120)Bulk downloads of the sequence and annotation data are available via the Genome Browser FTP server or the Downloads page. These data have specific conditions for use. The Gorilla browser annotation tracks were generated by UCSC and collaborators worldwide. See the Credits page for a detailed list of the organizations and individuals who contributed to the success of this release.