Description

This track displays transcriptome data from tiling GeneChips produced by Affymetrix. For the complete non-repetitive portion of the human genome, more than 256 million probe pairs were tiled every 5 bp in non-repeat-masked areas and hybridized to cytosolic polyA+ long RNA (>200 nucleotides) from 8 different cell lines. Note that the female cell lines HeLa, SK-N-AS, and U87MG do not contain data for chrY. For HeLa and HepG2, samples were also produced for nuclear polyA+ RNA in addition to cytosolic polyA+ RNA. For experimental details and results, see Kapranov et al. in the References section below.

This track contains the signal level estimated for each mapped probe position after hybridization with long RNA samples. A separate track -- "Affy Tx lRNA Reg " -- contains transcribed fragments (transfrags) representing long RNAs (see Kapranov et al. in the References section below). While the raw data are based on perfect match minus mismatch (PM - MM) probe values and may contain negative values, the track has a minimum value of zero for visualization purposes. Likewise, the probes with high signal values are cut off at the intensity of 150 by default and will appear to have similar magnitude.

Display Conventions and Configuration

This annotation follows the display conventions for composite tracks. The subtracks within this annotation may be configured in a variety of ways to highlight different aspects of the displayed data. The graphical configuration options are shown at the top of the track description page, followed by a list of subtracks. For more information about the graphical configuration options, click the Graph configuration help link. To display only selected subtracks, uncheck the boxes next to the tracks you wish to hide.

Methods

For each data point, probes within 30 bp on either side were used to improve the estimate of expression level for a particular probe. This helped to smooth the data and produce a more robust estimate of the transcription level at a particular genomic location. The following analysis steps were performed:

  1. Replicate arrays were quantile-normalized and the median intensity (using both PM and MM intensities) of each array was scaled to a target value of 50.
  2. The expression level was estimated for each mapped probe position by collecting all the probe pairs that fell within a window of ± 30 bp, calculating all non-redundant pairwise averages of PM - MM values of all probe pairs in the window, and taking the median of all resulting pairwise averages.
  3. The resulting signal value is the Hodges-Lehmann estimator associated with the Wilcoxon signed-rank statistic of the PM - MM values that lie within ± 30 bp of the sliding window centered at every genomic coordinate.
  4. Transfrags in the track labeled "Affy Tx lRNA Reg" were determined by connecting adjacent positive probes with a maximum allowable gap of 11 nucleotides and a minimum run of 49 nucleotides (at least 11 probes with at most one negative probe between two positive probes). Transfrags were discarded if they overlapped pseudogenes or harbored low-complexity or repetitive sequences.

Credits

Data generation and analysis were performed by the transcriptome group at Affymetrix with assistance from colleagues at the University of Leipzig, Fraunhofer Institute, and University of Vienna: P. Kapranov, J. Cheng, S. Dike, D.A. Nix, R. Duttagupta, A.T. Willingham, P.F. Stadler, J. Hertel, J. Hackermüller, I.L. Hofacker, I. Bell, E. Cheung, J. Drenkow, E. Dumais, S. Patel, G. Helt, M. Ganesh, S. Ghosh, A. Piccolboni, V. Sementchenko, H. Tammana, T.R. Gingeras.

Questions or comments about this annotation? Email genome@soe.ucsc.edu.

References

Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermüller J, Hofacker IL, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007 Jun 8;316(5830):1484-8.