IRC ChIPSeq Data Repository

Welcome to the IRC Chromatin Tandem Affinity Purification Sequencing (ChTAP Seq) data repository. This repository contains data generated by Solexa sequencing of chromatin fragments isolated by tandem affinity purification using TAP-tagged transcription factors.

The data sets provided in this initial release are in headerless BED format. The BED files were generated from the eland_extended files output from the Illumina ELAND pipeline. The files contain all sequences mapping to 10 or fewer locations on the mouse genome (assembly version 37/mm9) with two or fewer mismatches; we will be adding files containing only sequences mapping perfectly to a single location in the near future.

The BED files can be used as input to various ChIP-seq peak identification algorithms, or can be uploaded to genome browsers such as the UCSC genome browser. In order to view the files using the UCSC browser, a header line must be added to the file (see the file definition here). When viewed through the UCSC browser, the mapped reads are colour-coded as follows:

  • Dark blue/light blue: Reads mapping perfectly to the +/- genomic strand.
  • Orange/yellow: Reads mapping with one mismatch to the +/- genomic strand.
  • Red/pink: Reads mapping with two mismatches to the +/- strand.

Search Repository