ENCODE-DCC / chip-seq-pipeline

Licence: MIT license
ENCODE Uniform processing pipeline for ChIP-seq


ENCODE ChIP-seq Pipeline
========================

Current implementation is deployed to the DNAnexus platform.

Mapping

  1. Map reads with BWA, mark duplicates with Picard, and remove duplicates.
  2. Estimate library complexity and calculate NRF (non-redundant fraction), PBC1, and PBC2 (PCR bottleneck coefficients).
  3. Run cross-correlation analysis with spp/phantompeakqualtools.
  4. Generate p-value and fold-over-control signal tracks with MACS2 for each replicate and for the pooled replicates.
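
The library-complexity metrics in step 2 can be illustrated with a minimal Python sketch. It assumes the commonly cited ENCODE definitions (NRF = distinct locations / total reads, PBC1 = M1 / M_distinct, PBC2 = M1 / M2, where M1 and M2 are the numbers of locations covered by exactly one and exactly two reads). The function name `library_complexity` and the `(chrom, start, strand)` input format are hypothetical and are not taken from the pipeline's code.

```python
from collections import Counter


def library_complexity(positions):
    """Compute NRF, PBC1 and PBC2 from uniquely mapped read locations.

    `positions` is an iterable of hashable read locations, for example
    (chrom, start, strand) tuples. This input format is an assumption
    made for illustration only.
    """
    counts = Counter(positions)          # reads per distinct location
    total = sum(counts.values())         # all reads
    distinct = len(counts)               # distinct locations (M_distinct)
    m1 = sum(1 for c in counts.values() if c == 1)  # locations with exactly 1 read
    m2 = sum(1 for c in counts.values() if c == 2)  # locations with exactly 2 reads
    nan = float("nan")
    return {
        "NRF": distinct / total if total else nan,
        "PBC1": m1 / distinct if distinct else nan,
        "PBC2": m1 / m2 if m2 else nan,
    }


# Toy input: five reads, one location duplicated once.
reads = [
    ("chr1", 100, "+"),
    ("chr1", 100, "+"),
    ("chr1", 200, "+"),
    ("chr2", 50, "-"),
    ("chr2", 75, "-"),
]
metrics = library_complexity(reads)
print(metrics)  # {'NRF': 0.8, 'PBC1': 0.75, 'PBC2': 3.0}
```

In practice these counts would come from the filtered alignment file rather than an in-memory list; the sketch only shows how the three ratios relate to each other.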

Peak calling (histone marks)

  1. Call peaks with MACS2.
  2. Calculate and report overlapping peaks from both replicates.
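
As a rough illustration of step 2, the sketch below keeps peaks from one replicate that overlap a peak in the other. The overlap criterion (at least half of the shorter peak, set by `min_fraction`), the function name `overlapping_peaks`, and the `(chrom, start, end)` tuple format are all assumptions for this example; the pipeline's actual overlap rule may differ.

```python
def overlapping_peaks(rep1, rep2, min_fraction=0.5):
    """Return rep1 peaks that overlap some rep2 peak on the same chromosome.

    Peaks are (chrom, start, end) half-open intervals. A pair counts as
    overlapping when the shared span covers at least `min_fraction` of the
    shorter peak (an assumed criterion, chosen for illustration).
    """
    # Group replicate 2 peaks by chromosome for quick lookup.
    by_chrom = {}
    for chrom, start, end in rep2:
        by_chrom.setdefault(chrom, []).append((start, end))

    kept = []
    for chrom, start, end in rep1:
        for s2, e2 in by_chrom.get(chrom, []):
            shared = min(end, e2) - max(start, s2)
            shorter = min(end - start, e2 - s2)
            if shared > 0 and shared >= min_fraction * shorter:
                kept.append((chrom, start, end))
                break  # one supporting peak is enough
    return kept


rep1 = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 10, 50)]
rep2 = [("chr1", 150, 250), ("chr1", 590, 700), ("chr2", 100, 150)]
print(overlapping_peaks(rep1, rep2))  # [('chr1', 100, 200)]
```

The nested loop is quadratic per chromosome, which is fine for a toy example; real peak sets are usually intersected with a sorted sweep or a tool such as bedtools.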

Peak calling (transcription factors)

  1. Call peaks with SPP.
  2. Threshold peaks with IDR.
  3. Report IDR-thresholded peak sets, self-consistency ratio, rescue ratio, and reproducibility test result.
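
The ratios reported in step 3 can be sketched as follows. This assumes the commonly described ENCODE convention: N1 and N2 are the IDR-thresholded peak counts from each replicate's self-pseudoreplicates, Nt is the count from the true replicates, and Np is the count from the pooled pseudoreplicates. The function name, argument names, and the threshold of 2 are stated here as assumptions rather than read from the pipeline's code.

```python
def reproducibility(n1, n2, nt, n_pooled, threshold=2.0):
    """Summarize IDR reproducibility from four peak counts.

    n1, n2   : peaks passing IDR in each replicate's self-pseudoreplicates
    nt       : peaks passing IDR between the true replicates
    n_pooled : peaks passing IDR between pooled pseudoreplicates
    The threshold of 2 is an assumed default for this sketch.
    """
    def ratio(a, b):
        # Larger count over smaller; infinite if the smaller count is zero.
        lo, hi = min(a, b), max(a, b)
        return hi / lo if lo else float("inf")

    self_consistency = ratio(n1, n2)
    rescue = ratio(n_pooled, nt)

    if self_consistency > threshold and rescue > threshold:
        result = "fail"
    elif self_consistency > threshold or rescue > threshold:
        result = "borderline"
    else:
        result = "pass"
    return {
        "self_consistency_ratio": self_consistency,
        "rescue_ratio": rescue,
        "reproducibility_test": result,
    }


summary = reproducibility(n1=30000, n2=20000, nt=25000, n_pooled=40000)
print(summary["reproducibility_test"])  # pass
```

Both ratios are symmetric by construction, so swapping the two replicates does not change the outcome.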