What is PEPATAC?
PEPATAC is an ATAC-seq pipeline. It trims adapters, maps reads, calls peaks, and creates bigwig tracks, TSS enrichment files, and other outputs. It is optimized for the unique features of ATAC-seq data, which makes it fast and accurate, and it provides several unique analytical approaches.
What makes PEPATAC better?
PEPATAC has many nice features, such as scalability, restartability, copious logging, portability, standardized reference genome assembly, nice QC plots, and beautiful HTML reports. But what really sets it apart from others are these key advantages:
- Standard project organization:
PEPATAC reads sample data formatted in standard PEP format. This means
PEPATAC projects are compatible with other PEP tools, such as
- Speed and memory use:
PEPATAC employs several speed optimizations, such as using GenomicDistributions. It requires substantially less time and memory than other pipelines.
PEPATAC pioneers a prealignment strategy to filter mitochondrial reads, leading to faster runtime and more accurate alignment statistics.
- Flexibility: Choose from multiple adapter trimmers, peak callers, and output options. Or, just use the defaults.
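The prealignment idea mentioned above can be illustrated with a toy sketch. This is not PEPATAC's implementation: real runs use a proper aligner and genome indexes, while here "alignment" is reduced to an exact substring match. The sequences and the function names `maps_to` and `prealign` are hypothetical, chosen only to show why removing mitochondrial reads first changes the alignment statistics.

```python
def maps_to(seq, reference):
    # Toy stand-in for a real aligner: a read "maps" only on an exact substring match.
    return seq in reference


def prealign(reads, decoy_reference):
    """Split reads into those kept for the main alignment and those
    filtered out because they map to the decoy (e.g. mitochondrial) sequence."""
    kept, filtered = [], []
    for name, seq in reads:
        if maps_to(seq, decoy_reference):
            filtered.append((name, seq))
        else:
            kept.append((name, seq))
    return kept, filtered


# Hypothetical toy sequences, not real genome data.
mito = "GATCACAGGTCTATCACCCTATTAACC"
genome = "TTGACCGTAGGCTAACGTTAGCCATG"
reads = [
    ("r1", "CAGGTCTATC"),  # matches the toy mitochondrial sequence
    ("r2", "CGTAGGCTAA"),  # matches the toy primary genome
    ("r3", "TATTAACC"),    # matches the toy mitochondrial sequence
    ("r4", "GGGGGGGG"),    # matches neither
]

# Step 1: remove reads that map to the decoy sequence.
kept, filtered = prealign(reads, mito)

# Step 2: align only the remaining reads to the primary genome.
mapped = [name for name, seq in kept if maps_to(seq, genome)]
alignment_rate = len(mapped) / len(kept)

print("filtered:", [name for name, _ in filtered])
print("mapped:", mapped)
print("alignment rate:", alignment_rate)
```

In this toy case the second step processes two reads instead of four, and the alignment rate is computed over non-mitochondrial reads only, which is the intuition behind the faster runtime and cleaner statistics.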
PEPATAC produces many outputs to set the stage for project-specific analysis:
- HTML report: View this HTML Summary report demo
- We have produced an interactive display of the output folder structure, which includes:
- Easily parsable summary statistics file
- Called peaks (in both
- BigWig signal tracks:
- nucleotide-resolution ("exact cut") signal
- smoothed signal
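To make the difference between the two signal tracks concrete, here is a minimal sketch under stated assumptions. It treats the nucleotide-resolution track as a per-base count of cut sites, and the smoothed track as a sum over a small window around each base. The window size, the summation, and the function names `exact_cut_signal` and `smooth_signal` are illustrative assumptions; PEPATAC's actual smoothing procedure may differ.

```python
def exact_cut_signal(cut_sites, length):
    """Count cut sites at each base position (nucleotide resolution)."""
    signal = [0] * length
    for pos in cut_sites:
        if 0 <= pos < length:
            signal[pos] += 1
    return signal


def smooth_signal(signal, half_window):
    """Sum the signal in a window of +/- half_window bases around each position.
    The window is clipped at the ends of the region."""
    n = len(signal)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half_window)
        hi = min(n, i + half_window + 1)
        smoothed.append(sum(signal[lo:hi]))
    return smoothed


# Hypothetical cut positions in an 8-bp region.
cuts = [2, 2, 5]
exact = exact_cut_signal(cuts, 8)
smooth = smooth_signal(exact, 1)

print(exact)
print(smooth)
```

The exact track keeps sharp, single-base spikes, while the smoothed track spreads each cut over its neighbors, which tends to be easier to read in a genome browser.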
PEPATAC is a Python script. Once you have all the prerequisites installed, you run it on the command line (see usage for the complete arguments list):
pepatac.py --input reads.fq.gz [options...]
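If you prefer to launch the pipeline from another Python program, the command above can be assembled as an argument list. This is a minimal sketch: only `pepatac.py` and `--input` come from the command shown here, and the helper name `build_pepatac_command` is hypothetical. Any extra options should be taken from the usage documentation rather than guessed.

```python
import shlex


def build_pepatac_command(input_fastq, extra_options=()):
    """Return the pipeline invocation as an argument list.
    extra_options is passed through unchanged; see the usage docs for valid flags."""
    return ["pepatac.py", "--input", input_fastq, *extra_options]


cmd = build_pepatac_command("reads.fq.gz")
print(shlex.join(cmd))

# To actually run it (requires PEPATAC and its prerequisites to be installed):
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Passing a list rather than a single string avoids shell quoting problems when file names contain spaces.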