All Projects → ratschlab → pancanatlas_code_public

ratschlab / pancanatlas_code_public

Licence: MIT license
Public repository containing research code for the TCGA PanCanAtlas Splicing project

Programming Languages

python
139335 projects - #7 most used programming language
Jupyter Notebook
11667 projects
shell
77523 projects
r
7636 projects

Code for Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients

Public repository containing research code for the TCGA PanCanAtlas Splicing project accompanying the manuscript Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients

The code is organized in several sub-directories, which contain the following content:

  • alt_splice Code concerning the detection and analysis of alternative splicing events.

    • events
      Wrapper code for the detection of splicing events in the cohort using SplAdder.

    • event_stats
      Code for visualising the event type distributions and number of alternative splicing events detected per cancer type.

    • filtering
      Code implementing various filters on the AS events, based on read coverage, min delta PSI, etc.

    • complexity
      Detection of neo-junctions predominantly present in cancer samples but absent from annotation or the GTEx outgroup.

    • splice_outliers
      Code for detection and visualization of splicing outliers.

    • tumor_vs_normal_rank
      Identification and ranking of tumor specific introns.

    • tumor_vs_normal_stats
      Assessment of events detected in tumor samples vs normal samples.

    • tumor_vs_normal_test
      Differential testing of tumor vs normal samples on sets of subsamples per cancer type.

  • neoepitopes
    Contains code for the visualization of results on the prediction of neo-epitopes.

  • peptide_generation
    Code concerned with generating peptides from splice-graphs.

  • qc
    Code and additional resources related to quality control of input data.

  • sample_processing_rna
    Code for pre-processing and alignment of the RNA-Seq data, including expression counting, the collection of usage statistics, and construction of splicing graphs.

  • sqtl
    Visualization scripts for the sQTL analyses.

  • tSNE_and_heatmap_visualizations
    Code for tSNE embeddings on alternative splicing and expression data as well as scripts for visualization and different highlightings.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].