EDEG (JDEG) Analysis
DOLPHIN performs exon-level (junction-level) differential gene expression analysis using MAST results generated by the Seurat toolkit.
The MAST method [1] is used to evaluate exon-level statistical significance, and exon weights are integrated to compute gene-level scores.
- DOLPHIN.EDEG.call_convert.run_h5ad_rds(input_anndata, output_rds)[source]
Convert a .h5ad file to a Seurat RDS object by calling an R script.
- DOLPHIN.EDEG.generate_EDEG.run_edeg(seurat_output, adata_input, gtf_path, output)[source]
Aggregate exon-level marker results into gene-level statistics using weighted methods.
This function converts exon-level marker results from Seurat using MAST into gene-level statistical insights. It applies the Stouffer method to combine exon-level p-values, weighted by exon length, and computes average absolute log2 fold changes, also weighted by exon length.
- Parameters:
seurat_output (str) – Path to the CSV file containing exon-level differential expression results from Seurat.
adata_input (str) – Path to the .h5ad file used for generating exon-level data.
gtf_path (str) – Path to the GTF-derived annotation file generated by DOLPHIN.
output (str) – Path where the final gene-level results will be saved as a CSV file.
- Returns:
A DataFrame containing gene-level statistics including: - MAST_weighted_abs_avg_log2FC: Weighted average of absolute log2 fold changes per gene. - MAST_weighted_stouffer_pval: Combined p-value using the weighted Stouffer method. - MAST_weighted_stouffer_pval_adj_bonf: Bonferroni-adjusted Stouffer p-value.
- Return type:
pd.DataFrame
- DOLPHIN.EDEG.generate_JDEG.run_jdeg(seurat_output, output)[source]
Aggregate junction-level marker results into gene-level statistics using stouffer method.
This function converts junction-level marker results from Seurat using MAST into gene-level statistical insights. It applies the Stouffer method to combine junction-level p-values, and computes average absolute log2 fold changes.
- Parameters:
- Returns:
A DataFrame containing gene-level statistics including: - MAST_abs_avg_log2FC: Average of absolute log2 fold changes per gene. - MAST_stouffer_pval: Combined p-value using the Stouffer method. - MAST_stouffer_pval_adj_bonf: Bonferroni-adjusted Stouffer p-value.
- Return type:
pd.DataFrame