The spatialGE.STenrich
module is part of the spatialGE package. It detects genes showing spatial expression patterns (e.g., hotspots) and tests if spots/cells with a high average expression of a gene set show evidence of spatial aggregation.
The package spatialGE can be used to detect spatial patterns in gene expression at the gene and gene set level. Data from multiple spatial transcriptomics platforms can be analyzed, as long as gene expression counts per spot or cell are associated with spatial coordinates of those spots/cells.
This module accepts an RDS file containing a normalized STlist object, such as output by the spatialGE.Preprocessing module.
Each row of the output CSV files represents a test for the null hypothesis of no spatial aggregation in the expression of the set in the “gene_set” column. The column “size_test” is the number of genes of a gene set that were present in the FOV. The larger this number the better, as it indicates a better representation of the gene set in the sample. The “adj_p_value” is the multiple test adjusted p-value, which is the value used to decide if a gene set shows significant indications of a spatial pattern (adj_p_value < 0.05).
Ospina, O. E., Wilson C. M., Soupir, A. C., Berglund, A. Smalley, I., Tsai, K. Y., Fridley, B. L. 2022. spatialGE: quantification and visualization of the tumor microenvironment heterogeneity using spatial transcriptomics. Bioinformatics, 38:2645-2647. https://doi.org/10.1093/bioinformatics/btac145
Name | Description | Default Value |
---|---|---|
input file * | Normalized spatial transcriptomics data coming from the spatialGE.Preprocessing module. | |
gene sets database * | Select a gene set database to test for spatial enrichment. Upload a gene set if your gene set is not listed as a choice from MSigDB. | |
permutations * | The number of permutations to estimate the null distribution (no-spatial pattern). The more permutations, the longer STenrich takes to complete, but p-values may be more accurate. | 100 |
random seed * | A seed number to replicate results. It is advisable to run STenrich with different seed values to check for consistency. Different seed values could yield slightly different p-values. | 12345 |
minumum spots * | The minimum number of high expression ROIs/spots/cells required for a gene set to be tested. If a sample has less than this number of high expression ROIs/spots/cells, the gene set is not tested in that sample. | 5 |
minimum genes * | The minimum number of genes of a set required to be present in a sample, for that gene set to be tested in that sample. If a sample has less genes of a set than this number, the gene set is ignored in that sample. | 5 |
standard deviations * | The number of standard deviations to define the high expression threshold. If an ROI/spot/cell has average gene set expression larger than the entire sample average plus this many standard deviations, it will be considered a high-expression ROI/spot/cell. | 1.0 |
filter p values * | Plot only gene sets whose multiple test adjusted p-value is less than this threshold. | 0.05 |
filter gene proportion * | Plot only gene sets where the proportional number genes in the set present in the field of view equals or exceeds this threshold. | 0.3 |
* required
Input:
lung.rds
Requires the genepattern/spatialge-stenrich:0.4 Docker image.
spatialGE.STenrich
is distributed under a BSD-style license available at https://github.com/genepattern/spatialGE.STenrich/blob/main/LICENSE.
Version | Release Date | Description |
---|---|---|
1 | June 28, 2024 | Initial version |