Author(s): Joe Solvason
Contact: Joe Solvason (solvason@eng.ucsd.edu)
Adapted as a GenePattern Module by: Ted Liefeld (jliefeld@cloud.ucsd.edu)
Task Type: Transciption factor analysis
LSID: urn:lsid:genepattern.org:module.analysis:00441
DefineTFBindingSites.from.PBM
normalizes the median fluorescence intensity (MFI) values obtained from protein-binding microarray (PBM) data for a transcription factor of interest. The k-mer with the maximum MFI that conforms to the binding site definition is normalized to 1.0 and all other k-mers are normalized relative to that MFI value. For example, a normalized value of 0.1 is 10% of the maximum MFI.
The raw PBM dataset for a transcription factor is downloaded from uniPROBE and the user indicates the columns of the forward k-mer and the MFI. The user also defines the minimal binding site using IUPAC nomenclature (ie, N = ATGC, W = AT, etc). The tool searches for the k-mer with the largest MFI signal that conforms to the IUPAC binding site definition. For all other k-mers, the MFI signal will be normalized relative to the MFI signal of the maximum k-mer and the resulting value is called the relative affinity. Therefore, the k-mer with the maximum MFI signal will have a relative affinity of 1.0. The normalization calculation for each k-mer is: relative affinity = (MFI signal) / (MFI signal of the maximum IUPAC k-mer).
* indicates required parameter
True
, a header exists in the raw PBM file. If False
, no header exists.default = False
True
, only report k-mers abiding by the binding site definition. If False
, report all k-mers.default = False
True
, enforce the relative affinities to range between 0 and 1. If False
, the original range of the relative affinities will remain the same.default = None
8-mer:
every possible forward k-mer sequence with length k8-mer:
the reverse complement of the forward k-merE-score:
the enrichment score of the k-merMedian:
the median fluorescence intensity of the k-merZ-score:
the z-score of the k-mer8-mer 8-mer E-score Median Z-score
AAAAAAAA TTTTTTTT 0.29130 2871.60 3.5965
AAAAAAAC TTTTTTTG 0.10748 2086.00 0.3958
AAAAAAAG TTTTTTTC 0.23656 2539.91 2.3673
AAAAAAAT TTTTTTTA 0.21760 2434.82 1.9442
AAAAAACA TTTTTTGT 0.19839 2407.46 1.8310
PBM Kmer:
the sequence of every possible k-merPBM Relative Affinity:
the relative affinity of each k-mer normalized to the k-mer with the highest MFIPBM Kmer PBM Relative Affinity
AAAAAAAA 0.15
AAAAAAAC 0.11
AAAAAAAG 0.13
AAAAAAAT 0.13
AAAAAACA 0.12
Example input data is available on github