Author(s): Joe Solvason
Contact: Joe Solvason (solvason@eng.ucsd.edu)
Adapted as a GenePattern Module by: Ted Liefeld (jliefeld@cloud.ucsd.edu)
Task Type: Transciption factor analysis
LSID: urn:lsid:genepattern.org:module.analysis:00441
DefineTFBindingSites.from.PBM normalizes the median fluorescence intensity (MFI) values obtained from protein-binding microarray (PBM) data for a transcription factor of interest. The k-mer with the maximum MFI that conforms to the binding site definition is normalized to 1.0 and all other k-mers are normalized relative to that MFI value. For example, a normalized value of 0.1 is 10% of the maximum MFI.
The raw PBM dataset for a transcription factor is downloaded from uniPROBE and the user indicates the columns of the forward k-mer and the MFI. The user also defines the minimal binding site using IUPAC nomenclature (ie, N = ATGC, W = AT, etc). The tool searches for the k-mer with the largest MFI signal that conforms to the IUPAC binding site definition. For all other k-mers, the MFI signal will be normalized relative to the MFI signal of the maximum k-mer and the resulting value is called the relative affinity. Therefore, the k-mer with the maximum MFI signal will have a relative affinity of 1.0. The normalization calculation for each k-mer is: relative affinity = (MFI signal) / (MFI signal of the maximum IUPAC k-mer).
* indicates required parameter
True, a header exists in the raw PBM file. If False, no header exists.default = FalseTrue, only report k-mers abiding by the binding site definition. If False, report all k-mers.default = FalseTrue, enforce the relative affinities to range between 0 and 1. If False, the original range of the relative affinities will remain the same.default = None8-mer: every possible forward k-mer sequence with length k8-mer: the reverse complement of the forward k-merE-score: the enrichment score of the k-merMedian: the median fluorescence intensity of the k-merZ-score: the z-score of the k-mer8-mer 8-mer E-score Median Z-score
AAAAAAAA TTTTTTTT 0.29130 2871.60 3.5965
AAAAAAAC TTTTTTTG 0.10748 2086.00 0.3958
AAAAAAAG TTTTTTTC 0.23656 2539.91 2.3673
AAAAAAAT TTTTTTTA 0.21760 2434.82 1.9442
AAAAAACA TTTTTTGT 0.19839 2407.46 1.8310
PBM Kmer: the sequence of every possible k-merPBM Relative Affinity: the relative affinity of each k-mer normalized to the k-mer with the highest MFIPBM Kmer PBM Relative Affinity
AAAAAAAA 0.15
AAAAAAAC 0.11
AAAAAAAG 0.13
AAAAAAAT 0.13
AAAAAACA 0.12

Example input data is available on github