MAD Quality Metric

Replicate log-ratio standard deviation

The Median Absolute Deviation (MAD) quality metric is more accurately described as the replicate log-ratio standard deviation. This metric evaluates reproducibility between two biological or technical replicates within RNA-seq/RAMPAGE/Entex experiments by comparing expression levels across all genes whose expression passes the preset cut-off in both replicates. The standard deviation of the log-ratios between replicates is estimated using a robust approach, and the value is calculated for each pair of replicates.

The MAD score between replicates X1 and X2 of the sample X is calculated as:

MAD = 1.4826 * median | log2(X1_i/X2_i) | 

where
X1 = [X1_1, X1_2, ..., X1_N] are the expression values in replicate #1
X2 = [X2_1, X2_2, ..., X2_N] are the expression values in replicate #2

The MAD score is calculated on normalized FPKM values, using only the genes i = 1..N for which both X1_i and X2_i exceed the preset cut-off of 1 FPKM.
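
The calculation above can be illustrated with a minimal Python sketch. It is not the pipeline's actual implementation: the function name replicate_mad, its cutoff parameter, and the toy FPKM values are assumptions made for this example. The constant 1.4826 is the usual scaling factor that makes a median absolute deviation comparable to a standard deviation for normally distributed data, which is why the score can be read as a robust log-ratio standard deviation.

```python
import math
from statistics import median


def replicate_mad(x1, x2, cutoff=1.0):
    """Robust replicate log-ratio standard deviation (hypothetical helper).

    x1, x2 : normalized FPKM values for the same genes in replicate #1 and #2.
    cutoff : genes are kept only if both replicates exceed this value (1 FPKM).
    """
    if len(x1) != len(x2):
        raise ValueError("replicates must cover the same genes")

    # Keep genes above the cut-off in both replicates, then take log2 ratios.
    log_ratios = [
        math.log2(a / b)
        for a, b in zip(x1, x2)
        if a > cutoff and b > cutoff
    ]
    if not log_ratios:
        raise ValueError("no genes pass the cut-off in both replicates")

    # MAD = 1.4826 * median | log2(X1_i / X2_i) |
    return 1.4826 * median([abs(r) for r in log_ratios])


# Toy values (not real data): the third and fifth genes fail the cut-off.
rep1 = [2.0, 8.0, 0.5, 16.0, 4.0]
rep2 = [4.0, 8.0, 3.0, 4.0, 0.2]
score = replicate_mad(rep1, rep2)
print(round(score, 4))  # 1.4826
```

In the toy example the three retained genes have absolute log2 ratios of 1, 0 and 2, so the median is 1 and the score equals the scaling constant. Two identical replicates would give a score of 0.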

Figure 1. Distribution of the replicate log-ratio standard deviation by sample type. The dashed red line shows the mean value for each sample type; the number of samples is indicated in parentheses in the legend.

References

A benchmark for RNA-seq quantification pipelines (in press)