Data Specifications

The datasets provided in this portal conform to the following specifications to facilitate comparisons across tissues. The raw data for the tissues in this portal come from multiple studies. The processed data, however, may differ from the results reported in each study for a couple reasons:

  1. Different processing decisions could be made for each study, while the data in this portal are processed uniformly to facilitate comparison across tissues.
  2. The published study results are immutable, while new versions of data in this portal may be released, for example to improve quality or to use updated reference genome and annotations.

Nevertheless, we also host the original results from individual studies on the Download page when available.

Processing

The ratgtex-pipeline repository contains code for preprocessing, QC, read mapping, and some QTL mapping code. Pantry was used to quantify RNA phenotypes and map xQTLs. The original RatGTEx pipeline is on protocols.io (this does not yet reflect the latest version that uses Pantry). The ratgtex-server-data repository contains code that processes results into additional data files for download and for the API.

Genome assembly and gene annotations used for each data release
Methods used for v4

Formats

RNA phenotype tables (including gene expression)

expr.{log2,tpm}.{tissue}.v4.bed.gz
phenos.{tissue}.{modality}.{unnorm,norm}.v4.bed.gz

BED format, with four phenotype information columns plus one column per sample. The coordinates define the cis-window around the gene TSS for xQTL mapping.

All significant cis-xQTL phenotype-SNP pairs

cis_qtl_signif.{tissue}.modality.v4_rn8.txt.gz

A compressed tab-separated table.

Strong associations from trans-eQTL mapping

trans_qtl_pairs.{tissue}.v4.txt.gz

All measured SNPs genome-wide were tested against expression of each gene, and pairs with p-value < 1e-5 and > 5 Mb TSS distance are included here. Rows are sorted by variant location.

Conditionally independent cis-xQTLs

eqtls_indep.v4_rn8.tsv
xqtls_indep.{modality}.v4_rn8.tsv

A table of cis-QTLs from all tissues. Stepwise regression was used to test for cis-QTLs beyond and uncorrelated with the top cis-QTL per gene. eqtls_indep.v4_rn8.tsv and xqtls_indep.cross_modality.v4_rn8.tsv are the same results but with formatting differences as described below. For xqtls_indep.cross_modality.v4_rn8.tsv, all six modalities were combined and mapped as one set so that xQTLs are conditionally independent across modalities per gene per tissue.

Top association per gene

top_assoc.v4_rn8.tsv
top_assoc.{modality}.v4_rn8.tsv

A table of the strongest variant-gene association per gene per tissue, even if not significant. top_assoc.v4_rn8.tsv and top_assoc.expression.v4_rn8.tsv are the same results but with formatting differences as described below. For top_assoc.cross_modality.v4_rn8.txt, all six modalities were combined and mapped as one set so that one top association of any modality per gene per tissue is shown.