This table provides GWAS associations results from the GWAS catalog, annotated by phecodes, ancestry, etc.
Format
A data.table with the following columns:
assoc_ID
: Integer vector of the unique identifier for each associationSNP_hg19
: Character vector variants encoded in hg19 genome reference buildSNP_hg38
: Character vector variants encoded in hg38 genome reference buildrsID
: Character indicating the Reference SNP cluster IDrisk_allele_dir
: Character vector specifying the risk allele direction (alt: Alternate allele, ref: Reference allele)risk_allele
: Character specifying the risk allelephecode
: Character vector of the phecode for the associationphecode_string
: Character vector of the phecode string labelcategory_string
: Character vector of the phecode categoryancestry
: Character vector of the ancestery of the source GWAS using 1000 Genomes superpopulations ancestry groupings (AFR: African, EAS: East Asian, EUR: European, AMR: AdMixed American, SAS: South Asian)cat_LOG10_P
: Numeric vector of the -log10(P) of the association from the catalogcat_OR
: Numeric value of the odds ratio of the associationcat_L95
: Numeric value of the 95% lower confidence interval of the associationcat_U95
: Numeric value of the 95% upper confidence interval of the associationStudy_accession
: Character vector of the study accession ID from the GWAS catalogpubmedid
: Numeric value of the PubMed ID for source publication of associationpub_count
: Numeric value indicating how many times association has been published in GWAS catalogfirst_pub_date
: Date of the first date the association was publishedAFR_RAF
: Numeric value of the African ancestry risk allele frequency from gnomADEUR_RAF
: Numeric value of the European (non-Finnish) ancestry risk allele frequency from gnomADEAS_RAF
: Numeric value of the East Asian ancestry risk allele frequency from gnomADAMR_RAF
: Numeric value of the Latino/Admixed American ancestry risk allele frequency from gnomADSAS_RAF
: Numeric value of the South Asian ancestry risk allele frequency from gnomADALL_RAF
: Numeric value of the risk allele frequency from gnomAD
Source
Allele frequencies: https://gnomad.broadinstitute.org/ Summary statistics: https://www.ebi.ac.uk/gwas/
Details
The odds ratio and 95% confidence intervals (columns cat_OR, cat_L95, and cat_U95) are reported relative to the risk allele which is specified in the column risk_allele.