This table provides GWAS associations results from the GWAS catalog, annotated by phecodes, ancestry, etc.
Format
A data.table with the following columns:
assoc_ID: Integer vector of the unique identifier for each associationSNP_hg19: Character vector variants encoded in hg19 genome reference buildSNP_hg38: Character vector variants encoded in hg38 genome reference buildrsID: Character indicating the Reference SNP cluster IDrisk_allele_dir: Character vector specifying the risk allele direction (alt: Alternate allele, ref: Reference allele)risk_allele: Character specifying the risk allelephecode: Character vector of the phecode for the associationphecode_string: Character vector of the phecode string labelcategory_string: Character vector of the phecode categoryancestry: Character vector of the ancestery of the source GWAS using 1000 Genomes superpopulations ancestry groupings (AFR: African, EAS: East Asian, EUR: European, AMR: AdMixed American, SAS: South Asian)cat_LOG10_P: Numeric vector of the -log10(P) of the association from the catalogcat_OR: Numeric value of the odds ratio of the associationcat_L95: Numeric value of the 95% lower confidence interval of the associationcat_U95: Numeric value of the 95% upper confidence interval of the associationStudy_accession: Character vector of the study accession ID from the GWAS catalogpubmedid: Numeric value of the PubMed ID for source publication of associationpub_count: Numeric value indicating how many times association has been published in GWAS catalogfirst_pub_date: Date of the first date the association was publishedAFR_RAF: Numeric value of the African ancestry risk allele frequency from gnomADEUR_RAF: Numeric value of the European (non-Finnish) ancestry risk allele frequency from gnomADEAS_RAF: Numeric value of the East Asian ancestry risk allele frequency from gnomADAMR_RAF: Numeric value of the Latino/Admixed American ancestry risk allele frequency from gnomADSAS_RAF: Numeric value of the South Asian ancestry risk allele frequency from gnomADALL_RAF: Numeric value of the risk allele frequency from gnomAD
Source
Allele frequencies: https://gnomad.broadinstitute.org/ Summary statistics: https://www.ebi.ac.uk/gwas/
Details
The odds ratio and 95% confidence intervals (columns cat_OR, cat_L95, and cat_U95) are reported relative to the risk allele which is specified in the column risk_allele.