Genes for covid-19 resilience: identification of DNA markers corresponding to coronavirus resistance and susceptibility
 
Coronaviruses (CoVs) (order Nidovirales, family Coronaviridae,
                                    subfamily
                                    Coronavirinae) are responsible for respiratory disease outbreaks in many
                                    vertebrate species.
                                    They are a large family of single-stranded RNA enveloped viruses (+ssRNA) that can
                                    be isolated in
                                    different animal species. They have genome sizes ranging between 26 to 32 kilobases
                                    (kb) in length,
                                    being the largest genomes for RNA viruses (consequently increasing the effectiveness
                                    of facemasks).
                                    COVID-19 also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2),
                                    or "novel
                                    coronavirus 2019" is a new virus
                                    and we are just beginning to understand resistance and susceptibility in humans. 
COVID-19 is similar to severe acute respiratory syndrome (SARS) in the respect
                                    that both viruses
                                    infect their human hosts via the same receptor, the angiotensin-converting enzyme 2
                                    (ACE2 receptor),
                                    and cause similar clinical and pathological features.
                                    Interestingly the spike protein which is responsible for receptor binding is highly
                                    similar between
                                    2019-nCoV and SARS-CoV, this is a result of significant selection for the same
                                    receptor (Wu.,
                                        2020). Research into how
                                    our bodies defend against SARS might reveal how our bodies could defend against
                                    covid-19.
                                
Several recent Genome Wide Association Studies (GWAS) have provided a much deeper
                                    insight into
                                    the genetic variations that can help to explain why some individuals are virtually
                                    unaffected by
                                    COVID-19, and for others the virus is life threatening or even fatal. 
In this post, we provide a review of the peer reviewed literature and present information about candidate genes for SARS-CoV resistance. If you have taken an at home DNA test like those available from 23andMe, Ancestry DNA, Dante labs, you can evaluate your raw DNA data and see how your DNA sequence compares with the research findings.
¿Cómo analizar tu ADN para resistencia o susceptibilidad al coronavirus?
                                
Step 1) Download your raw autosomal DNA file and save it to a safe and secure location
To analyze your DNA data, start by downloading your raw autosomal DNA and save it to a safe location. Here are instructions to download your raw DNA file from: 23andMe, Ancestry DNA, Family Tree DNA, Dante Labs, My Heritage, Genes For Good, Vitagene, and Living DNA.
Paso 2) Analiza tu archivo de ADN crudo
Search your raw DNA data using a text editor such as "text wrangler" or "notepad" using the "find" function, or using command line.
Open your raw DNA file and you will notice the headers of unique SNP ID (rs# or i#), chromosome, position and genotype. The formats differ slightly between each direct to consumer DNA testing company.
To evaluate your risk of poor recovery from the COVID-19 have a look for these DNA markers described below:
Several Genome Wide Association Study (GWAS) were recently published that describe loci associated with respiratory failure in patients infected with SARS-CoV-2 and three studies identified SNP markers in the same ~50 kb genomic segment that is inherited from Neandertals (Ellinghaus D et al., 2020, Zeberg & Pääbo, and Blokland et al., 2020). In addition these GWAS studies also identified a number of other DNA markers that are associated with COVID-19, and each of these are presented in the tabel below.
In addition, other DNA markers covered in this post include rs4804803 which was associated with SARS, and those positioned in the angiotensin-converting enzyme-2 (ACE2) receptor which was proved to be the same receptor for the human respiratory coronavirus NL63, , SARS-coronavirus (SARS-CoV), and the novel coronavirus 2019-nCoV/SARS-CoV (Li et al., 2017; Lu et al., 2019). Since the spike protein of the coronavirus has evolved to match the ACE2 receptor, it's likely that individuals with variations that alter the protein sequence would result in a degree of resistance to covid-19. Below are non-synonymous SNPs from the ACE2 transcript NM_021804.2 and of particular interest are SNPs that cause major changes like rs199951323 which results in a premature stop codon.
| Gene" Gene | dbsnp | Cromosoma (GRCh37) | POS | REF | ALT | Alelos de Riesgo | Efecto de marcador | Referencia | 
|---|---|---|---|---|---|---|---|---|
| IVNS1ABP | rs6668622 | 1 | 185414582 | T | C | Variante susceptible T:T y T:C en hombress | Relación de probabilidades 1.44 | Roberts., 2020; | 
| SRRM1 | rs111972040 | 1 | 24999361 | A | G | Los genotipos de riesgo G:G y A:G, variante de 3_prime_UTR. | Razón de probabilidades para hospitalización = 8.29 | |
| LZTFL1 | rs35044562 | 3 | 45909024 | A | G | Genotipos de riesgo G:G y A:G, variante intronal, variante transcripcional upstream génica. | odds ratio 1.60 | Blokland et al., 2020; Zeberg & Pääbo | 
| LZTFL1 | rs11385942 | 3 | 45876460 | A | - or A or AAA | InDel, A:A and A:- Tienen mayor susceptibilidad a la insuficiencia respiratoria, variante intronal. | odds ratio 1.77 | Ellinghaus et al., 2020; Roberts., 2020 | 
| LZTFL1 | rs10490770 *LD with rs11385942 | 3 | 45864732 | T | C | Los genotipos de riesgo T:C y C:C, variante intronal. | Razón de probabilidades para portadores heterocigotos 1.7 | Zeberg and Pääbo., 2020; | 
| LZTFL1 | rs67959919 *LD with rs11385942 | 3 | 45871908 | G | A | Los genotipos de riesgo G:A y A:A, variante intronal. | Razón de probabilidades para portadores heterocigotos 1.7 | |
| LZTFL1 | rs35624553 *LD with rs11385942 | 3 | 45867440 | A | G | Los genotipos de riesgo G:A y G:G, variante intronal. | Razón de probabilidades para portadores heterocigotos 1.7 | |
| LZTFL1 | rs71325088 *LD with rs11385942 | 3 | 45862952 | T | C | Los genotipos de riesgo C:T y C:C, variante intronal. | Razón de probabilidades para portadores heterocigotos 1.7 | |
| ABO | rs657152 | 9 | 136139265 | A | C or T | Una alelo de riesgo, variante intronal | odds ratio 1.77 | Ellinghaus et al., 2020; Roberts., 2020 | 
| Intergenic | rs5798227 | 12 | 53120100 | C | - | El alelo de riesgo es la eliminación | p = 2.2x10-7 | Blokland et al., 2020; | 
| IGHV3-7 | rs11844522 | 14 | 106522576 | C | T | Variaciones susceptibles T:T, C:T | p=1.9x10-7 | |
| Immunoglobulin Lambda Locus (IGL) | rs73166864 | 22 | 23340580 | T | C or G | Variaciones susceptibles T:T y T:C | odds ratio 1.7 | Roberts., 2020; | 
| TLR7 | rs200553089 | ChrX | 12906010 | G | T | Los genotipos de riesgo T:G y T:T, variante de sentido literal. | Made et al., 2020; | |
| SNPs sinónimos ubicados en ACE2 | ||||||||
| ACE2 | rs373153165 | chrX | 15580093 | C | T or A | Variante missense | p.Asp785Asn/c.2353G>A | Cao et al., 2020 | 
| ACE2 | rs140016715 | chrX | 15582154 | G | A | Variante missense | p.Arg768Trp/c.2302C>T | |
| ACE2 | rs147311723 | chrX | 15582265 | G | A | Variante missense | p.Leu731Phe/c.2191C>T | |
| ACE2 | rs41303171 | chrX | 15582298 | T | C | Variante missense | p.Asn720Asp/c.2158A>G | |
| ACE2 | rs370187012 | chrX | 15582327 | C | T | Variante missense | p.Arg710His/c.2129G>A | |
| ACE2 | rs776995986 | chrX | 15582334 | G | A | Variante missense | p.Arg708Trp/c.2122C>T | |
| ACE2 | rs149039346 | chrX | 15584416 | A | G | Variante missense | p.Ser692Pro/c.2074T>C | |
| ACE2 | rs200180615 | chrX | 15584488 | C | T | Variante missense | p.Glu668Lys/c.2002G>A | |
| ACE2 | * | chrX | 15585879 | A | C | stop_gained | p.Leu656*/c.1967T>G | |
| ACE2 | rs183135788 | chrX | 15585933 | T | C | Variante missense | p.Asn638Ser/c.1913A>G | |
| ACE2 | rs748163894 | chrX | 15588434 | G | A | Variante missense | ||
| ACE2 | rs202137736 | chrX | 15591485 | T | C | Variante de región de splice + variante de intron. | c.1541+5A>G | |
| ACE2 | rs140473595 | chrX | 15591530 | C | T | Variante missense | p.Ala501Thr/c.1501G>A | |
| ACE2 | rs191860450 | chrX | 15593829 | T | C | Variante missense | p.Ile468Val/c.1402A>G | |
| ACE2 | rs758142853 | chrX | 15609868 | A | G | Variante missense | p.Val184Ala/c.551T>C | |
| ACE2 | rs754511501 | chrX | 15609902 | C | T | Variante missense | p.Gly173Ser/c.517G>A | |
| ACE2 | rs746034076 | chrX | 15609943 | T | C | Variante missense | p.Asn159Ser/c.476A>G | |
| ACE2 | rs373252182 | chrX | 15609973 | T | C | Variante missense | p.Asn149Ser/c.446A>G | |
| ACE2 | rs2285666 | chrX | 15610348 | C | T | Variante de región de splice + variante de intron. | c.439+4G>A | |
| ACE2 | rs768736934 | chrX | 15612963 | C | T | Variante de región de splice + variante de intron. | c.345+5G>A | |
| ACE2 | rs4646116 | chrX | 15618958 | T | C | Variante missense | p.Lys26Arg/c.77A>G | |
| ACE2 | rs73635825 | chrX | 15618980 | A | G | Variante missense | p.Ser19Pro/c.55T>C | |
| SNPs asociados con SARS | ||||||||
| CD209 | rs4804803 | 19 | 7812733 | A | G | Genotipo susceptible A:A, variante de transcripto upstream. | NC_000019.10:7747846 | Sakuntabhai et al., 2005; Chan et al., 2010 | 
