7.3. Data#

7.3.2. Exercises#

I have queried the Ensembl database for some haemoglobins from Human and Macaque. For each database record, I’ve used Ensembl’s definition of the “canonical transcript” (or “major” transcript). I’ve then obtained the CDS (coding sequence) for each record. I named the sequences {Species Common Name}-{a letter}. I then aligned the sequences.

0
Human-jATGGTGCATCTGACTCCTGAGGAGAAGTCT
Human-z......---...T......CC..C...A.C
Rhesus-e......---...T......CC..C...AGC
Rhesus-x...........................AA.

4 x 447 (truncated to 4 x 30) dna alignment

  1. Which of the above are orthologs?

  2. Which are paralogs?

  3. For these sequences, how would you describe the divergence between orthologs compared to that between paralogs?


Citations

[Sea03]

David B. Searls. Pharmacophylogenomics: genes, evolution and drug targets. Nature Reviews Drug Discovery, 2:613–623, 2003. URL: https://doi.org/10.1038/nrd1152, doi:10.1038/nrd1152.