Search in:

Artificial DNA: PNA & XNA Volume 5, 2014 - Issue 2

Journal homepage

Free access

1,028

Views

CrossRef citations to date

Altmetric

Listen

Commentary

The genetic code

Rewritten, revised, repurposed

Roy D Sleator Department of Biological Sciences; Cork Institute of Technology; Cork, IrelandCorrespondence[email protected]

Article: e29408 | Received 29 May 2014, Accepted 30 May 2014, Published online: 17 Jun 2014

Cite this article
https://doi.org/10.4161/adna.29408
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Abstract

Despite remaining apparently frozen through the millennia, the genetic code is far more flexible than previously believed and can be extended and repurposed with relative ease.

Keywords: :

synthetic DNA
recoding
unnatural base pairs (UBPs)
nonstandard amino acids (NSAAs)

Despite the fact that there are more than 100 amino acids observed in nature, only 20 are encoded by the canonical genetic code of 61 sense codons and 3 stop codons. Because sense codons outnumber their encoded amino acids by a ratio of 3:1, the genetic code is redundant; with most amino acids coded for by more than one codon.Citation¹ This degeneracy is well documented, with certain organisms having evolved preferences for specific codon-amino acid combinations.Citation² However, despite this inherent flexibility, our natural amino acid repertoire represents less than 20% that which exists in nature; leading Francis Crick to suggest that the code is a “frozen accident.”Citation³ However, several hot papers have emerged in recent years which have led to a significant thaw in this concept of a “frozen” code.

In one of the earliest successful attempts to extend or rewrite the code, Sakamoto and colleagues undertook a process of genetic recodingCitation⁴; forcing specific codons to code for alternative or nonstandard amino acids (NSAAs). Sakamoto’s team converted the TAG stop codon in 7 essential Escherichia coli genes to TAA; eliminated release factor 1 (RF1; which terminates translation at UAA and UAG) and supplied a tRNA that inserts a glutamine when it encounters UAG. Following proof of concept, with a canonical amino acid, the team repeated the experiment reassigning TAG to the NSAA iodotyrosine. Despite the experiment being a success, with all 7 targeted genes terminating properly, all the remaining genes ending in TAG failed to terminate correctly in the absence of RF1.

Lajoie et al.,Citation⁵ overcame this ‘read through’ limitation by employing an in vivo genome-editing approachCitation⁶; replacing all 321 instances of TAG (the rarest codon in the E. coli strain tested) with TAA. The resulting organism described as a genomically recoded organism (GRO), represents a new class of genetically modified organism (GMO) and a potentially important platform for novel drug production. Indeed, in support of the application of GROs as industrial protein production systems, Lajoie et al.,Citation⁵ successfully reassigned TAG to a prephosphorlyated serine – a modification found on serines of the recombinant human growth hormone.Citation⁷ Furthermore, the GRO exhibited increased resistance to T7 bacteriophage; a highly desirable trait in large scale industrial processes which are otherwise susceptible to phage attack.Citation⁸ This observed phage resistance prompted the authors to suggest that genetic recoding may lead to viral protein mistranslation.

In a second paper in the same issue of Science, Lajoie et al.,Citation⁹ investigated the effect of recoding sense codons; removing all instances of 13 rare codons from 42 highly expressed essential genes (including all 41 essential ribosomal protein-coding genes and prfB) across 80 E. coli strains. Despite several genome design constraints, growth defects, and the fact that replacement of synonymous codons occasionally did not produce the same effects as the native codon; genome-wide reassignment of sense codons was at least shown to be possible.

In addition to these laboratory based recoding successes, we are beginning to see more and more variation in the genetic code of natural organisms.Citation¹⁰ Indeed, a recent large scale analysis of stop codon reassignments in the wild revealed far higher recoding rates than previously imagined.Citation¹¹ Investigating > 1,700 environmental samples (including 750 samples from 17 human body sites), Ivanova et al.,Citation¹¹ scanned ~5.6 trillion bp of metagenomic data for stop codon reassignment. Contrary to the previously held belief that natural recoding is rare; the authors report a total of 198 Mb of recoded DNA data. Interestingly, the human body despite accounting for only 10% of DNA present represented 51% of all codon reassignments. Furthermore, distinct patterns of stop codon reassignment were observed in all 3 domains of life, with bacteria showing only opal reassignments, while extensive opal and amber reassignments occurred in phages. The observed high rate of recoding among phage suggests that, contrary to the findings of Lajoie et al.,Citation⁵^,Citation⁹ phages are not obliged to adapt to the codon usage of their hosts, but rather exploit differences in codon usage to manipulate their hosts.

While genetic recoding or rewriting is still restricted by our existing dependency on the 4 natural nucleotides A, T, G, and C; an alternative approach to extending or revising the code involves the use of unnatural base pairs (UBPs), allowing us to incorporate up to 152 additional non-canonical amino acids. Over the past 15 y, Romesberg and colleagues at the Scripps Research Institute, having synthesized and tested more than 300 artificial nucleotides, developed a class of UBPs, exemplified by d5SICS-dNaM (abbreviated as X and Y), formed between nucleotides bearing hydrophobic nucleobases.Citation¹² Romesberg’s group recently proved that it is possible to stably incorporate X and Y into the DNA of actively growing E. coli, creating the first organism to stably propagate an expanded genetic alphabet.Citation¹³ It is hoped that this expanded DNA alphabet will help to build an expanded translational alphabet; encoding more and more NSAAs, ultimately enabling the synthesis of new and improved proteins.

In addition to being rewritten and revised, perhaps the most innovative use of the genetic code in recent times is its deliberate repurposing as a high capacity storage medium. With a theoretical storage potential of 455 exabytes per gram ssDNA,Citation¹⁴ it is estimated that all of the world’s projected 40 ZB of data could be stored in just ~90 g of DNA.Citation¹⁵ Some of the earliest attempts to use DNA as a workable canvas for archival purposes include Joe Davis’ Microvenus; a 35 bit coded visual icon representing the external female genitalia.Citation¹⁶ More recently, construction of JCVI-syn1.0, the first bacterial cell to contain a completely synthetic genome, employed “watermarks” to distinguish the synthetic genome from native DNA. These 7,920 bit watermarks contain a web address, the names of the paper’s authors and some memorable quotations.Citation¹⁷

Large scale data storage in DNA was first achieved by Church and colleaguesCitation¹⁴ who described the conversion of html-coded data to DNA code using a 1 bit per base encoding (A,C = 0; T,G = 1); allowing the conversion of Church’s book Regenesis (including 53,426 words, 11 JPG images and 1 JavaScript program) into DNA sequence. In an effort to reduce error and facilitate up-scaling, Goldman et al.Citation¹⁸., described a modified strategy achieving a storage density of ~2.2 PB/g DNA (Equivalent to ~468,000 DVDs). This modified approach first converts the original file type to binary code (0, 1) which is then converted to a ternary code (0, 1, 2) and in turn to the triplet DNA code. Replacing each trit with 1 of the 3 nucleotides different from the preceding one (i.e. A, T, or C, if the preceding one is G) ensures that no homopolymers are generated – significantly reducing high throughput sequencing errors.Citation¹⁹ Based on a fixed string length (data and indexing) of 117 nt, Goldman et al.Citation¹⁸., suggest that DNA-based storage currently remains feasible even at several orders of magnitude greater than current global data volumes. This, combined with the likely expectation of significantly longer string synthesis as the technology progresses,Citation²⁰ virtually future proofs DNA as a viable big data storage medium.Citation²¹ Furthermore, while the above strategies focus on maintaining DNA in vitro, we have previously postulated that in vivo storage may also be a viable and perhaps even more desirable option.Citation²²

Therefore, despite remaining apparently frozen through the millennia, advances like those described above, have revealed a code that is far more flexible than we could previously have hoped to believe; a code which we can extend and repurpose with relative ease. While it is difficult to predict future directions in this particular field of synthetic biology,Citation²³ it is clear that several exciting possibilities exist. One prospect is the synthesis of completely novel species; designed and synthesized using the principles described previously,Citation¹⁷^,Citation²⁴ yet potentially running multiple genetic codes concurrently. Such hybrid constructs can be thought of as analogous to a computer running multiple operating systems in parallel; each designed for a specific purpose. The native code (consisting of A, T G, and C) would run normal cellular processes, required for growth and reproduction, while the parallel synthetic code (incorporating X and Y) would allow the cell to act as a micro-factory, producing new proteins with novel applications in industry and medicine. Finally, the third partitioned code might contain the manufacturer’s instructions, or user’s manual, digitally encoded in the DNA.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Acknowledgments

RDS is coordinator of the EU FP7 project ClouDx-i.

10.4161/adna.29408

References

Sleator RD. Proteins: form and function. Bioeng Bugs 2012; 3:80 - 5; http://dx.doi.org/10.4161/bbug.18303; PMID: 22095055
PubMedGoogle Scholar
Johnston C, Douarre PE, Soulimane T, Pletzer D, Weingart H, MacSharry J, Coffey A, Sleator RD, O’Mahony J. Codon optimisation to improve expression of a Mycobacterium avium ssp. paratuberculosis-specific membrane-associated antigen by Lactobacillus salivarius. Pathog Dis 2013; 68:27 - 38; http://dx.doi.org/10.1111/2049-632X.12040; PMID: 23620276
PubMed Web of Science ®Google Scholar
Sella G, Ardell DH. The coevolution of genes and genetic codes: Crick’s frozen accident revisited. J Mol Evol 2006; 63:297 - 313; http://dx.doi.org/10.1007/s00239-004-0176-7; PMID: 16838217
PubMed Web of Science ®Google Scholar
Mukai T, Hayashi A, Iraha F, Sato A, Ohtake K, Yokoyama S, Sakamoto K. Codon reassignment in the Escherichia coli genetic code. Nucleic Acids Res 2010; 38:8188 - 95; http://dx.doi.org/10.1093/nar/gkq707; PMID: 20702426
PubMed Web of Science ®Google Scholar
Lajoie MJ, Rovner AJ, Goodman DB, Aerni HR, Haimovich AD, Kuznetsov G, Mercer JA, Wang HH, Carr PA, Mosberg JA, et al. Genomically recoded organisms expand biological functions. Science 2013; 342:357 - 60; http://dx.doi.org/10.1126/science.1241459; PMID: 24136966
PubMed Web of Science ®Google Scholar
Isaacs FJ, Carr PA, Wang HH, Lajoie MJ, Sterling B, Kraal L, Tolonen AC, Gianoulis TA, Goodman DB, Reppas NB, et al. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science 2011; 333:348 - 53; http://dx.doi.org/10.1126/science.1205822; PMID: 21764749
PubMed Web of Science ®Google Scholar
Levarski Z, Soltýsová A, Krahulec J, Stuchlík S, Turňa J. High-level expression and purification of recombinant human growth hormone produced in soluble form in Escherichia coli. Protein Expr Purif 2014; Forthcoming http://dx.doi.org/10.1016/j.pep.2014.05.003; PMID: 24859479
PubMed Web of Science ®Google Scholar
Sturino JM, Klaenhammer TR. Engineered bacteriophage-defence systems in bioprocessing. Nat Rev Microbiol 2006; 4:395 - 404; http://dx.doi.org/10.1038/nrmicro1393; PMID: 16715051
PubMed Web of Science ®Google Scholar
Lajoie MJ, Kosuri S, Mosberg JA, Gregg CJ, Zhang D, Church GM. Probing the limits of genetic recoding in essential genes. Science 2013; 342:361 - 3; http://dx.doi.org/10.1126/science.1241460; PMID: 24136967
PubMed Web of Science ®Google Scholar
Prat L, Heinemann IU, Aerni HR, Rinehart J, O’Donoghue P, Söll D. Carbon source-dependent expansion of the genetic code in bacteria. Proc Natl Acad Sci U S A 2012; 109:21070 - 5; http://dx.doi.org/10.1073/pnas.1218613110; PMID: 23185002
PubMed Web of Science ®Google Scholar
Ivanova NN, Schwientek P, Tripp HJ, Rinke C, Pati A, Huntemann M, Visel A, Woyke T, Kyrpides NC, Rubin EM. Stop codon reassignments in the wild. Science 2014; 344:909 - 13; http://dx.doi.org/10.1126/science.1250691; PMID: 24855270
PubMed Web of Science ®Google Scholar
Malyshev DA, Dhami K, Quach HT, Lavergne T, Ordoukhanian P, Torkamani A, Romesberg FE. Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet. Proc Natl Acad Sci U S A 2012; 109:12005 - 10; http://dx.doi.org/10.1073/pnas.1205176109; PMID: 22773812
PubMed Web of Science ®Google Scholar
Malyshev DA, Dhami K, Lavergne T, Chen T, Dai N, Foster JM, Corrêa IR Jr., Romesberg FE. A semi-synthetic organism with an expanded genetic alphabet. Nature 2014; 509:385 - 8; http://dx.doi.org/10.1038/nature13314; PMID: 24805238
PubMed Web of Science ®Google Scholar
Church GM, Gao Y, Kosuri S. Next-generation digital information storage in DNA. Science 2012; 337:1628; http://dx.doi.org/10.1126/science.1226355; PMID: 22903519
PubMed Web of Science ®Google Scholar
O’ Driscoll A, Sleator RD. Synthetic DNA: the next generation of big data storage. Bioengineered 2013; 4:123 - 5; http://dx.doi.org/10.4161/bioe.24296; PMID: 23514938
PubMed Web of Science ®Google Scholar
Sleator RD. The story of Mycoplasma mycoides JCVI-syn1.0: the forty million dollar microbe. Bioeng Bugs 2010; 1:229 - 30; http://dx.doi.org/10.4161/bbug.1.4.12465; PMID: 21327053
PubMedGoogle Scholar
Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang RY, Algire MA, Benders GA, Montague MG, Ma L, Moodie MM, et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science 2010; 329:52 - 6; http://dx.doi.org/10.1126/science.1190719; PMID: 20488990
PubMed Web of Science ®Google Scholar
Goldman N, Bertone P, Chen S, Dessimoz C, LeProust EM, Sipos B, Birney E. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature 2013; 494:77 - 80; http://dx.doi.org/10.1038/nature11875; PMID: 23354052
PubMed Web of Science ®Google Scholar
Niedringhaus TP, Milanova D, Kerby MB, Snyder MP, Barron AE. Landscape of next-generation sequencing technologies. Anal Chem 2011; 83:4327 - 41; http://dx.doi.org/10.1021/ac2010857; PMID: 21612267
PubMed Web of Science ®Google Scholar
Fehér T, Burland V, Pósfai G. In the fast lane: large-scale bacterial genome engineering. J Biotechnol 2012; 160:72 - 9; http://dx.doi.org/10.1016/j.jbiotec.2012.02.012; PMID: 22406111
PubMed Web of Science ®Google Scholar
O’Driscoll A, Daugelaite J, Sleator RD. ‘Big data’, Hadoop and cloud computing in genomics. J Biomed Inform 2013; 46:774 - 81; http://dx.doi.org/10.1016/j.jbi.2013.07.001; PMID: 23872175
PubMed Web of Science ®Google Scholar
Sleator RD, O’Driscoll A. Digitizing humanity. Artif DNA PNA XNA 2013; 4:37 - 8; http://dx.doi.org/10.4161/adna.25489; PMID: 23912716
PubMedGoogle Scholar
Sleator RD. The synthetic biology future. Bioengineered 2014; 5:5; http://dx.doi.org/10.4161/bioe.28317; PMID: 24561910
PubMedGoogle Scholar
Annaluru N, Muller H, Mitchell LA, Ramalingam S, Stracquadanio G, Richardson SM, Dymond JS, Kuang Z, Scheifele LZ, Cooper EM, et al. Total synthesis of a functional designer eukaryotic chromosome. Science 2014; 344:55 - 8; http://dx.doi.org/10.1126/science.1249252; PMID: 24674868
PubMed Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

The genetic code

Rewritten, revised, repurposed

Abstract

Disclosure of Potential Conflicts of Interest

Acknowledgments

References

Information for

Open access

Opportunities

Help and information

The genetic code

Rewritten, revised, repurposed

Abstract

Disclosure of Potential Conflicts of Interest

Acknowledgments

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date