Ilardo, MelissaBose, RudrarupMeringer, MarkusRasulev, BakhtiyorGrefenstette, NatalieStephenson, JamesFreeland, StephenGillams, Richard J.Butch, Christopher J.Cleaves II, James2019-10-092019-10-092019-08-28Ilardo, Melissa ; Bose, Rudrarup ; Meringer, Markus ; Rasulev, Bakhtiyor ; Grefenstette, Natalie ; Stephenson, James ; Freeland, Stephen ; Gillams, Richard J.; Butch, Christopher J. ; Cleaves, H. James ; Adaptive Properties of the Genetically Encoded Amino Acid Alphabet Are Inherited from Its Subsets ; Scientific Reports 12468,9 ; https://doi.org/10.1038/s41598-019-47574-x ;https://doi.org/10.1038/s41598-019-47574-xhttp://hdl.handle.net/11603/14995Life uses a common set of 20 coded amino acids (CAAs) to construct proteins. This set was likely canonicalized during early evolution; before this, smaller amino acid sets were gradually expanded as new synthetic, proofreading and coding mechanisms became biologically available. Many possible subsets of the modern CAAs or other presently uncoded amino acids could have comprised the earlier sets. We explore the hypothesis that the CAAs were selectively fixed due to their unique adaptive chemical properties, which facilitate folding, catalysis, and solubility of proteins, and gave adaptive value to organisms able to encode them. Specifically, we studied in silico hypothetical CAA sets of 3–19 amino acids comprised of 1913 structurally diverse α-amino acids, exploring the adaptive value of their combined physicochemical properties relative to those of the modern CAA set. We find that even hypothetical sets containing modern CAA members are especially adaptive; it is difficult to find sets even among a large choice of alternatives that cover the chemical property space more amply. These results suggest that each time a CAA was discovered and embedded during evolution, it provided an adaptive value unusual among many alternatives, and each selective step may have helped bootstrap the developing set to include still more CAAs.9 pagesen-USThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.Attribution 4.0 International (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/AstrobiologyCheminformaticsComputational chemistryChemical origin of lifeProteinsAdaptive Properties of the Genetically Encoded Amino Acid Alphabet Are Inherited from Its SubsetsText