Chun Shen Lim, PhD, Postdoctoral Fellow, Department of Biochemistry, School of Biomedical Sciences, University of Otago
Recombinant protein production is a widely used technique, yet half of these experiments fail at the expression phase and only a quarter of target proteins are successfully purified. We have discovered that the energetics of RNA structure ensembles, that model the 'accessibility' of translation initiation sites, accurately predicts the expression outcomes of 11,430 recombinant protein production experiments in Escherichia coli. We have further discovered that normalised B-factors, that model the 'flexibility' of amino acid residues, accurately predicts the solubility of 12,158 recombinant proteins expressed in Escherichia coli. We have optimised these B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the ‘Solubility-Weighted Index’ (SWI). We have developed TIsigner (Translation Initiation coding region designer) and SoDoPE (Soluble Domain for Protein Expression) that allows users to choose a protein region of interest for optimising expression and solubility, respectively. The final results will suggest synonymous codon changes within the first few codons of the DNA fragments of interest, meaning that gene optimisation can be done using standard PCR cloning.