AAV genome truncations are a major challenge in gene therapy development, reducing production lot yields and increasing manufacturing costs. The ability to reliably and reproducibly predict the propensity of vector designs for producing truncated genomes early in discovery can mitigate manufacturing challenges later in development, saving significant time and cost. Here, we investigated whether AI-based predictions of AAV genome truncation propensity reliably and reproducibly correlate with long-read sequencing data.
We used FORMsightAI to predict the truncation propensity of 42 AAV vectors and then used PacBio long-read sequencing to sequence the genomes of the vectors. We then compared the FORMsightAI-predictions with long-read sequencing data to evaluate the reliability and reproducibility of the predictions.
We observed a strong correlation between FORMsightAI predictions for genome truncation propensity and long-read sequencing of the vectors (Figure 1). Furthermore, an overlay of the FORMsightAI predictions with long-read sequencing data for the individual 42 vectors demonstrated that the predictions were consistent and accurate with the actual sequencing data (Figure 2).