Improving Bayesian methods for identifying antigenic sites

We’ve uploaded a new manuscript to the arXiv, titled “Improving the identification of antigenic sites in the H1N1 Influenza virus through accounting for the experimental structure in a sparse hierarchical Bayesian model”, which describes an improved model for identifying antigenic sites.

This work, led by Vinny Davies, builds upon a conceptually similar model that we published in Computational Statistics earlier in the year. We again use spike-and-slab priors to identify the amino acid positions where substitutions change the antigenic phenotype of a virus, potentially allowing the virus to evade pre-existing immunity. The approach is described in general terms in a previous post on Bayesian identification of sites responsible for antigenic change.
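For readers unfamiliar with spike-and-slab priors, here is a minimal sketch of the general idea in Python. It is not the model from the manuscript: the function name, the inclusion probability and the slab standard deviation are all illustrative assumptions. Each amino acid position gets a binary indicator; if the indicator is 0 the position's effect is exactly zero (the "spike"), otherwise the effect is drawn from a wide normal distribution (the "slab").

```python
import random


def sample_spike_and_slab(n_positions, inclusion_prob, slab_sd, rng):
    """Draw one sample of (indicators, effects) from a spike-and-slab prior.

    All parameter names and values are hypothetical, chosen for illustration.
    """
    # Indicator is 1 when the position is "selected" (allowed a non-zero effect).
    indicators = [1 if rng.random() < inclusion_prob else 0
                  for _ in range(n_positions)]
    # Spike: exactly zero. Slab: zero-mean normal with standard deviation slab_sd.
    effects = [rng.gauss(0.0, slab_sd) if g else 0.0 for g in indicators]
    return indicators, effects


rng = random.Random(0)  # fixed seed so the draw is reproducible
indicators, effects = sample_spike_and_slab(
    n_positions=10, inclusion_prob=0.2, slab_sd=1.0, rng=rng
)
for position, (g, b) in enumerate(zip(indicators, effects), start=1):
    print(position, g, round(b, 3))
```

In a full analysis the indicators are inferred from the data rather than drawn from the prior, and the posterior probability that an indicator equals 1 is what flags a position as a candidate antigenic site.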

The model described in this new manuscript extends the previously published model by introducing latent variables that represent the underlying HI titre for each pair of reference and test virus. Taking the structure of the data into account in this way improves both the accuracy and the computational efficiency of the model. When the model is applied to data from influenza A(H1N1) viruses (pre-2009 pandemic), the majority of selected positions fall in previously described antigenic sites. The figure below shows the proximity of three plausible selected candidate positions (66, 146 and 252) to the known antigenic sites (dark grey) on a surface representation of the haemagglutinin structure of A/Puerto Rico/8/34 (H1N1).
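To make the "experimental structure" concrete, the sketch below simulates repeated HI measurements that share one latent titre per reference and test virus pair. It is only an illustration under stated assumptions: the virus names, the log-scale titre values, the normal measurement noise and the helper functions are all hypothetical, and the manuscript's actual likelihood may differ.

```python
import random
from collections import defaultdict


def simulate_hi_measurements(latent_titres, n_replicates, noise_sd, rng):
    """Return (pair, measured_titre) records; replicates share a latent titre."""
    records = []
    for pair, latent in latent_titres.items():
        for _ in range(n_replicates):
            # Assumed noise model: measurement = latent titre + normal error.
            records.append((pair, rng.gauss(latent, noise_sd)))
    return records


def group_by_pair(records):
    """Collect the measurements belonging to each (reference, test) pair."""
    grouped = defaultdict(list)
    for pair, value in records:
        grouped[pair].append(value)
    return dict(grouped)


# Hypothetical latent titres (treated as log-scale values) for three pairs.
latent_titres = {
    ("refA", "testX"): 8.0,
    ("refA", "testY"): 5.5,
    ("refB", "testX"): 6.0,
}
rng = random.Random(1)
records = simulate_hi_measurements(latent_titres, n_replicates=4,
                                   noise_sd=0.5, rng=rng)
grouped = group_by_pair(records)
pair_means = {pair: sum(v) / len(v) for pair, v in grouped.items()}
for pair, mean in pair_means.items():
    print(pair, round(mean, 2), "latent:", latent_titres[pair])
```

The point of the grouping step is that substitution effects then only need to explain one value per virus pair, rather than every individual measurement, which is one plausible reading of why a latent-variable formulation can reduce computation.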


To read about Vinny’s work in more depth, have a look at his thesis.