Skip to main content
image-description

Stephen Altschul, PhD

About

Research Interests

Correlations in amino acid usage among sequence positions are evident in very large multiple sequence alignments (MSAs). Two distinct hypotheses for how these correlations arise lead to distinct mathematical approaches to their description, recognition and analysis. The first imagines the homologous proteins within a large MSA as having a common three-dimensional structure, and that correlations are due to the physical interaction of residues near to one another within this structure. This approach, in which correlations are modeled directly using pairwise coupling terms, has been extensively studied for many years. It has gained notable recent success with the introduction of Direct Coupling Analysis (DCA), which mitigates the confounding effects of indirect correlations, in which contacting positions i & j and j & k, lead to correlation between non-contacting positions i & k. The second hypothesis imagines the homologous proteins within a large MSA as falling into related families and sub-families, whose divergent but related functions impose different constraints on their constituent members. Under this model, correlations can be completely explained by the hierarchical structure of family and subfamily divergence, without the need to assume correlations between sequence positions within any particular subfamily. The MSA definition and associated statistical model that correspond to this view have been much less widely studied, and have been a principle focus of my recent research.

Publications

Neuwald AF, Altschul SF. Statistical investigations of protein residue direct couplings. PLoS Comput Biol. 2018 Dec;14(12):e1006237. doi: 10.1371/journal.pcbi.1006237. eCollection 2018 Dec. PubMed PMID: 30596639; PubMed Central PMCID: PMC6329532.

Shah N, Altschul SF, Pop M. Outlier detection in BLAST hits. Algorithms Mol Biol. 2018;13:7. doi: 10.1186/s13015-018-0126-3. eCollection 2018. PubMed PMID: 29588650; PubMed Central PMCID: PMC5863388.

Altschul SF, Neuwald AF. Initial Cluster Analysis. J Comput Biol. 2018 Feb;25(2):121-129. doi: 10.1089/cmb.2017.0050. Epub 2017 Aug 3. PubMed PMID: 28771374; PubMed Central PMCID: PMC5806593.

Neuwald AF, Aravind L, Altschul SF. Inferring joint sequence-structural determinants of protein functional specificity. Elife. 2018 Jan 16;7. doi: 10.7554/eLife.29880. PubMed PMID: 29336305; PubMed Central PMCID: PMC5770160.

More