Faculty & Research
School of Informatics
- Contact Information
- Contact Matthew Hahn by mwh [at] indiana [dot] edu
- By telephone: 812-856-7001 / 6-7016 (lab)
- JH 249B / JH 249 (lab)
- Evolution, Ecology & Behavior
- Research Areas
- Genomics and Bioinformatics
Ph.D., Duke University, 2003
Postdoctoral Fellow, University of California, Davis, 2003-2005
Alfred P. Sloan Foundation Research Fellow, 2010-2012
Our work focuses broadly on using genomic data to ask questions about organismal function and evolution. The huge amount of data currently being produced allows us to ask and answer questions on a genomic scale that have never been possible before. Our questions largely revolve around the relative roles of natural selection and genetic drift in shaping nucleotide, gene family, and gene expression variation both within and between species. Although most of the empirical work has been on systems such as humans, flies, and mosquitoes (and now tomatoes!), members of the lab can work on topics and organisms that appeal to them. This page covers several major topics currently being studied.
The evolution of gene gain and loss
Comparison of whole genomes has revealed large and frequent changes in the size of gene families, the result of gene duplication and loss. Comparative genomic analyses allow us to identify large-scale patterns of change and to make inferences regarding the role of natural selection in gene gain and loss. To make these analyses possible, we have developed a stochastic birth-and-death model for gene family evolution, implemented in the software package CAFE. Application of this method to data from multiple whole genomes of many groups is revealing remarkable patterns of gene gain and loss. Other approaches to studying this question have involved the analysis of gene movement among chromosomes (especially sex chromosomes), the discovery of polymorphic copy-number variants under local selection, and even new methods for carrying out genome assembly to more accurately estimate gene number.
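For readers unfamiliar with birth-and-death models, the following is a minimal sketch of the general idea, not the CAFE implementation. It simulates the size of one gene family along a single branch, assuming each gene copy is duplicated at rate `lam` and lost at rate `mu` per unit time. The function name and parameters are hypothetical and chosen only for illustration.

```python
import random


def simulate_family_size(n0, lam, mu, t, rng=None):
    """Simulate a linear birth-death process for one gene family.

    n0  : number of gene copies at the start of the branch
    lam : per-copy duplication (birth) rate, an illustrative parameter
    mu  : per-copy loss (death) rate, an illustrative parameter
    t   : branch length, in the same time units as the rates
    Returns the number of copies at the end of the branch.
    """
    rng = rng or random.Random()
    n, elapsed = n0, 0.0
    while n > 0:  # a family with zero copies cannot regain genes
        total_rate = n * (lam + mu)
        if total_rate == 0:
            break  # no events can occur
        # Waiting time to the next gain or loss is exponentially distributed.
        elapsed += rng.expovariate(total_rate)
        if elapsed > t:
            break  # the next event would fall past the end of the branch
        # Choose a duplication or a loss in proportion to the two rates.
        if rng.random() < lam / (lam + mu):
            n += 1
        else:
            n -= 1
    return n


if __name__ == "__main__":
    rng = random.Random(42)
    sizes = [simulate_family_size(10, 0.5, 0.5, 1.0, rng) for _ in range(5)]
    print(sizes)
```

When `lam` equals `mu`, the expected family size stays at `n0` while the variance grows with branch length, which is why large size changes can arise by chance alone and need to be tested against such a null model before invoking selection.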
Selective, demographic, and random processes all determine the frequency of alleles in a population and differences between species. One of the major goals of population genetics has been to uncover which of these processes is acting in natural populations through a combination of directed empirical studies and theoretical models that provide expectations under a variety of conditions. While most of the work in the field has involved single-locus or limited multilocus studies and models, the availability of genomic-scale data will begin to require new genomic-scale approaches. We have been pursuing these questions in a wide variety of studies, largely focused on humans and flies (where the best data have always been). This work has produced new methods for distinguishing demography from selection, for distinguishing different forms of positive selection, and, more recently, for using clinal variation to uncover local adaptation.
Population genomic data are being applied in new and creative ways to recently diverged lineages. One of the goals of the new field of speciation genomics is to understand how the patterns of divergence uncovered by such studies are related to mechanisms of reproductive isolation. Focusing on divergence within the Anopheles gambiae species complex, we have been interested in the roles of introgression and selection in shaping heterogeneous patterns of divergence. This work has built on much of our more general research into population genomics, but also now encompasses new approaches to detecting introgression and to distinguishing differences in introgression among loci from differences in selection among loci.
The evolution of transcriptional regulation
Changes in the timing, level, and location of gene expression have been implicated in many phenotypic differences between individuals and species. Using both DNA sequence and gene expression data, we can address the origin of variation in gene expression and the evolutionary forces that affect this variation. Our work has generally focused on the evolution of transcription factor binding sites and their implications for differences in expression, though with the rise of RNA-seq we are now more often than not starting with the observation of differences in expression. Much of the newer work uses wild tomatoes from the genus Solanum, as the ability to carry out genetic manipulations means that we can gain much more insight into the causes of transcriptional divergence.
Denton, J.F., J. Lugo-Martinez, A.E. Tucker, D.R. Schrider, W.C. Warren, and M.W. Hahn (2014) Extensive error in the number of genes inferred from draft genome assemblies. PLoS Computational Biology 10:e1003998.
Montague, M.J., G. Li, B. Gandolfi, R. Khan, B.L. Aken, S.M.J. Searle, P. Minx, L. Hillier, D. Koboldt, B.W. Davis, C.A. Driscoll, C. Barr, K. Blackistone, J. Quilez, B. Lorente-Galdos, T. Marques-Bonet, C. Alkan, G.W.C. Thomas, M.W. Hahn, M. Menotti-Raymond, S.J. O’Brien, R.K. Wilson, L.A. Lyons, W.J. Murphy, and W.C. Warren (2014) Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication. Proceedings of the National Academy of Sciences 111:17230-17235.
Carbone, L…M.W. Hahn, G.W.C. Thomas…et al. (2014) Gibbon genome and the fast karyotype evolution of small apes. Nature 513:195-201.
The Marmoset Genome Sequencing and Analysis Consortium (2014) The common marmoset genome provides insight into primate biology and evolution. Nature Genetics 46:850-857.
Cruickshank, T.C. and M.W. Hahn (2014) Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Molecular Ecology 23:3133-3157.
Bachtrog, D., J.E. Mank, C.L. Peichel, M. Kirkpatrick, S.P. Otto, T.-L. Ashman, M.W. Hahn, J. Kitano, I. Mayrose, R. Ming, N. Perrin, L. Ross, N. Valenzuela, and J.C. Vamosi (2014) Sex determination: Why so many ways of doing it? PLoS Biology 12:e1001899.
The Tree of Sex Consortium (2014) Tree of Sex: A database of sexual systems. Scientific Data 1:140015.
Cassone, B.J., C. Kamdem, C. Cheng, J.C. Tan, M.W. Hahn, C. Costantini, and N.J. Besansky (2014) Gene expression divergence between malaria vector sibling species Anopheles gambiae and An. coluzzii from rural and urban Yaoundé Cameroon. Molecular Ecology 23:2242-2259.
Hahn, M.W., S.V. Zhang, and L.C. Moyle (2014) Sequencing, assembling, and correcting draft genomes using recombinant populations. G3 4:669-679.
Thomas, G.W.C. and M.W. Hahn (2014) The human mutation rate is increasing, even as it slows. Molecular Biology and Evolution 31:253-257.