05 August 2023
Contemplating libraries in biology. Not that kind. Not that one either.
04 August 2023
The library of possible proteins is beyond vast. Does this cause us to view evolution as harder than it is?
Bates Hall at the Boston Public Library
One of the most effective metaphors for evolutionary change is the image of an exploration of a space, perhaps a map that shows "fitness peaks" or, better, a library of possibilities. The philosopher Daniel Dennett, writing in Darwin's Dangerous Idea, suggested The Library of Mendel as a way of thinking about the total set of possible gene sequences. He was adapting an idea famously employed in a short story by Jorge Luis Borges called "The Library of Babel," which consists of the total set of possible books of a particular length. (This "library" exists on a website designed for creators and researchers.)
Contemplating a space of possibilities—whether that space consists of books written in English (26 letters), or "books" written in the language of DNA (four letters), or "books" written in the language of protein (20 letters)—is both fun and dizzying. The dizziness is induced (for me, at least) by the vastness of these libraries (Babel or Mendel, doesn't matter). How vast? Here is how Dennett describes the Library of Babel's size (italics are his):
No actual astronomical quantity (such as the number of elementary particles in the universe, or the amount of time since the Big Bang, measured in nanoseconds) is even visible against the backdrop of these huge-but-finite numbers. If a readable volume in the Library were as easy to find as a particular drop in the ocean, we'd be in business!
—Darwin's Dangerous Idea, p. 109
Dennett then uses Vast to indicate "Very-much-more-than-astronomically" large and Vanishingly small to indicate the likelihood of something like discovering a "volume with so much as a grammatical sentence in it" in the Library. In other words, we lack words to adequately describe the size of the Library and the improbability of randomly discovering anything coherent inside it.
29 May 2011
Mapping fitness: ribozymes, landscapes, and Seattle
05 February 2011
Mapping fitness: protein display, fitness, and Seattle
Suppose we want to examine the fitness landscape represented by the structure of a single protein. Our map would show the fitness of the protein (its function, measured somehow) and how fitness is affected by variations in the structure of the protein (its sequence, varied somehow). It's hard enough to explain or read such a map. Even more daunting is the task of creating a detailed map of such a widely-varying space. Two particular sets of challenges come to mind.
27 November 2010
Mapping fitness: bacteria, mutations, and Seattle
20 November 2010
Mapping fitness: landscapes, topographic maps, and Seattle

It's a simple concept, and a helpful one, though sometimes subject to over-interpretation. And it helps to frame some of the big questions in evolutionary genetics. One of those big questions is this one, stated somewhat simplistically: how do the variants navigate to fitness peaks, if there are fitness valleys that separate the peaks? (The ideas is that fitness is higher on the peaks, and so a population would be unlikely to descend from a local peak into a valley.) In other words, given a particular fitness landscape, what are the evolutionary trajectories by which variation can explore that landscape?