Genome sequences are fundamental to many fields: a living history of molecular evolution, a reference frame for medicine, a substrate for bioengineering. Our UC Berkeley-based team makes the software that empowers genome scientists to decode, analyze, annotate, and design these sequences.
Our group is a mix of academics and research software engineers, working on intersecting projects. We develop new algorithms and models, gather data on natural and synthetic sequences, and develop code to visualize and explore data.
We use techniques from Bayesian statistics and deep learning through to modern web development. Recent papers describe software for annotating genomes collaboratively over the web, reconstructing ancestral viruses, and designing protein libraries for directed evolution.