Introduction

Overview

Teaching: 10 min
Exercises: 0 min
Questions
  • What are the goals of this practical training?

  • What data do we work on?

Objectives
  • Describe the goals of this practical training.

  • Describe the characteristics of the isolates.

The lessons of the first two days introduce and reinforce basic skills in the Unix shell and R, and are designed for learners with no programming experience. The general topics and skills covered include:

Interacting with Computers

Data Management and Organization

Data Cleaning and visualization

Automation and scripting

Overview of our Data Set and Narrative

Microbes are ideal organisms for exploring ‘Long-term Evolution Experiments’ (LTEEs) - thousands of generations can be generated and stored in a way that would be virtually impossible for more complex eukaryotic systems. In Lenski et.al., 12 populations of Escherichia coli were propagated for more than 40,000 generations in a glucose-limited minimal medium. This medium was supplemented with citrate which E. coli cannot metabolize in the aerobic conditions of the experiment. Sequencing of the populations at regular time points reveals that spontaneous citrate-using mutants (Cit+) appeared in a population of E.coli (designated Ara-3) at around 31,000 generations. It should be noted that spontaneous Cit+ mutants are extraordinarily rare - inability to metabolize citrate is one of the defining characters of the E. coli species. Eventually, Cit+ mutants became the dominate population as the experimental growth medium contained a high concentration of citrate relative to glucose.

Why we chose this data set

Key Points

  • After the first two days you will have some familiarity with working on the command line, data management, cleaning and visualization, automation and scripting