Deep learning is necessary to automatise single-cell data analysis in large datasets
Traditional microbiology usually studies microbes at population-level, without considering the individuals’ heterogeneity in a population. The study of this heterogeneity is essential to understand how individuals respond to environmental changes and determine the fate of the whole population [2].
For this reason, single-cell microfluidic experiments are leading important discoveries in different fields, such as cell-size control, cellular aging, stress responses and antibiotic tolerance/ persistence.
A direct consequence is that improvements in time-lapse microscopy are significantly increasing the size of the new datasets. Traditional approaches require time-consuming manual error correction, and image analysis is becoming a major bottleneck in generating quantitative, single-cell data.
In recent years, deep learning tools have found applications in many biology fields, including automated image analysis [3]. For example, the U-Net algorithm is a convolutional neural network that performs exceptionally well on biomedical data [4].
A purely Python workflow with two U-Net convolutional neural networks
An ideal tool for high throughput analysis of microscopy data has to be rapid, accurate, flexible and require minimal input from the user. DeLTA addresses this combination of needs thanks to a deep-learning-based approach.
In addition, the purely Python workflow allows the entire pipeline to be in an open-source framework. The Bio-Formats toolbox for Python also enables users to work directly with images in many standard formats without needing any preformatting.
To summarise, DeLTA pipeline consists of segmentation, tracking, and lineage reconstruction.
Both the segmentation and tracking models implement a U-Net neural network architecture (TensorFlow/Keras). A pixel-wise-weighted binary cross-entropy loss function is implemented to train both models, as in the original U-Net paper [4].
As input for segmentation, images in many common formats that are output via microscopy software can be used without the need for any preformatting. The tracking model inputs a phase contrast image and segmentation from the current and previous frame and outputs a greyscale image of the predicted tracking event.
Ad hoc functions are used to deal with occasional out-of-focus frames, electronic noise, exaggerated cell movement and drift.
As a result, segmentation generates information about cell morphology; tracking tracks cells across frames and records division events; lineage reconstruction keeps track of cell lineages and records pole age.
New functionality to analyse mixed populations in two dimensions
While the first version of DeLTA was working in 1D [5], this new version segments and tracks bacteria in 2D. Moreover, cells with different properties can be tracked simultaneously within the same movie.
All this is essential to quantify single-cell spatial and temporal dynamics, which we know to be necessary to study microbial co-cultures, cell-to-cell interactions, or spatial patterning.
A possible application is to look at differences in growth rate between antibiotic-resistant and susceptible cells grown in the same field of view.
DeLTA works very well on standard rod-shaped bacteria such as E. coli and B. subtilis. It can be easily extended to cells with similar morphologies, but also to different organisms.