Tuesday, October 22 • 1:30pm - 2:00pm
Poster Presentation #17 - Deep Learning-Derived Features Outperform Classical Computer Vision in Low Dimension-Embedding of High-Content Screening Data

When automating the analysis of high-content screening (HCS) images, a key step is reducing the image of each object (e.g. a cell) to a vector of features that describes the object for subsequent comparison with other objects. In classical computer vision, many of these features have an implicit biological meaning (e.g. cell size, intensity, aspect ratio).
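As an illustration of the feature extraction step described above, here is a minimal sketch that reduces one segmented object to a three-element feature vector. The function name `classical_features`, the choice of features, and the bounding-box definition of aspect ratio are assumptions made for this example; the poster does not specify how its classical features were computed.

```python
import numpy as np

def classical_features(mask, intensity):
    """Return [area, mean intensity, aspect ratio] for one segmented object.

    Hypothetical helper: `mask` marks the object's pixels, `intensity`
    is the corresponding grayscale image of the same shape.
    """
    mask = np.asarray(mask, dtype=bool)
    rows, cols = np.nonzero(mask)
    if rows.size == 0:
        raise ValueError("mask contains no object pixels")
    area = float(rows.size)                      # object size in pixels
    mean_intensity = float(intensity[mask].mean())
    # Aspect ratio taken from the axis-aligned bounding box (an assumption).
    height = rows.max() - rows.min() + 1
    width = cols.max() - cols.min() + 1
    aspect_ratio = max(height, width) / min(height, width)
    return np.array([area, mean_intensity, aspect_ratio])

# Toy object: a 2 x 6 pixel rectangle with uniform intensity 50.
mask = np.zeros((8, 8), dtype=bool)
mask[2:4, 1:7] = True
intensity = np.full((8, 8), 10.0)
intensity[mask] = 50.0
features = classical_features(mask, intensity)
```

Each entry of the resulting vector keeps a direct biological reading (size, brightness, elongation), which is the property the abstract attributes to classical features.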

Depending on the research question, feature vectors are then subjected to univariate or multivariate analysis. Nonlinear dimensionality reduction techniques such as t-SNE and its variants [1] are particularly useful for representing, or embedding, high-dimensional data in two or three dimensions. Ideally, such visualizations should clearly separate the different phenotypes observed in the experiment. However, the embedding quality relies heavily on the quality of the input features derived from the HCS images.
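The embedding step can be sketched with scikit-learn's `TSNE`. The data here is synthetic (two Gaussian clusters standing in for two phenotypes), and the perplexity, initialization, and seed are illustrative assumptions rather than settings taken from the poster.

```python
import numpy as np
from sklearn.manifold import TSNE

# Synthetic stand-in for per-cell feature vectors: two phenotypes,
# 60 cells each, 20 features per cell (all values are made up).
rng = np.random.default_rng(0)
n_per_class, n_features = 60, 20
phenotype_a = rng.normal(0.0, 1.0, size=(n_per_class, n_features))
phenotype_b = rng.normal(4.0, 1.0, size=(n_per_class, n_features))
features = np.vstack([phenotype_a, phenotype_b])
labels = np.repeat([0, 1], n_per_class)

# Embed the 20-dimensional vectors in two dimensions.
# perplexity must be smaller than the number of samples.
tsne = TSNE(n_components=2, perplexity=30.0, init="pca", random_state=0)
embedding = tsne.fit_transform(features)  # one 2D point per cell
```

Plotting `embedding` colored by `labels` gives the kind of visualization in which distinct phenotypes should ideally appear as separate groups.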

In this poster, we use a high-content screening translocation assay to compare the quality of embeddings produced from features extracted with classical versus deep learning-based approaches. We compare them by their ability to separate phenotypes and their robustness against batch effects. We show that classical features, deep learning-derived features, and a combination of both all produce excellent results. With classical features, however, attaining this level of quality requires expert tuning of the feature extraction process to the assay of interest. In contrast, deep learning-based feature extraction is fully automated and does not require expert knowledge.
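One possible way to put a number on "ability to separate phenotypes" is the silhouette score of the 2D embedding with respect to the phenotype labels. This is an assumption for illustration only: the abstract does not say which metric the authors used, and the two embeddings below are synthetic.

```python
import numpy as np
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(1)
labels = np.repeat([0, 1], 50)  # two hypothetical phenotypes

# Hypothetical embedding A: the two phenotypes form distinct clusters.
separated = np.vstack([
    rng.normal(0.0, 1.0, size=(50, 2)),
    rng.normal(10.0, 1.0, size=(50, 2)),
])
# Hypothetical embedding B: the two phenotypes overlap completely.
mixed = rng.normal(0.0, 1.0, size=(100, 2))

# Silhouette score lies in [-1, 1]; higher means tighter,
# better separated groups for the given labels.
score_separated = silhouette_score(separated, labels)
score_mixed = silhouette_score(mixed, labels)
print(f"separated: {score_separated:.2f}, mixed: {score_mixed:.2f}")
```

Computing the same score with batch identifiers in place of phenotype labels would give a rough indication of batch effects, where a lower value suggests that batches do not form their own clusters.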

With deep learning-based features, we observe that the embedding quality depends on the network architecture. Standard CNN architectures perform poorly, while tailored architectures outperform classical methods. Finally, we show that for embedding and visualization, adding classical features to the deep learning-based features does not increase resolution and is thus unnecessary.

[1] L. J. P. van der Maaten and G. E. Hinton. Visualizing High-Dimensional Data Using t-SNE. Journal of Machine Learning Research 9(Nov):2579-2605, 2008.


Matthias Fassler

Scientific Account Management, Genedata
Matthias Fassler is a Scientific Account Manager at Genedata, Basel.

Tuesday October 22, 2019 1:30pm - 2:00pm BST
Sherry Coutu Seminar Suite Foyer