The art of seeing the elephant in the room: 2D embeddings of single-cell data do make sense.

A recent paper claimed that t-SNE and UMAP embeddings of single-cell datasets are "specious" and fail to capture true biological structure. The authors argued that such embeddings are as arbitrary and as misleading as forcing the data into an elephant shape. Here we show that this conclusi...

Full description

Saved in:
Bibliographic Details
Main Authors: Jan Lause, Philipp Berens, Dmitry Kobak
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-10-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1012403
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A recent paper claimed that t-SNE and UMAP embeddings of single-cell datasets are "specious" and fail to capture true biological structure. The authors argued that such embeddings are as arbitrary and as misleading as forcing the data into an elephant shape. Here we show that this conclusion was based on inadequate and limited metrics of embedding quality. More appropriate metrics quantifying neighborhood and class preservation reveal the elephant in the room: while t-SNE and UMAP embeddings of single-cell data do not preserve high-dimensional distances, they can nevertheless provide biologically relevant information.
ISSN:1553-734X
1553-7358