Skip to content
Deep learning-based proteomics enables accurate classification of bulk and single-cell samples

This groundbreaking study introduces a novel deep learning-based approach called Proformer for accurate classification of proteomic samples. Unlike conventional convolutional neural networks (CNNs), Proformer utilizes a transformer architecture that leverages a self-attention mechanism, enabling efficient learning of complex relationships within multi-dimensional proteomic data.

By integrating retention time, mass-to-charge, intensity, and ion mobility features from mass spectrometry measurements, Proformer demonstrates superior performance compared to CNNs.

The inclusion of ion mobility as a fourth feature dimension further enhances the model’s accuracy, underscoring the importance of leveraging advanced separation techniques. The study also highlights Proformer’s robustness against batch effects and its ability to classify samples even at the single-cell level, paving the way for broader applications in patient stratification and early disease detection. Additionally, the authors provide insights into the model’s decision-making process through interpretability methods like SHAP values, offering a comprehensive understanding of Proformer’s inner workings.

The IonOpticks Aurora Ultimate CSI Series, 25cm nanoflow UHPLC column enabled enhanced separation and contributed to the model’s impressive accuracy.



Pre – Published in BioRxiv, February 07, 2024.

Authors

Karl K. Krull, Arlene Kühn, Julia Höhn, Titus J. Brinker, Jeroen Krijgsveld

Title

Deep learning-based proteomics enables accurate classification of bulk and single-cell samples

Share:

Related publications & news

Karl Krull is taking single-cell proteomics
Karl Krull, a PhD candidate working in Jeroen Krijgsveld‘s group at t...
IN THIS EDITION: Big things are happening at IonOpticks. We’re excited to...
Patricia Skowronek, a postdoctoral researcher working with Prof. Matthias M...
Credit: Modified version of an image by Elham Karimi and Simon Milette, Wal...
An industry-focused top-down mass spectrometry (TDMS) platform for resolvin...
IonOpticks, a leading global producer of high-performance chromatography co...
Overall workflows and numbers of proteins identified from various methods. ...
a graphic split into two parts by a diagonal light-blue line On the left, taking up most of the image, is a teal background with the Nature Communications logo on the top left Below is the white text "Study by Lin et al.". On the smaller, right side of the image, is an abstract representation of cells in red space, with a paper icon at the bottom
Working model of RNF214 functions in HCC. Source: Lin et al., 2024. “...
Previous
Next