Network-based anomaly detection algorithm reveals proteins with major roles in human tissues
Michael Fire and Esti Yeger-Lotem
Gigascience Press Seminar Series
Host GigaScience Press, GigaScience Press |
DateTuesday, April 8, 2025 3:00 PM to 3:30 PM (UTC) |
Live eventThe live event will be accessible via this page. |
Network-based anomaly detection algorithm reveals proteins with major roles in human tissues
Background: Proteins act through physical interactions with other molecules to maintain organismal health. Protein-protein interaction (PPI) networks proved to be a powerful framework for obtaining insight into protein functions, cellular organization, response to signals, and disease states. In multicellular organisms, protein content varies between tissues, influencing tissue morphology and function. Weighted PPI networks, reflecting the likelihood of interactions in specific tissues, offer insights into tissue-specific processes and disease mechanisms. We hypothesized that detecting anomalous nodes in these networks could reveal proteins with key tissue-specific functions.
Results: Here, we introduce Weighted Graph Anomalous Node Detection (WGAND), a novel machine-learning algorithm to identify anomalous nodes in weighted graphs. WGAND estimates expected edge weights and uses deviations to generate anomaly detection features, which are then used to score network nodes. We applied WGAND to weighted PPI networks of 17 human tissues. High-ranking anomalous nodes were enriched for proteins associated with tissue-specific diseases and tissue-specific biological processes, such as neuron signaling in the brain and spermatogenesis in the testis. WGAND outperformed other methods in terms of Area Under the ROC Curve (AUC) and Precision at K (P@K), highlighting its effectiveness in uncovering biologically meaningful anomalies.
Conclusion: Our findings demonstrate WGAND’s potential as a powerful tool for detecting anomalous proteins with significant biological roles. By identifying proteins involved in critical tissue-specific processes and diseases, WGAND offers valuable insights for discovering novel biomarkers and therapeutic targets. Its versatile algorithm is suitable for any weighted graph and is broadly applicable across various fields. The WGAND algorithm is available as an open-source Python library at https://github.com/data4goodlab/wgand.
- Israel Science Foundation401/22