Implementation of Machine Learning for Breast Cancer Classification Based on Genomic Data: Backend Solution with Supabase and Streamlit

Authors

  • Tia Hasna Humayra Telkom University
  • Suryo Adhi Wibowo Telkom University
  • Koredianto Usman Telkom University

Abstract

Breast cancer remains one of the leading causes of cancer-related deaths worldwide, highlighting the need for accurate and efficient diagnostic tools. This study focuses on implementing machine learning models, particularly Artificial Neural Networks (ANN), to classify breast cancer types based on genomic data. Using the METABRIC RNA Mutation dataset, the system combines a cloud-based backend with Supabase and an intuitive frontend built with Streamlit. To ensure data compatibility with the models, preprocessing steps such as standardization, label encoding, and one-hot encoding are applied. TensorFlow is used to load models saved in .h5 format, with two approaches tested: a 30-feature model achieving 99% accuracy and an average prediction time of 80 milliseconds, and a 6-feature model achieving 100% accuracy with a faster prediction time of 42.25 milliseconds. Prediction results are stored securely in Supabase, complete with timestamps for tracking and exported as PDF reports for easy documentation. Data security is prioritized through the use of API keys, JWT tokens, and Streamlit secret management to safeguard sensitive information. The integration of Supabase for backend processing, Streamlit for real-time visualization, and GitHub for CI/CD automation results in a scalable, reliable, and efficient system. This study presents a robust solution for breast cancer classification, providing real-time predictions, secure data handling, and a user-friendly interface suitable for clinical and research applications.

 

Keywords— breast cancer classification, artificial neural network, genomic data, Supabase, Streamlit, real-time prediction, data security.

References

A. H. A. A.-B. Arwa Okaidat, "Breast Cancer and Anxiety: A Relationship Study," in International Conference on Information and Communication Systems (ICICS), 2020.

A. B. S. K. B. E. A. Akhil Kumar Das, "Introduction to Breast Cancer and Awareness," in International Conference on Advanced Computing & Communication Systems (ICACCS), 2021.

S. D. M. H. E. H. Rahmanul Hoque, "Breast Cancer Classification using XGBoost," in International Conference on Computing Communication and Networking Technologies (ICCCNT), 2020.

R. G. Thomas Rincy N, "A Survey on Machine Learning Approaches and Its Techniques:," in International Students' Conference on Electrical,Electronics and Computer Science (SCEECS), India, 2020.

P. K. T. E. Ebru Aydındag Bayrak, "Comparison of Machine Learning Methods for Breast Cancer Diagnosis," in Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), Turkey, 2019.

S. T. Poonam Kathale, "Breast Cancer Detection and Classification," in International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), India, 2020.

F. G. K. C. F. W. Yifei Gong, "The Architecture of Micro-services and the Separation of Frond-end and Back-end Applied in a Campus Information System," in International Conference on Advances in Electrical Engineering and Computer Applications( AEECA), US, 2020.

A. Amanuel, "Supabase vs Firebase: Evaluation of performancand development of Progressive Web Apps," Metropolia, p. 58, 2022.

A. A. A. A. K. A. D. P. A. Sujan Ray, "Selecting Features for Breast Cancer Analysis and Prediction," in International Conference on Advances in Computing and Communication Engineering (ICACCE), USA, 2020.

P. K. Tanaya Padhi, "Breast Cancer Analysis Using WEKA," in International Conference on Cloud Computing, Data Science & Engineering (Confluence), Uttar Pradesh, 2019.

in International Conference on Emerging Trends in Communication, Control and Computing (ICONC3), 2020.

D. V. P. F. J. S. M. B. S. H. Fan Yang-Turner, "Scalable Pathogen Pipeline Platform (SP^3): Enabling Unified Genomic Data Analysis with Elastic Cloud Computing," in International Conference on Cloud Computing (CLOUD), UK, 2019.

M. E. A. M. B. Yassine Amkrane, "Towards Breast Cancer Response Prediction using Artificial Intelligence and Radiomics," in International Conference on Cloud Computing and Artificial Intelligence: Technologies and Applications (CloudTech), Belgium, 2020.

A. K. S. Manish Kumar, "Distributed Intrusion Detection System using Blockchain and Cloud Computing Infrastructure," in International Conference on Trends in Electronics and Informatics (ICOEI), India, 2020.

R. Alharbi, "Kaggle," 2020. [Online]. Available: https://www.kaggle.com/datasets/raghadalharbi/breast-cancer-gene-expression-profiles-metabric.

Downloads

Published

2025-04-30

Issue

Section

Prodi S1 Teknik Telekomunikasi