Model Prediksi Interaksi Senyawa dan Protein untuk Drug Repositioning menggunakan Deep Semi-Supervised Learning
DOI:
https://doi.org/10.25126/jtiik.2020742236Abstrak
Drug repositioning adalah penggunaan senyawa obat yang sudah lolos uji sebelumnya untuk mengatasi penyakit baru selain penyakit awal obat tersebut ditujukan. Drug repositioning dapat dilakukan dengan memprediksi interaksi senyawa obat dengan protein penyakit yang bereaksi positif. Salah satu tantangan dalam prediksi interaksi senyawa dan protein adalah masalah ketidakseimbangan data. Deep semi-supervised learning dapat menjadi alternatif untuk menangani model prediksi dengan data yang tidak seimbang. Proses pre-training berbasis unsupervised learning pada deep semi-supervised learning dapat merepresentasikan input dari unlabeled data (data mayoritas) dengan baik dan mengoptimasi inisialisasi bobot pada classifier. Penelitian ini mengimplementasikan Deep Belief Network (DBN) sebagai pre-training dan Deep Neural Network (DNN) sebagai classifier. Data yang digunakan pada penelitian ini adalah dataset ion channel, GPCR, dan nuclear receptor yang bersumber dari pangkalan data KEGG BRITE, BRENDA, SuperTarget, dan DrugBank. Hasil penelitian ini menunjukkan pada dataset tersebut, pre-training berupa ekstraksi fitur memberikan efek optimasi dilihat dari peningkatan performa model DNN pada akurasi (3-4.5%), AUC (4.5%), precision (5.9-6%), dan F-measure (3.8%).
Abstract
Drug repositioning is the reuse of an existing drug to treat a new disease other than its original medical indication. Drug repositioning can be done by predicting the interaction of drug compounds with disease proteins that react positively. One of the challenges in predicting the interaction of compounds and proteins is imbalanced data. Deep semi-supervised learning can be an alternative to handle prediction models with imbalanced data. The unsupervised learning based pre-training process in deep semi-supervised learning can represent input from unlabeled data (majority data) properly and optimize initialization of weights on the classifier. This study implements the Deep Belief Network (DBN) as a pre-training with Deep Neural Network (DNN) as a classifier. The data used in this study are ion channel, GPCR, and nuclear receptor dataset sourced from KEGG BRITE, BRENDA, SuperTarget, and DrugBank databases. The results of this study indicate that pre-training as feature extraction had an optimization effect. This can be seen from DNN performance improvement in accuracy (3-4.5%), AUC (4.5%), precision (5.9-6%), and F-measure (3.8%).
Downloads
Referensi
ALBERTBUP. 2017. A python implementation of deep belief networks built upon numpy and tensorflow with scikit-learn compatibility. [online] Tersedia pada: <https://github.com/albertbup/deep-belief-network> [diunduh: 2019 Mar 6].
ALI, A., SHAMSUDDIN, S.M. & RALESCU, A.L. 2015. Classification with Class Imbalance Problem: A Review. International Journal Advance Soft Computing Application, 7(3), pp.176, 204.
ASHBURN, T.T. dan THOR, K.B. 2004. Drug Repositioning: Identifying and Developing New Uses for Existing Drugs. Nature Reviews Drug discovery, 3(8), pp.673.
BAHI, M. dan BATOUCHE, M. 2018a. Drug-Target Interaction Prediction in Drug Repositioning Based on Deep Semi-Supervised Learning. Dalam: 6th International Conference on Computer Intelligence and Its Applications (CIIA). Oran (DZ): Springer International Publishing, pp.302-313.
BAHI, M. dan BATOUCHE, M. 2018b. Deep Semi-Supervised Learning for DTI Prediction Using Large Datasets and H2O-Spark Platform. Dalam: 2018 International Conference on Intelligent Systems and Computer Vision (ISCV). Fez (MA): IEEE, pp.1-7.
CAO, D., XIAO, N., XU, Q. & CHEN, A.F. 2015. Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions. Bioinformatics, 31(2), pp.279, 281.
CSERMELY, P., KORCSMÁROS, T., KISS. H.J.M., LONDON, G. & NUSSINOV, R. 2013. Structure and Dynamics of Molecular Networks: A Novel Paradigm of Drug Discovery: A Comprehensive Review. Pharmacology and Therapeutics, 138(3), pp.333, 408.
ERHAN, D., BENGIO, Y., COURVILLE, A., MANZAGOL, P.A., VINCENT, P. & BENGIO, S. 2010. Why Does Unsupervised Pre-Training Help Deep Learning?. Journal of Machine Learning Research, 11(Feb), pp.625, 660.
ERHAN, D., MANZAGOL, P.A., BENGIO, Y., BENGIO, S. & VINCENT, P. 2009. The Difficulty of Training Deep Architectures and The Effect of Unsupervised Pre-Training. Dalam: 12th International Conference on Artificial Intelligence and Statistics (AISTATS). Florida (AS): PMLR. pp.153-160.
EZZAT, A., WU, M., LI, X.L. & KWOH, C.K. 2016. Drug-Target Interaction Prediction Via Class Imbalance-Aware Ensemble Learning. BMC Bioinformatics, 17(19), pp.267, 276.
GHASEMI, F., MEHRIDEHNAVI, A., FASSIHI, A. & PÉREZ-SÁNCHEZ, H. 2018. Deep Neural Network in QSAR Studies Using Deep Belief Network. Applied Soft Computing, 62, pp.251, 258.
GLOROT, X. dan BENGIO, Y. 2010. Understanding The Difficulty of Training Deep Feedforward Neural Networks. Dalam: Thirteenth International Conference on Artificial intelligence and Statistics. Sardiana (IT): Journal of Machine Learning Research. pp.249-256.
HINTON, G.E. & SALAKHUTDINOV, R.R. 2006. Reducing The Dimensionality of Data with Neural Networks. Science 313, 5786, pp.504, 507.
HINTON, G.E. 2012. A practical guide to training restricted Boltzmann machines. Dalam: Neural networks: Tricks of the trade. Berlin (DE): Springer. pp.599-619.
HINTON, G.E., DENG, L., YU, D., DAHL, G.E., MOHAMED, A.R., JAITLY, N., SENIOR, A., VANHOUCKE, V., NGUYEN, P., SAINATH, T.N. & KINGSBURY, B. 2012. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal processing magazine, 29(6), pp.82-97.
JACOB, L. & VERT, J.P. 2008. Protein-Ligand Interaction Prediction: An Improved Chemogenomics Approach. Bioinformatics, 24(19), pp.2149, 2156.
KURNIA, A. 2017. Prediksi formula jamu berkhasiat menggunakan teknik link prediction dari jejaring bipartite senyawa aktif dan protein. [skripsi] Institut Pertanian Bogor, Indonesia.
LARASATI, A., DWIASTUTIK, A., RAMADHANTI, D. & MAHARDIKA, A. 2018. The effect of Kurtosis on the accuracy of artificial neural network predictive model. Dalam: MATEC Web of Conferences, 204, pp.02018.
NOVAC, N. 2013. Challenges and Opportunities of Drug Repositioning. Trends in pharmacological sciences, 34(5), pp.267, 272.
PRUENGKARN, R., WONG, K.W. & FUNG, C.C. 2017. Imbalanced Data Classification Using Complementary Fuzzy Support Vector Machine Technique and SMOTE. Dalam: IEEE International Conference on Systems, Man, and Cybernetics (SMC). Banff (CA): IEEE. pp.978-983.
RAHMI, A.S. 2018. Implementasi Hybrid Sampling Technique untuk Prediksi Interaksi Senyawa Aktif dan Protein pada Data yang Tidak Seimbang. Dalam: ICDALC International Conference on Digital Agriculture from Land to Consumer. Bogor (ID):IPB Press.
XIE, L., EVANGELIDIS, T., XIE, L., BOURNE, P.E. 2011. Drug Discovery Using Chemical Systems Biology: Weak Inhibition of Multiple Kinases May Contribute to The Anti-Cancer Effect of Nelfinavir. PLoS Computational Biology, 7(4), pp.e1002037.
YAMANISHI, Y. 2013. Chemogenomic Approaches to Infer Drug–Target Interaction Networks. Data Min Syst Biol, 939, pp.97, 113.
YAMANISHI, Y., ARAKI, M., GUTTERIDGE, A., HONDA, W. & KAMEHISA, M. 2008. Prediction of Drug–Target Interaction Networks from The Intefration of Chemical and Genomic Spaces. Bioinformatics, 24(8), pp.i232, i240.
ZHAO, R., YAN, R., CHEN, Z., MAO, K., WANG, P. & GAO, R.X. 2019. Deep Learning and Its Applications to Machine Health Monitoring. Mechanical Systems and Signal Processing, 115, pp.213, 237.
ZHONG, L., CHENG, L., XU, H., WU, Y., CHEN, Y., & LI, M. 2016. Segmentation of individual trees from TLS and MLS data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 10(2), pp.774, 787.
Unduhan
Diterbitkan
Terbitan
Bagian
Lisensi
Artikel ini berlisensi Creative Common Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Penulis yang menerbitkan di jurnal ini menyetujui ketentuan berikut:
- Penulis menyimpan hak cipta dan memberikan jurnal hak penerbitan pertama naskah secara simultan dengan lisensi di bawah Creative Common Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) yang mengizinkan orang lain untuk berbagi pekerjaan dengan sebuah pernyataan kepenulisan pekerjaan dan penerbitan awal di jurnal ini.
- Penulis bisa memasukkan ke dalam penyusunan kontraktual tambahan terpisah untuk distribusi non ekslusif versi kaya terbitan jurnal (contoh: mempostingnya ke repositori institusional atau menerbitkannya dalam sebuah buku), dengan pengakuan penerbitan awalnya di jurnal ini.
- Penulis diizinkan dan didorong untuk mem-posting karya mereka online (contoh: di repositori institusional atau di website mereka) sebelum dan selama proses penyerahan, karena dapat mengarahkan ke pertukaran produktif, seperti halnya sitiran yang lebih awal dan lebih hebat dari karya yang diterbitkan. (Lihat Efek Akses Terbuka).