Classification Using Naive Bayes to Predict Radiation Protection in Cancer Drug Discovery: a Case of Mixture Based Grouped Data
One treatment for cancer that is widely used is radiation therapy or radiotherapy using compounds that kill cancer cells.The effectiveness of the radio theraphy is assessed from the percentage of cancer cell death rate. This research examines 84 compounds where each compound is composed by 217 features leading to high dimensionality of the data. Feature selection is carried out based on the mean value of Gini (MDG) and it is able to sort
the most important features used in the classification using Naive Bayes. The Naive Bayes has a weak performance to classify the raw dataset i.e. using threshold of 10% cancer cell death rate. A grouping based on mixture distribution found 30% cell death rate as a new threshold, and it improves the performance of Naive Bayes both in training and testing dataset evaluated using AUC (Area Under Curve). The optimal classification for testing
dataset is obtained by using either 20% or 25% most important features with AUC close to 60%, where it is about 15% higher than classification using threshold of 10%. Meanwhile, the AUC of training dataset reached more than 70%.
Disclaimer/Regarding indexing issue:
We have provided the online access of all issues and papers to the indexing agencies (as given on journal web site). It’s depend on indexing agencies when, how and what manner they can index or not. Hence, we like to inform that on the basis of earlier indexing, we can’t predict the today or future indexing policy of third party (i.e. indexing agencies) as they have right to discontinue any journal at any time without prior information to the journal. So, please neither sends any question nor expects any answer from us on the behalf of third party i.e. indexing agencies.Hence, we will not issue any certificate or letter for indexing issue. Our role is just to provide the online access to them. So we do properly this and one can visit indexing agencies website to get the authentic information.