Optimizing the Trade-off Between Classification Accuracy and Data Privacy in the Area of Data Stream Mining
Abstract
Data perturbation has grabbed the attention of data mining, as preserving the privacy of the data is crucial, especially in sensitive data. But the perturbation process negatively affects the accuracy of predictions, generating a trade-off between privacy and accuracy. We propose seven
different cumulative noise addition based perturbation methods combining a set of techniques such as logistic function, use of absolute noise values, and cycle-wise noise addition as possible solutions for this accuracy-privacy trade-off issue. These techniques are introduced to optimize the trade-off between classification accuracy and data privacy by controlling the maximum noise level.Moreover, we evaluate the performance of the proposed methods compared to the state of art of the noise addition-based perturbation methods to select the best of them.
Keywords
Full Text:
PDFDisclaimer/Regarding indexing issue:
We have provided the online access of all issues and papers to the indexing agencies (as given on journal web site). It’s depend on indexing agencies when, how and what manner they can index or not. Hence, we like to inform that on the basis of earlier indexing, we can’t predict the today or future indexing policy of third party (i.e. indexing agencies) as they have right to discontinue any journal at any time without prior information to the journal. So, please neither sends any question nor expects any answer from us on the behalf of third party i.e. indexing agencies.Hence, we will not issue any certificate or letter for indexing issue. Our role is just to provide the online access to them. So we do properly this and one can visit indexing agencies website to get the authentic information. Also: DOI is paid service which provided by a third party. We never mentioned that we go for this for our any journal. However, journal have no objection if author go directly for this paid DOI service.