Asian Journal of Information Technology

Year: 2016
Volume: 15
Issue: 16
Page No. 3022 - 3042

Prediction of Imbalanced Data Using Cluster Based Approach

Authors : B.V. Sumana and T. Santhanam


Alcala-Fdez, J., A. Fernandez, J. Luengo, J. Derrac, S. Garcia, L. Sanchez and F. Herrera, 2011. KEEL data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework. J. Multiple-Valued Logic Soft Comput., 17: 255-287.

Ali, A., S.M. Shamsuddin and A.L. Ralescu, 2015. Classification with class imbalance problem: A review. Int. J. Adv. Soft Comput. Appl., 7: 176-204.

Batista, G.E.A.P.A., R.C. Prati and M.C. Monard, 2004. A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explorations Newsl., 6: 20-29.

Beyan, C. and R. Fisher, 2015. Classifying imbalanced data sets using similarity based hierarchical decomposition. Pattern Recognit., 48: 1653-1672.

Chawla, N.V., K.W. Bowyer, L.O. Hall and W.P. Kegelmeyer, 2002. SMOTE: Synthetic minority over-sampling technique. J. Artificial Intell. Res., 16: 321-357.

Das, B., N.C. Krishnan and D.J. Cook, 2014. Handling Imbalanced and Overlapping Classes in Smart Environments Prompting Dataset. In: Data Mining for Service, Katsutoshi, Y. (Ed.). Springer, Heidelberg, Germany, ISBN:978-3-642-45251-2, pp: 199-219.

Denil, M. and T. Trappenberg, 2010. Overlap versus imbalance. Proceeding of the 23rd Canadian Conference on Artificial Intelligence, May 31-June 2, 2010, Springer, Heidelberg, Germany, ISBN:978-3-642-13058-8, pp: 220-231.

Elrahman, S.M.A. and A. Abraham, 2013. A review of class imbalance problem. J. Netw. Innovative Comput., 1: 332-340.

Galar, M., A. Fernandez, E. Barrenechea, H. Bustince and F. Herrera, 2016. New ordering-based pruning metrics for ensembles of classifiers in imbalanced datasets. Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015, March 1-5, 2016, Springer, New York, USA., ISBN:978-3-319-26225-3, pp: 3-15.

Ganganwar, V., 2012. An overview of classification algorithms for imbalanced datasets. Int. J. Emerging Technol. Adv. Eng., 2: 42-47.

Garcia, V., J. Sanchez and R. Mollineda, 2007. An Empirical Study of the Behavior of Classifiers on Imbalanced and Overlapped Data Sets. In: Progress in Pattern Recognition, Image Analysis and Applications, Luis, R., D. Mery and K. Josef (Eds.). Springer, Heidelberg, Germany, ISBN:978-3-540-76724-4, pp: 397-406.

Garcia, V., R. Alejo, J.S. Sanchez, J.M. Sotoca and R.A. Mollineda, 2006. Combined Effects of Class Imbalance and Class Overlap on Instance-Based Classification. In: Intelligent Data Engineering and Automated Learning, Emilio, C., Y. Hujun, B. Vicente and F. Colin (Eds.). Springer, Heidelberg, Germany, ISBN:978-3-540-45485-4, pp: 371-378.

Gu, Q., Z. Cai and L. Zhu, 2009. Classification of imbalanced data sets by using the hybrid re-sampling algorithm based on isomap. Proceeding of the 4th International Symposium on Intelligence Computation and Applications, October 23-25, 2009, ISICA, Huangshi, China, ISBN:978-3-642-04842-5, pp: 287-pp: 24.

Guo, H., W. Zhi, H. Liu and M. Xu, 2016. Imbalanced learning based on logistic discrimination. Comput. Intell. Neurosci., Vol. 2016, 10.1155/2016/5423204

Hall, M.A., 2000. Correlation-based feature selection for discrete and numeric class machine learning. Proceedings of the 17th International Conference on Machine Learning, June 29-July 2, 2000, California, pp: 359-366.

He, H. and E.A. Garcia, 2009. Learning from imbalanced data. IEEE Trans. Knowledge Data Eng., 21: 1263-1284.

Jain, A.K., 2010. Data clustering: 50 years beyond K-means. Pattern Recogn. Lett., 31: 651-666.

Japkowicz, N. and S. Stephen, 2002. The class imbalance problem: A systematic study. Intell. Data Anal., 6: 429-449.

Japkowicz, N., 2003. Class imbalances: Are we focusing on the right issue. Workshop Learn. Imbalanced Data Sets II., 1723: 17-23.

Jensen, R., 2005. Combining rough and fuzzy sets for feature selection. PhD Thesis, University of Edinburgh, Edinburgh, Scotland.

Kaveri, S. and Abhilasha, 2016. A study on effects of intrinsic characteristics of datasets on classification performance. J. Adv. Res. Comput. Sci. Software Eng., 6: 198-204.

Lin, W.J. and J.J. Chen, 2012. Class-imbalanced classifiers for high-dimensional data. Briefings Bioinformatics, 14: 13-26.

Liu, C.L., 2008. Partial discriminative training for classification of overlapping classes in document analysis. Int. J. Doc. Anal. Recognit., 11: 53-65.

Maszczyk, T. and W. Duch, 2010. Support feature machines: Support vectors are not enough. Proceeding of the 2010 International Joint Conference on Neural Networks (IJCNN), July 18-23, 2010, IEEE, New York, USA., ISBN:978-1-4244-6918-5, pp: 1-8.

Menardi, G. and N. Torelli, 2014. Training and assessing classification rules with imbalanced data. Data Min. Knowl. Discovery, 28: 92-122.

Perez, O.M., P.A. Gutierrez, P. Tino and M.C. Hervas, 2015. Oversampling the minority class in the feature space. IEEE Trans. Neural Netw. Learn. Syst., 27: 1947-1961.

Prati, R.C., G.E. Batista and M.C. Monard, 2004. Class imbalances versus class overlapping: An analysis of a learning system behavior. Proceeding of the 3rd Mexican International Conference on Artificial Intelligence, April 26-30, 2004, Springer, Heidelberg, Germany, ISBN:978-3-540-21459-5, pp: 312-321.

Rahman, M.M. and D.N. Davis, 2013. Addressing the class imbalance problem in medical datasets. Int. J. Mach. Learn. Comput., 3: 224-224.

Saranya, C. and G. Manikandan, 2013. A study on normalization techniques for privacy preserving data mining. Int. J. Eng. Technol., 5: 2701-2704.

Stouhi, A.S. and C.K. Reddy, 2015. Transfer learning for class imbalance problems with inadequate data. Knowl. Inf. Syst., 48: 201-228.

Sumana, B.V. and T. Santhanam, 2016. Optimizing K-means in cascading clustering and classification. Aust. J. Basic Appl. Sci., 10: 184-206.

Verbiest, N., J. Derrac, C. Cornelis, S. Garcia and F. Herrera, 2016. Evolutionary wrapper approaches for training set selection as preprocessing mechanism for support vector machines: Experimental evaluation and support vector analysis. Appl. Soft Comput., 38: 10-22.

Weiss, G.M., 2004. Mining with rarity: A unifying framework. ACM SIGKDD Explorations Newsl., 6: 7-19.
