Journal of Engineering and Applied Sciences

Year: 2017
Volume: 12
Issue: 21
Page No. 5554 - 5559

Canonical Data Model for Text Document Clustering

Authors : Siti Sakira Kamaruddin, Yuhanis Yusof, Farzana Kabir Ahmad and Mohammed Ahmed Taiye

References

Balabantaray, R.C., C. Sarma and M. Jha, 2013. Document clustering using K-means and K-medoids. Int. J. Knowledge Based Comput. Syst., 1: 7-13.
Direct Link  |  

Berkhin, P., 2002. Survey of Clustering Data Mining Techniques. Accrue Software Inc, San Jose, California, Pages: 56.

Bloechle, J.L., M. Rigamonti, K. Hadjar, D. Lalanne and R. Ingold, 2006. XCDF: A canonical and structured document format. Proceedings of the 7th International Workshop on Document Analysis Systems, February 13-15, 2006, Springer, Nelson, New Zealand, pp: 141-152.

Dietrich, M. and J. Lemcke, 2011. A refined canonical data model for multi-schema integration and mapping. Proceedings of the 2011 IEEE 8th International Conference on E-Business Engineering (ICEBE), October 19-21, 2011, IEEE, Beijing, China, ISBN:978-1-4577-1404-7, pp: 105-110.

Gonzalez, M.A., J.R. Marti and P. Kruchten, 2011. A canonical data model for simulator interoperation in a collaborative system for disaster response simulation. Proceedings of the 2011 24th International Canadian Conference on Electrical and Computer Engineering (CCECE), May 8-11, 2011, IEEE, Niagara Falls, Ontario, Canada, ISBN:978-1-4244-9788-1, pp: 001519-001522.

Hachey, B., G. Murray and D. Reitter, 2006. Dimensionality reduction aids term co-occurrence based multi-document summarization. Proceedings of the Workshop on Task-Focused Summarization and Question Answering, July 23-23, 2006, Association for Computational Linguistics, Sydney, Australia, ISBN:1-932432-79-5, pp: 1-7.

Han, J., J. Pei and M. Kamber, 2011. Data Mining: Concepts and Techniques. 3rd Edn., Elsevier, Amsterdam, Netherlands, Pages: 702.

Iria, M. and P. Ciravegna, 2005. Relation extraction for mining the Semantic Web. Dagstuhl, Wadern, Germany.

Jing, L., M.K. Ng, J. Xu and J.Z. Huang, 2005. Subspace clustering of text documents with feature weighting K-means algorithm. Proceedings of the 9th International Conference on Pacific-Asia Advances in Knowledge Discovery and Data Mining, May 18-20, 2005, Springer, Hanoi, Vietnam, pp: 802-812.

Luo, C., Y. Li and S.M. Chung, 2009. Text document clustering based on neighbors. Data Knowl. Eng., 68: 1271-1288.
CrossRef  |  Direct Link  |  

Mihalcea, R., 2004. Graph-based ranking algorithms for sentence extraction, applied to text summarization. Proceedings of the 2004 ACL Conference on Interactive poster and demonstration sessions, July 21-26, 2004, Association for Computational Linguistics, Barcelona, Spain, pp: 181-184.

Narang, T., 2015. Hiearchical clustering of documents: A brief study and implementation in Matlab. Intl. J. Innovations Adv. Comput. Sci. IJIACS., 4: 153-159.
Direct Link  |  

Salton, G. and C. Buckley, 1988. Term-weighting approaches in automatic text retrieval. Inform. Process. Manage., 24: 513-523.
CrossRef  |  Direct Link  |  

Song, W. and S.C. Park, 2007. A novel document clustering model based on latent semantic analysis. Proceedings of the 3rd International Conference on Semantics, Knowledge and Grid, October 29-31, 2007, IEEE, Shan Xi, China, ISBN:0-7695-3007-9, pp: 539-542.

Yang, S. and Y. Wang, 2009. Density-based clustering of massive short messages using domain ontology. Proceedings of the Conference on Information Processing Asia-Pacific APCIP Vol. 2, July 18-19, 2009, IEEE, Shenzhen, China, ISBN:978-0-7695-3699-6, pp: 505-508.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved