Research Journal of Applied Sciences

Year: 2014
Volume: 9
Issue: 5
Page No. 288 - 294

Performance Intensification for Automatic Template Using World Wide Web

Authors : G. Naveen Sundar, D. Narmadha and A.P. Haran

References

Arasu, A. and H. Garcia-Molina, 2003. Extracting structured data from web pages. Proceedings of the ACM SIGMOD International Conference on Management of Data, June 9-12, 2003, San Diego, CA., pp: 337-348.

Bar-Yossef, Z. and S. Rajagopalan, 2002. Template detection via data mining and its applications. Proceedings of the 11th International Conference on World Wide, May 7-11, 2002, Honolulu, Hawaii, USA., pp: 580-591.

Chen, L., S. Ye and X. Li, 2006. Template detection for large scale search engines. Proceedings of the ACM Symposium on Applied Computing, April 23-27, 2006, Dijon, France, pp: 1094-1098.

Gupta, S., G. Kaiser, D. Neistadt and P. Grimm, 2003. DOM based content extraction of HTML documents. Proceedings of the 12th international conference on World Wide Web, May 20-24, 2003, NewYork, USA., pp: 207-214.

Jushmerick, N., 1999. Learning to remove internet advertisements. Proceedings of the 3rd Annual Conference on Autonomous Agents, May 1-5, 1999, Seattle, WA., USA., pp: 175-181.

Ma, L., N. Goharian, A. Chowdhury and M. Chung, 2003. Extracting unstructured data from template generated web documents. Proceedings of the 12th International Conference on Information and Knowledge Management, November 3-8, 2003, New Orleans, Louisiana, USA., pp: 512-515.

Reis, D.C., P.B. Golgher, A.S. Silva and A.F. Laender, 2004. Automatic Web news extraction using tree edit distance. Proceedings of the 13th International Conference on World Wide Web, May 17-22, 2004, New York, USA., pp: 502-511.

Vieira, K., A.S. da Silva, N. Pinto, E.S. de Moura, J.M.B. Cavalcanti and J. Friere, 2006. A fast and robust method for web page template detection and removal. Proceedings of the 15th ACM International Conference on Information and Knowledge Management, November 5-11, 2006, Arlington, Virginia, USA., pp: 258-267.

Wang, Y., B. Fang, X. Cheng, G. Li and H. Xu, 2008. Incremental web page template detection by text segments. Proceedings of the IEEE International Workshop on Semantic Computing and Systems, July 14-15, 2008, Huangshan, pp: 174-180.

Weninger, T., W.H. Hsu and J. Han, 2010. CETR: Content extraction via tag ratios. Proceedings of the 19th International Conference on World Wide Web, April 26-30, 2010, Raleigh, North Carolina, USA., pp: 971-980.

Yi, L., B. Liu and X. Li, 2003. Eliminating noisy information in web pages for data mining. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 24-27, Washington, DC. New York, pp: 296-305.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved