Construction and Reduction Methods of Web Spam Identification Index System

Yuancheng Li; Rong Huang; Xiangqian Nie

doi:10.2174/2213275912666181127130120

ISSN: 2213-2759
E-ISSN: 1874-4796

Construction and Reduction Methods of Web Spam Identification Index System
By Yuancheng Li, Rong Huang and Xiangqian Nie
Source: Recent Patents on Computer Science, Volume 12, Issue 3, Aug 2019, p. 202 - 211
DOI: https://doi.org/10.2174/2213275912666181127130120
- Available online: 01 Aug 2019

Abstract

Background: With the rapid development of the Internet, the number of web spam has increased dramatically in recent years, which has wasted search engine storage and computing power on a massive scale. To identify the web spam effectively, the content features, link features, hidden features and quality features of web page are integrated to establish the corresponding web spam identification index system. However, the index system is highly correlation dimension. Methods: An improved method of autoencoder named stacked autoencoder neural network (SAE) is used to realize the reduction of the web spam identification index system. Results: The experiment results show that our method could reduce effectively the index of web spam and significantly improves the recognition rate in the following work. Conclusion: An autoencoder based web spam indexes reduction method is proposed in this paper. The experimental results show that it greatly reduces the temporal and spatial complexity of the future web spam detection model.

Article metrics loading...

/content/journals/cseng/10.2174/2213275912666181127130120

2019-08-01

2026-02-21

From This Site

/content/journals/cseng/10.2174/2213275912666181127130120

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/cseng/10.2174/2213275912666181127130120

Article Type: Research Article

Keyword(s): Autoencoder; detection model; identification index system; index reduction; stacked autoencoder neural network; web spam

Construction and Reduction Methods of Web Spam Identification Index System

Abstract

From This Site

Most Read This Month