[Zurück]


Zeitschriftenartikel:

C. Kolbitsch, M. Egele, Ch. Platzer:
"Removing web spam links from search engine results";
Journal in Computer Virology, Online First (2009), 11416; S. 1 - 12.



Kurzfassung englisch:
Web spam denotes the manipulation of web pages with the sole intent to raise their position in search engine rankings. Since a better position in the rankings directly and positively affects the number of visits to a site, attackers use different techniques to boost their pages to higher ranks. In the best case, web spam pages are a nuisance that provide undeserved advertisement revenues to the page owners. In the worst case, these pages pose a threat to Internet users by hosting malicious content and launching drive-by attacks against unsuspecting victims. When successful, these driveby attacks then install malware on the victims' machines. In this paper, we introduce an approach to detect web spam pages in the list of results that are returned by a search engine. In a first step, we determine the importance of different page features to the ranking in search engine results. Based on this information, we develop a classification technique that uses important features to successfully distinguish spam sites from legitimate entries. By removing spam sites from the results, more slots are available to links that point to pages with useful content. Additionally, and more importantly, the threat posed by malicious web sites can be mitigated, reducing the risk for users to get infected by malicious code that spreads via drive-by attacks.


"Offizielle" elektronische Version der Publikation (entsprechend ihrem Digital Object Identifier - DOI)
http://dx.doi.org/10.1007/s11416-009-0132-6

Elektronische Version der Publikation:
http://publik.tuwien.ac.at/files/PubDat_180382.pdf



Zugeordnete Projekte:
Projektleitung Paolo Milani Comparetti:
Worldwide Observatory of Malicious Behaviors and Attack Threats

Projektleitung Christian Platzer:
SECoverer - Finding Security Vulnerabilities in Web Applications


Erstellt aus der Publikationsdatenbank der Technischen Universität Wien.