FREE Registration is required
Overview:
While Web archive quality is endangered by Web spam, a side effect of the high commercial value of top-ranked search-engine results, so far Web spam filtering technologies are rarely used by Web archivists. This paper makes the first attempt to disseminate existing methodology and envision a solution for Web archives to share knowledge and unite efforts in Web spam hunting. It surveys the state of the art in Web spam filtering illustrated by the recent Web spam challenge data sets and techniques and describe the filtering solution for archives envisioned in the LiWA - Living Web Archives project.
(Is this item miscategorized? Does it need more tags? Let us know.)
| Format: | Size: | 531 KB | |
| Date: | Aug 2008 | ||
| Pages: | 9 |
Top results from Security Management
» View all Security Management listings
Top results from Data Mining - Analysis
White Papers, Webcasts, and Resources
- Live Webcast: Enhanced Availability in a Virtual Data Center with the Dell PS Series and Microsoft Windows Server 2008 R2 Hyper-V Dell EqualLogicLearn how to use the new features of Microsoft Windows Server 2008 R2 Hyper-V to boost the availability of your virtualized data center.
- Live Webcast: Get Control over SaaS Application Access TriCipherLearn to simplify and protect access to your company's data in Software-as-a-Service (SaaS) apps using identity and access management best practices.
- Outsourcing the data centre to a carrier neutral data centre operator in Europe Telecity GroupFind out how to drive down the cost of your IT environment--and drive up the reliability and quality of your service--by outsourcing your data center.











