Publications
Detailed Information
Fast Outlier Detection Despite the Duplicates
Cited 6 time in
Web of Science
Cited 9 time in Scopus
- Authors
- Issue Date
- 2013
- Publisher
- ASSOC COMPUTING MACHINERY
- Citation
- PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), pp.195-196
- Abstract
- Given a large cloud of multi-dimensional points, and an off-the-shelf outlier detection method, why does it take a week to finish? After careful analysis, we discovered that duplicate points create subtle issues, that the literature has ignored: if d(max) is the multiplicity of the most over-plotted point, typical algorithms are quadratic on d(max). We propose several ways to eliminate the problem; we report wall-clock times and our time savings; and we show that our methods give either exact results, or highly accurate approximate ones.
- Files in This Item:
- There are no files associated with this item.
- Appears in Collections:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.