Publications

Detailed Information

Fast Outlier Detection Despite the Duplicates

Cited 6 time in Web of Science Cited 9 time in Scopus
Authors

Lee, Jay-Yoon; Kang, U.; Koutra, Danai; Faloutsos, Christos

Issue Date
2013
Publisher
ASSOC COMPUTING MACHINERY
Citation
PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), pp.195-196
Abstract
Given a large cloud of multi-dimensional points, and an off-the-shelf outlier detection method, why does it take a week to finish? After careful analysis, we discovered that duplicate points create subtle issues, that the literature has ignored: if d(max) is the multiplicity of the most over-plotted point, typical algorithms are quadratic on d(max). We propose several ways to eliminate the problem; we report wall-clock times and our time savings; and we show that our methods give either exact results, or highly accurate approximate ones.
URI
https://hdl.handle.net/10371/201074
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Related Researcher

  • Graduate School of Data Science
Research Area Constraint injection, Energy-based models, Structured Prediction

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share