Publications

Detailed Information

Fast outlier detection for very large log data

DC Field Value Language
dc.contributor.authorKim, Seung-
dc.contributor.authorCho, Nam Wook-
dc.contributor.authorKang, Bokyoung-
dc.contributor.authorKang, Suk Ho-
dc.date.accessioned2011-12-01T01:44:54Z-
dc.date.available2011-12-01T01:44:54Z-
dc.date.issued2011-08-01-
dc.identifier.citationEXPERT SYSTEMS WITH APPLICATIONS; Vol.38 8; 9587-9596-
dc.identifier.issn0957-4174-
dc.identifier.urihttps://hdl.handle.net/10371/74917-
dc.description.abstractDensity-based outlier detection identifies an outlying observation with reference to the density of the surrounding space. In spite of the several advantages of density-based outlier detections, its computational complexity remains one of the major barriers to its application. The purpose of the present study is to reduce the computation time of LOF (Local Outlier Factor), a density-based outlier detection algorithm. The proposed method incorporates kd-tree indexing and an approximated k-nearest neighbors search algorithm (ANN). Theoretical analysis on the approximation of nearest neighbor search was conducted. A set of experiments was conducted to examine the performance of the proposed algorithm. The results show that the method can effectively detect local outliers in a reduced computation time. (C) 2011 Elsevier Ltd. All rights reserved.-
dc.language.isoen-
dc.publisherPERGAMON-ELSEVIER SCIENCE LTD-
dc.subjectDensity-based outlier detection-
dc.subjectKd-tree-
dc.subjectApproximated k-nearest neighbors-
dc.subjectIntrusion (novelty, anomaly) detection-
dc.titleFast outlier detection for very large log data-
dc.typeArticle-
dc.contributor.AlternativeAuthor김성-
dc.contributor.AlternativeAuthor조남욱-
dc.contributor.AlternativeAuthor강보경-
dc.contributor.AlternativeAuthor강석호-
dc.identifier.doi10.1016/j.eswa.2011.01.162-
dc.citation.journaltitleEXPERT SYSTEMS WITH APPLICATIONS-
dc.description.citedreferenceHwang SS, 2009, COMPUT SECUR, V28, P85, DOI 10.1016/j.cose.2008.10.002-
dc.description.citedreferenceASUNCION A, 2007, UCI MACHINE LEARNING-
dc.description.citedreferencePOKRAJAC D, 2007, IEEE S COMP INT DAT, P504-
dc.description.citedreferenceYUE DM, 2007, P 2007 INT C WIR COM, P5514-
dc.description.citedreferenceMOUNT DM, 2006, ANN LIB APPROXIMATE-
dc.description.citedreferenceZHANG JO, 2006, P 2006 IEEE INT C CO, P2388-
dc.description.citedreferenceBENGAL I, 2005, DATA MINING KNOWLEDG-
dc.description.citedreferenceMALOOF MA, 2005, MACHINE LEARNING DAT-
dc.description.citedreferenceAGYEMANG M, 2004, P 15 INF RES MAN ASS, P5-
dc.description.citedreferenceLAZAREVIC A, 2003, P 3 SIAM INT C DAT M-
dc.description.citedreferenceMedioni G, 2001, IEEE T PATTERN ANAL, V23, P873, DOI 10.1109/34.946990-
dc.description.citedreferenceAGGARWAL CC, 2001, P 2001 ACM SIGMOD IN, P37-
dc.description.citedreferenceBREUNIG MM, 2000, P ACM SIGMOD INT C M, P93-
dc.description.citedreferenceRAMASWAMY S, 2000, P 2000 ACM SIGMOD IN, P427-
dc.description.citedreferenceArya S, 1998, J ACM, V45, P891-
dc.description.citedreferenceAGRAWAL R, 1998, P ACM SIGMOD INT C M, P94-
dc.description.citedreferenceGUHA S, 1998, P ACM SIGMOD INT C M, P73-
dc.description.citedreferenceEzawa KJ, 1996, IEEE EXPERT, V11, P45, DOI 10.1109/64.539016-
dc.description.citedreferenceESTER M, 1996, P 2 INT C KNOWL DISC, P226-
dc.description.citedreferenceZHANG T, 1996, P 1996 ACM SIGMOD IN, P103-
dc.description.citedreferenceFALOUTSOS C, 1995, P 1995 ACM SIGMOD IN, P163-
dc.description.citedreferenceBARNETT V, 1994, OUTLIERS STAT DATA-
dc.description.citedreferenceBENTLEY JL, 1990, P 6 ANN ACM S COMP G, P187-
dc.description.citedreferenceFUKUNAGA K, 1990, INTRO STAT PATTERN R-
dc.description.citedreferencePRESS WH, 1988, NUMERICAL RECIPES C-
dc.description.citedreferenceHAWKINS D, 1980, IDENTIFICATION OUTLI-
dc.description.citedreferenceSTRANG G, 1980, LINEAR ALGEBRA ITS A-
dc.description.citedreferenceFRIEDMAN JH, 1977, ACM T MATH SOFTWARE, V3, P209-
dc.description.citedreferenceHINKLEY DV, 1969, BIOMETRIKA, V56, P635-
dc.description.citedreferenceMACQUEEN J, 1967, P 5 BERK S MATH STAT, P291-
dc.description.tc0-
dc.identifier.wosid000290237500061-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share