Publications

Detailed Information

Clustering short time series gene expression data

DC Field Value Language
dc.contributor.authorErnst, J.-
dc.contributor.authorNau, G. J.-
dc.contributor.authorBar-Joseph, Z.-
dc.date.accessioned2009-12-24T11:55:31Z-
dc.date.available2009-12-24T11:55:31Z-
dc.date.issued2005-06-18-
dc.identifier.citationBioinformatics. 2005 Jun;21 Suppl 1:i159-68.en
dc.identifier.issn1367-4803 (Print)-
dc.identifier.urihttp://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15961453-
dc.identifier.urihttps://hdl.handle.net/10371/22644-
dc.description.abstractMOTIVATION: Time series expression experiments are used to study a wide range of biological systems. More than 80% of all time series expression datasets are short (8 time points or fewer). These datasets present unique challenges. On account of the large number of genes profiled (often tens of thousands) and the small number of time points many patterns are expected to arise at random. Most clustering algorithms are unable to distinguish between real and random patterns. RESULTS: We present an algorithm specifically designed for clustering short time series expression data. Our algorithm works by assigning genes to a predefined set of model profiles that capture the potential distinct patterns that can be expected from the experiment. We discuss how to obtain such a set of profiles and how to determine the significance of each of these profiles. Significant profiles are retained for further analysis and can be combined to form clusters. We tested our method on both simulated and real biological data. Using immune response data we show that our algorithm can correctly detect the temporal profile of relevant functional categories. Using Gene Ontology analysis we show that our algorithm outperforms both general clustering algorithms and algorithms designed specifically for clustering time series gene expression data. AVAILABILITY: Information on obtaining a Java implementation with a graphical user interface (GUI) is available from http://www.cs.cmu.edu/~jernst/st/ SUPPLEMENTARY INFORMATION: Available at http://www.cs.cmu.edu/~jernst/st/en
dc.language.isoenen
dc.publisherOxford University Pressen
dc.subjectAlgorithmsen
dc.subjectCell Line, Tumoren
dc.subjectComputational Biology/*methodsen
dc.subjectComputer Simulationen
dc.subjectHelicobacter pylori/metabolismen
dc.subjectHumansen
dc.subjectImmune Systemen
dc.subjectInterneten
dc.subjectModels, Theoreticalen
dc.subjectNeoplasms/microbiologyen
dc.subjectOligonucleotide Array Sequence Analysisen
dc.subjectProgramming Languagesen
dc.subjectSoftwareen
dc.subjectTime Factorsen
dc.subjectCluster Analysis-
dc.subjectGene Expression Profiling-
dc.subjectGene Expression Regulation-
dc.titleClustering short time series gene expression dataen
dc.typeArticleen
dc.identifier.doi10.1093/bioinformatics/bti1022-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share