Please use this identifier to cite or link to this item:
Files in This Item:
File SizeFormat 
66192016X.pdf3 MBAdobe PDFView/Open
Title: Model-based cluster analysis applied to flow cytometry data of phytoplankton
Authors: Mucha, H.-J.Simon, U.Brüggemann, R.
Issue Date: 2002
Published in: Technical report // Weierstraß-Institut für Angewandte Analysis und Stochastik, Berlin, Volume 5, ISSN 1618-7776
Publisher: Berlin : Weierstraß-Institut für Angewandte Analysis und Stochastik
Abstract: Starting from well-known model-based clustering models equivalent formulations for some special models based on pairwise distances are presented. Moreover, these models can be generalized in order to taking into account both weights of observations and weights of variables. Well-known cluster analysis techniques like the iterative partitional K-means method or the agglomerative hierarchical Ward method are useful for discovering partitions or hierarchies in the underlying data. Here these methods are generalised in two ways, firstly by using weighted observations and secondly by allowing different volumes of clusters. Then a more general K-means approach based on pair-wise distances is recommended. Simulation studies are carried out in order to compare the new clustering techniques with the well-known ones. Afterwards a successful application in the field of freshwater ecology is presented. As an example, the cluster analysis of a snapshot from monitoring of phytoplankton (algae) is considered in more detail. Indeed, monitoring by microscope is very time- and work-consuming. Flow cytometry provides the opportunity to investigate algae communities in a semiautomatic way. Statistical data analysis and cluster analysis can at least support the investigations. Here a combination of agglomerative hierarchical clustering and iterative clustering is recommended. In order to give some insight into the data under investigation several univariate, bivariate and multivariate visualizations are proposed.
Keywords: Cluster analysis; K-means; data mining; principal components analysis; fresh ecology; phytoplankton; flow cytometry
DDC: 510
License: This document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed via the internet or passed on to external parties.
Dieses Dokument darf im Rahmen von § 53 UrhG zum eigenen Gebrauch kostenfrei heruntergeladen, gelesen, gespeichert und ausgedruckt, aber nicht im Internet bereitgestellt oder an Außenstehende weitergegeben werden.
Appears in Collections:Mathematik

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.