PMID- 34914842 OWN - NLM STAT- MEDLINE DCOM- 20220419 LR - 20220722 IS - 1521-4036 (Electronic) IS - 0323-3847 (Print) IS - 0323-3847 (Linking) VI - 64 IP - 4 DP - 2022 Apr TI - Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome. PG - 714-732 LID - 10.1002/bimj.202000334 [doi] AB - Zeros in compositional data are very common and can be classified into rounded and essential zeros. The rounded zero refers to a small proportion or below detection limit value, while the essential zero refers to the complete absence of the component in the composition. In this article, we propose a new framework for analyzing compositional data with zero entries by introducing a stochastic representation. In particular, a new distribution, namely the Dirichlet composition distribution, is developed to accommodate the possible essential-zero feature in compositional data. We derive its distributional properties (e.g., its moments). The calculation of maximum likelihood estimates via the Expectation-Maximization (EM) algorithm will be proposed. The regression model based on the new Dirichlet composition distribution will be considered. Simulation studies are conducted to evaluate the performance of the proposed methodologies. Finally, our method is employed to analyze a dataset of fluorescence in situ hybridization (FISH) for chromosome detection. CI - (c) 2021 The Authors. Biometrical Journal published by Wiley-VCH GmbH. FAU - Tang, Man-Lai AU - Tang ML AUID- ORCID: 0000-0003-3934-2676 AD - Department of Mathematics, College of Engineering, Design & Physical Sciences, Brunel University London, Uxbridge, United Kingdom. FAU - Wu, Qin AU - Wu Q AUID- ORCID: 0000-0002-9986-7530 AD - Department of Statistics, School of Mathematical Sciences, South China Normal University, Guangzhou City, Guangdong, P. R. China. FAU - Yang, Sheng AU - Yang S AD - Zhongshan People's Hospital, Zhongshan, P. R. China. FAU - Tian, Guo-Liang AU - Tian GL AD - Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen City, Guangdong, P. R. China. LA - eng PT - Journal Article PT - Research Support, Non-U.S. Gov't DEP - 20211216 PL - Germany TA - Biom J JT - Biometrical journal. Biometrische Zeitschrift JID - 7708048 SB - IM MH - *Algorithms MH - *Chromosomes MH - Computer Simulation MH - In Situ Hybridization, Fluorescence MH - Likelihood Functions MH - Poisson Distribution PMC - PMC9300144 OTO - NOTNLM OT - Dirichlet distribution OT - EM algorithm OT - compositional data OT - essential zero OT - gamma distribution OT - rounded zeros OT - stochastic representation COIS- The authors have declared no conflict of interest. EDAT- 2021/12/17 06:00 MHDA- 2022/04/20 06:00 PMCR- 2022/07/20 CRDT- 2021/12/16 17:28 PHST- 2021/08/24 00:00 [revised] PHST- 2020/11/09 00:00 [received] PHST- 2021/08/31 00:00 [accepted] PHST- 2021/12/17 06:00 [pubmed] PHST- 2022/04/20 06:00 [medline] PHST- 2021/12/16 17:28 [entrez] PHST- 2022/07/20 00:00 [pmc-release] AID - BIMJ2328 [pii] AID - 10.1002/bimj.202000334 [doi] PST - ppublish SO - Biom J. 2022 Apr;64(4):714-732. doi: 10.1002/bimj.202000334. Epub 2021 Dec 16.