Abstract: Local general depth (LGD) functions are used for describing the local geometric features and mode(s) in multivariate distributions. In this paper, we undertake a rigorous systematic study of LGD and establish several analytical and statistical properties. First, we show that, when the underlying probability distribution is absolutely continuous with density f(·), the scaled version of LGD (referred to as t-approximation) converges, uniformly and in Ld (Rp) to f(·) when t converges to zero. Second, we establish that, as the sample size diverges to infinity the centered and scaled sample LGD converge in distribution to a centered Gaussian process uniformly in the space of bounded functions on HG, a class of functions yield-ing LGD. Third, using the sample version of the t-approximation (StA) and the gradient system analysis, we develop a new clustering algorithm. The validity of this algorithm requires several results concerning the uniform finite difference approximation of the gradient system associated with StA. For this reason, we establish Bernstein-type inequality for deviations between the centered and scaled sample LGD, which is also of indepen-dent interest. Finally, invoking the above results, we establish consistency of the clustering algorithm. Applications of the proposed methods to mode estimation and upper level set estimation are also provided. Finite sample performance of the methodology are evaluated using numerical experiments and data analysis.
Authorship: Francisci G., Agostinelli C., Nieto-Reyes A., Vidyashankar A.N.,
Fuente: Electronic Journal of Statistics, 2023, 17(1), 688-722
Publisher: Institute of Mathematical Statistics and Bernoulli Society
Year of publication: 2023
No. of pages: 35
Publication type: Article
DOI: 10.1214/23-EJS2110
ISSN: 1935-7524
Publication Url: https://doi.org/10.1214/23-EJS2110