\externaldocument

direg_SM

Structural adaptation via directional regularity: rate accelerated estimation in multivariate functional data

Omar Kassi111Ensai, CREST - UMR 9194, France; [email protected] , Sunny G.W. Wang222Ensai, CREST - UMR 9194, France; [email protected]
(September 5, 2024)
Abstract

We introduce directional regularity, a new definition of anisotropy for multivariate functional data. Instead of taking the conventional view which determines anisotropy as a notion of smoothness along a dimension, directional regularity additionally views anisotropy through the lens of directions. We show that faster rates of convergence can be obtained through a change-of-basis by adapting to the directional regularity of a multivariate process. An algorithm for the estimation and identification of the change-of-basis matrix is constructed, made possible due to the unique replication structure of functional data. Non-asymptotic bounds are provided for our algorithm, supplemented by numerical evidence from an extensive simulation study. We discuss two possible applications of the directional regularity approach, and advocate its consideration as a standard pre-processing step in multivariate functional data analysis.

Key words: Anisotropy; Multivariate functional data; Hölder exponent

MSC2020: 62R10; 62G07; 62M99; 60G22

1 Introduction

Anisotropy is a key concept in probability and statistics. In the context of non-parametric statistics, it roughly states that the regularity is different between dimensions. Let (Xm,Ym)subscript𝑋𝑚subscript𝑌𝑚(X_{m},Y_{m})( italic_X start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) be independent and identically distributed (iid) pairs observed under the model

Ym=f(Xm)+ϵm,m=1,,M0,formulae-sequencesubscript𝑌𝑚𝑓subscript𝑋𝑚subscriptitalic-ϵ𝑚𝑚1subscript𝑀0Y_{m}=f(X_{m})+\epsilon_{m},\qquad m=1,\dots,M_{0},italic_Y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_f ( italic_X start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) + italic_ϵ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_m = 1 , … , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,

with f:[0,1]d:𝑓superscript01𝑑f:[0,1]^{d}\rightarrow\mathbb{R}italic_f : [ 0 , 1 ] start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → blackboard_R, and the design points Xmsubscript𝑋𝑚X_{m}italic_X start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT’s are uniformly distributed on the hypercube [0,1]dsuperscript01𝑑[0,1]^{d}[ 0 , 1 ] start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. The errors ϵmsubscriptitalic-ϵ𝑚\epsilon_{m}italic_ϵ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT are assumed to be uncorrelated, centered random variables. When f𝑓fitalic_f belongs to an anisotropic Hölder class, it is well known that under suitable assumptions on the noise, the minimax rate of estimation is Mβ/(2β+1)superscript𝑀𝛽2𝛽1M^{-\beta/(2\beta+1)}italic_M start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + 1 ) end_POSTSUPERSCRIPT, where β𝛽\betaitalic_β is given by

1β=i=1d1βi,1𝛽superscriptsubscript𝑖1𝑑1subscript𝛽𝑖\frac{1}{\beta}=\sum_{i=1}^{d}\frac{1}{\beta_{i}},divide start_ARG 1 end_ARG start_ARG italic_β end_ARG = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG , (1)

with βisubscript𝛽𝑖\beta_{i}italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denoting the regularity along dimension i𝑖iitalic_i. The parameter β𝛽\betaitalic_β is also known as the “effective smoothness" of f𝑓fitalic_f, since it characterizes the regularity of f𝑓fitalic_f across different dimensions. Since β1mini=1,,dβi1superscript𝛽1subscript𝑖1𝑑superscriptsubscript𝛽𝑖1\beta^{-1}\leq\min_{i=1,\dots,d}\beta_{i}^{-1}italic_β start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ roman_min start_POSTSUBSCRIPT italic_i = 1 , … , italic_d end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, anisotropy brings along better rates of convergence in the non-parametric regression setup of (1).

The regularities βi,i=1,,dformulae-sequencesubscript𝛽𝑖𝑖1𝑑\beta_{i},i=1,\dots,ditalic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , … , italic_d are usually considered along the canonical basis, the domain where the data is commonly observed on. However, this can lead to misleading conclusions since anisotropy is not invariant to directions. A relatively recent body of work in non-parametric statistics aims to give consideration to the directional nature of anisotropy, called structural adaptation (Lepski, (2015)). See Samarov and Tsybakov, (2004), Lepski and Rebelles, (2020), Ammous et al., (2024) for some examples in the density model. In spatial data, this has been considered under the umbrella of fractal analysis, for example in the seminal work of Davies and Hall, (1999).

Surprisingly, to the best of our knowledge, the directional nature of anisotropy has yet to be examined in multivariate functional data analysis. We aim to fill this gap by giving a rigorous definition of anisotropy through the directional regularity of a process. Through several examples, we demonstrate that performing a change-of-basis can increase the effective smoothness of a process along the canonical basis. This can lead to faster rates of convergence for ubiquitous tasks such as smoothing in functional data analysis. This change-of-basis matrix is intimately related to the notion of directional regularity that we introduce in this paper, and an algorithm is constructed to estimate it.

Functional data analysis (FDA) is a burgeoning field that deals with modern, complex data, such as those collected by sensors. For an introduction, see the textbooks Ramsay and Silverman, (2005), Horváth and Kokoszka, (2012), Kokoszka and Reimherr, (2017). FDA deals with data sets where the datum are functions, regarded as realizations of a stochastic process or random field, defined over a continuous domain. A unique feature of functional data is replication, where several sample paths are observed.

The remainder of this paper is organized as follows. In Section 2 we give a formal definition of directional regularity, which is elucidated through Lemma 1 and several examples. Lemma 1, which states that the regularity of a surface is the same in every direction except possibly one, is a key property that motivates much of the paper. In Section 3 we describe the estimation procedure of the parameters associated to the directional regularity approach. Section 4 studies the theoretical properties of our Algorithm, where non-asymptotic bounds are provided. Section 5 describes an extensive simulation study that illustrates the good finite sample properties of our Algorithms. Section 6 discusses additional properties of our Algorithm, such as computational considerations and other interesting insights. Section 7 describes some natural applications of our directional regularity approach, including surface smoothing and anisotropic detection. All the proofs are relegated to the Supplementary Material, which also contain additional simulation results.

Figure 1: In general, the worst regularity H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT will be obtained on each direction of canonical basis, leading to isotropy (here H1<H2)H_{1}<H_{2})italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). This can lead to slower rates of convergence. A change-of-basis from (𝐞𝟏,𝐞𝟐)subscript𝐞1subscript𝐞2(\mathbf{e_{1}},\mathbf{e_{2}})( bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) to (𝐮𝟏,𝐮𝟐)subscript𝐮1subscript𝐮2(\mathbf{u_{1}},\mathbf{u_{2}})( bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) enables one to obtain an anisotropic process. Locating the basis (𝐮𝟏,𝐮𝟐subscript𝐮1subscript𝐮2\mathbf{u_{1}},\mathbf{u_{2}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT) is equivalent to locating the angle between 𝐮𝟏subscript𝐮1\mathbf{u_{1}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT and 𝐞𝟏subscript𝐞1\mathbf{e_{1}}bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT.
Refer to caption

2 Directional Regularity

2.1 Data setting and definition

Let {X(𝐭),𝐭𝒯}𝑋𝐭𝐭𝒯\{X(\mathbf{t}),\mathbf{t}\in\mathcal{T}\}{ italic_X ( bold_t ) , bold_t ∈ caligraphic_T } be a second-order stochastic process on an open, bounded domain 𝒯2𝒯superscript2\mathcal{T}\subset\mathbb{R}^{2}caligraphic_T ⊂ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. In this paper, we will focus on the common design framework, which represents many applications. In particular, functional data collected by sensors and images fall into such a framework. The observations come in the form of pairs (Y(j)(𝐭m),𝐭m)×𝒯superscript𝑌𝑗subscript𝐭𝑚subscript𝐭𝑚𝒯(Y^{(j)}(\mathbf{t}_{m}),\mathbf{t}_{m})\in\mathbb{R}\times\mathcal{T}( italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ∈ blackboard_R × caligraphic_T, generated under the model

Y(j)(𝐭m)=X(j)(𝐭m)+εm(j),1jN,1mM0,formulae-sequenceformulae-sequencesuperscript𝑌𝑗subscript𝐭𝑚superscript𝑋𝑗subscript𝐭𝑚superscriptsubscript𝜀𝑚𝑗1𝑗𝑁1𝑚subscript𝑀0Y^{(j)}(\mathbf{t}_{m})=X^{(j)}(\mathbf{t}_{m})+\varepsilon_{m}^{(j)},\qquad 1% \leq j\leq N,1\leq m\leq M_{0},italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) + italic_ε start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT , 1 ≤ italic_j ≤ italic_N , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (2)

where ε(j)superscript𝜀𝑗\varepsilon^{(j)}italic_ε start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT are independent, centered errors with variance σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. We suppose without loss of generality that the design points 𝐭msubscript𝐭𝑚\mathbf{t}_{m}bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT are observed on the canonical basis, and that 𝒯=(0,1)2𝒯superscript012\mathcal{T}=(0,1)^{2}caligraphic_T = ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The random design case with heteroscedastic errors, together with other possible extensions, are discussed in Section 6.

Definition 1.

Let X𝑋Xitalic_X be a stochastic process with continuous and non-differentiable sample paths, and 𝐮𝕊𝐮𝕊\mathbf{u}\in\mathbb{S}bold_u ∈ blackboard_S be a unit vector. The process X𝑋Xitalic_X has local regularity H𝐮subscript𝐻𝐮H_{\mathbf{u}}italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT at 𝐭𝒯𝐭𝒯\mathbf{t}\in\mathcal{T}bold_t ∈ caligraphic_T along the direction 𝐮𝐮\mathbf{u}bold_u if a non-negative function Lu:𝒯+:subscript𝐿𝑢𝒯subscriptL_{u}:\mathcal{T}\rightarrow\mathbb{R}_{+}italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT : caligraphic_T → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT exists such that :

θ𝐮(𝐭,Δ):=𝔼[{X(𝐭Δ2𝐮)X(𝐭+Δ2𝐮)}2]=L𝐮(𝐭)Δ2H𝐮(𝐭)+G(𝐭,Δ),assignsubscript𝜃𝐮𝐭Δ𝔼delimited-[]superscript𝑋𝐭Δ2𝐮𝑋𝐭Δ2𝐮2subscript𝐿𝐮𝐭superscriptΔ2subscript𝐻𝐮𝐭𝐺𝐭Δ\theta_{\mathbf{u}}(\mathbf{t},\Delta):=\mathbb{E}\left[\left\{X\left(\mathbf{% t}-\frac{\Delta}{2}\mathbf{u}\right)-X\left(\mathbf{t}+\frac{\Delta}{2}\mathbf% {u}\right)\right\}^{2}\right]=L_{\mathbf{u}}(\mathbf{t})\Delta^{2H_{\mathbf{u}% }(\mathbf{t})}+G(\mathbf{t},\Delta),italic_θ start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t , roman_Δ ) := blackboard_E [ { italic_X ( bold_t - divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u ) - italic_X ( bold_t + divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = italic_L start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) end_POSTSUPERSCRIPT + italic_G ( bold_t , roman_Δ ) , (3)

where G𝐮(𝐭,Δ)=Δ0o(Δ2H𝐮(𝐭))superscriptΔ0subscript𝐺𝐮𝐭Δ𝑜superscriptΔ2subscript𝐻𝐮𝐭G_{\mathbf{u}}(\mathbf{t},\Delta)\stackrel{{\scriptstyle\Delta\rightarrow 0}}{% {=}}o\left(\Delta^{2H_{\mathbf{u}}(\mathbf{t})}\right)italic_G start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t , roman_Δ ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG roman_Δ → 0 end_ARG end_RELOP italic_o ( roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) end_POSTSUPERSCRIPT ) for each fixed 𝐭𝐭\mathbf{t}bold_t. The map

H:{𝕊×𝒯(0,1)(𝐮,𝐭)H𝐮(𝐭):𝐻cases𝕊𝒯01otherwisemaps-to𝐮𝐭subscript𝐻𝐮𝐭otherwiseH:\begin{cases}\mathbb{S}\times\mathcal{T}\rightarrow(0,1)\\ (\mathbf{u},\mathbf{t})\mapsto H_{\mathbf{u}}(\mathbf{t})\end{cases}italic_H : { start_ROW start_CELL blackboard_S × caligraphic_T → ( 0 , 1 ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ( bold_u , bold_t ) ↦ italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) end_CELL start_CELL end_CELL end_ROW (4)

is called the directional regularity of X𝑋Xitalic_X.

Definition 1 is local in the sense that the local regularity may vary along the domain 𝒯𝒯\mathcal{T}caligraphic_T. In the following, we consider the case where G𝐮(𝐭,Δ)=O(Δ2H𝐮(𝐭)+ζ)subscript𝐺𝐮𝐭Δ𝑂superscriptΔ2subscript𝐻𝐮𝐭𝜁G_{\mathbf{u}}(\mathbf{t},\Delta)=O(\Delta^{2H_{\mathbf{u}}(\mathbf{t})+\zeta})italic_G start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t , roman_Δ ) = italic_O ( roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) + italic_ζ end_POSTSUPERSCRIPT ) for some ζ>0𝜁0\zeta>0italic_ζ > 0. If the function H𝐻Hitalic_H does not depend on the direction 𝐮𝐮\mathbf{u}bold_u, we say that X𝑋Xitalic_X is an isotropic process, otherwise we call it an anisotropic process. Definition 1 formalizes anisotropy as a notion of regularity not only along a dimension, but also along a direction. This is in contrast to the usual consideration of anisotropy, which typically views it through the lens of the canonical basis. Some examples of processes with prescribed directional regularity will be discussed in Section 2.2.

A natural question that arises is the number of possible regularities that an anisotropic process can take, and how these regularities change with the directions. Lemma 1 provides a definitive answer to the preceding question, stating that there can be at most two different regularities. Furthermore, there exists a unique direction, up to a reflection, for the maximal regularity.

Lemma 1.

Let 𝐭𝒯𝐭𝒯\mathbf{t}\in\mathcal{T}bold_t ∈ caligraphic_T and assume that there exists basis vectors (𝐮𝟏(𝐭),𝐮𝟐(𝐭))𝕊subscript𝐮1𝐭subscript𝐮2𝐭𝕊(\mathbf{u_{1}}(\mathbf{t}),\mathbf{u_{2}}(\mathbf{t}))\in\mathbb{S}( bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT ( bold_t ) , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ( bold_t ) ) ∈ blackboard_S such that H𝐮𝟏(𝐭)<H𝐮𝟐(𝐭)subscript𝐻subscript𝐮1𝐭subscript𝐻subscript𝐮2𝐭H_{\mathbf{u_{1}}}(\mathbf{t})<H_{\mathbf{u_{2}}}(\mathbf{t})italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ) < italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ). Moreover, suppose that the functions L𝐮𝟏subscript𝐿subscript𝐮1L_{\mathbf{u_{1}}}italic_L start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and L𝐮𝟐subscript𝐿subscript𝐮2L_{\mathbf{u_{2}}}italic_L start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT are continuously differentiable. For any 𝐯𝕊𝐯𝕊\mathbf{v}\in\mathbb{S}bold_v ∈ blackboard_S, we have the following dichotomous relationship:

  • If 𝐯±𝐮𝟐𝐯plus-or-minussubscript𝐮2\mathbf{v}\neq\pm\mathbf{u_{2}}bold_v ≠ ± bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT, then H𝐯(𝐭)=H𝐮𝟏(𝐭)subscript𝐻𝐯𝐭subscript𝐻subscript𝐮1𝐭H_{\mathbf{v}}(\mathbf{t})=H_{\mathbf{u_{1}}}(\mathbf{t})italic_H start_POSTSUBSCRIPT bold_v end_POSTSUBSCRIPT ( bold_t ) = italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ).

  • Otherwise, we have H𝐯(𝐭)=H𝐮𝟐(𝐭)subscript𝐻𝐯𝐭subscript𝐻subscript𝐮2𝐭H_{\mathbf{v}}(\mathbf{t})=H_{\mathbf{u_{2}}}(\mathbf{t})italic_H start_POSTSUBSCRIPT bold_v end_POSTSUBSCRIPT ( bold_t ) = italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ).

The proof of Lemma 1 is provided in the Supplementary Material. It states that the map 𝐯H𝐯maps-to𝐯subscript𝐻𝐯\mathbf{v}\mapsto H_{\mathbf{v}}bold_v ↦ italic_H start_POSTSUBSCRIPT bold_v end_POSTSUBSCRIPT can take at most two possible values. This is in alignment with the results in Davies and Hall, (1999), who studied anisotropy from the lens of the fractal dimension of surfaces, and showed that “the fractal dimension of line transects across a surface must either be constant in every direction or be constant in each direction except one".

Let 𝐯(𝐭)=argmax𝐯𝕊H𝐯(𝐭)superscript𝐯𝐭subscript𝐯𝕊subscript𝐻𝐯𝐭\mathbf{v}^{*}(\mathbf{t})={\arg\max}_{\mathbf{v}\in\mathbb{S}}H_{\mathbf{v}}(% \mathbf{t})bold_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_t ) = roman_arg roman_max start_POSTSUBSCRIPT bold_v ∈ blackboard_S end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_v end_POSTSUBSCRIPT ( bold_t ) be the direction that maximizes the regularity of X𝑋Xitalic_X in the sense of Definition 1. Then 𝐯(𝐭)superscript𝐯𝐭\mathbf{v}^{*}(\mathbf{t})bold_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_t ) and 𝐯(𝐭)superscript𝐯𝐭-\mathbf{v}^{*}(\mathbf{t})- bold_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_t ) are the only two possible maximizers, and are “singularities" in the sense that if we locally examine any vector other than ±𝐯(𝐭)plus-or-minussuperscript𝐯𝐭\pm\mathbf{v}^{*}(\mathbf{t})± bold_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_t ), the associated regularity is minu𝕊H𝐮(𝐭)subscript𝑢𝕊subscript𝐻𝐮𝐭\min_{u\in\mathbb{S}}H_{\mathbf{u}}(\mathbf{t})roman_min start_POSTSUBSCRIPT italic_u ∈ blackboard_S end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ). Parameterizing the vectors by their angle 𝐮(β)=cos(β)𝐞1+sin(β)𝐞2𝐮𝛽𝛽subscript𝐞1𝛽subscript𝐞2\mathbf{u}(\beta)=\cos(\beta)\mathbf{e}_{1}+\sin(\beta)\mathbf{e}_{2}bold_u ( italic_β ) = roman_cos ( italic_β ) bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + roman_sin ( italic_β ) bold_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, the maximization problem (𝒫):argmax𝐯𝕊H𝐯(𝐭):𝒫subscript𝐯𝕊subscript𝐻𝐯𝐭(\mathcal{P}):\arg\max_{\mathbf{v}\in\mathbb{S}}H_{\mathbf{v}}(\mathbf{t})( caligraphic_P ) : roman_arg roman_max start_POSTSUBSCRIPT bold_v ∈ blackboard_S end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_v end_POSTSUBSCRIPT ( bold_t ) is equivalent to argmaxβ[0,2π)H𝐮(β)(𝐭)subscript𝛽02𝜋subscript𝐻𝐮𝛽𝐭{\arg\max}_{\beta\in[0,2\pi)}H_{\mathbf{u}(\beta)}(\mathbf{t})roman_arg roman_max start_POSTSUBSCRIPT italic_β ∈ [ 0 , 2 italic_π ) end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( bold_t ). By restricting the domain to [0,π)0𝜋[0,\pi)[ 0 , italic_π ), (𝒫)𝒫(\mathcal{P})( caligraphic_P ) admits an unique solution. (𝒫)𝒫(\mathcal{P})( caligraphic_P ) can thus be parameterized as locating the angle between 𝐯superscript𝐯\mathbf{v}^{*}bold_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and 𝐞𝟏subscript𝐞1\mathbf{e_{1}}bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT, given by

α(𝐭)=argmaxβ[0,π)H𝐮(β)(𝐭).𝛼𝐭subscript𝛽0𝜋subscript𝐻𝐮𝛽𝐭\alpha(\mathbf{t})={\arg\max}_{\beta\in[0,\pi)}H_{\mathbf{u}(\beta)}(\mathbf{t% }).italic_α ( bold_t ) = roman_arg roman_max start_POSTSUBSCRIPT italic_β ∈ [ 0 , italic_π ) end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( bold_t ) . (5)

An illustration can be found in Figure 1.

2.2 Examples

In the following, we provide some examples of processes that motivate Definition 1.

Example 1 (Sums and products of two fractional Brownian motions).

Let H(0,1)𝐻01H\in(0,1)italic_H ∈ ( 0 , 1 ) be the Hurst parameter of the fractional Brownian motion (fBm) {BH(t),t(0,1)}superscript𝐵𝐻𝑡𝑡01\{B^{H}(t),t\in(0,1)\}{ italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_t ) , italic_t ∈ ( 0 , 1 ) }. BHsuperscript𝐵𝐻B^{H}italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT is a centered Gaussian process with covariance function

𝔼[BH(t)BH(s)]=12{|t|2H+|s|2H|ts|2H},(t,s)(0,1)2.formulae-sequence𝔼delimited-[]superscript𝐵𝐻𝑡superscript𝐵𝐻𝑠12superscript𝑡2𝐻superscript𝑠2𝐻superscript𝑡𝑠2𝐻for-all𝑡𝑠superscript012\mathbb{E}\left[B^{H}(t)B^{H}(s)\right]=\frac{1}{2}\left\{|t|^{2H}+|s|^{2H}-|t% -s|^{2H}\right\},\qquad\forall(t,s)\in(0,1)^{2}.blackboard_E [ italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_t ) italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_s ) ] = divide start_ARG 1 end_ARG start_ARG 2 end_ARG { | italic_t | start_POSTSUPERSCRIPT 2 italic_H end_POSTSUPERSCRIPT + | italic_s | start_POSTSUPERSCRIPT 2 italic_H end_POSTSUPERSCRIPT - | italic_t - italic_s | start_POSTSUPERSCRIPT 2 italic_H end_POSTSUPERSCRIPT } , ∀ ( italic_t , italic_s ) ∈ ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (6)

The fBm is a generalization of the standard Brownian motion, which has Hurst parameter H=1/2𝐻12H=1/2italic_H = 1 / 2. Unlike the standard Brownian motion, the increments of the fBm are not necessarily independent. A simple calculation reveals that

𝔼[{BH(t)BH(s)}2]=|ts|2H,(t,s)(0,1)2.formulae-sequence𝔼delimited-[]superscriptsuperscript𝐵𝐻𝑡superscript𝐵𝐻𝑠2superscript𝑡𝑠2𝐻for-all𝑡𝑠superscript012\mathbb{E}\left[\{B^{H}(t)-B^{H}(s)\}^{2}\right]=|t-s|^{2H},\qquad\forall(t,s)% \in(0,1)^{2}.blackboard_E [ { italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_t ) - italic_B start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_s ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = | italic_t - italic_s | start_POSTSUPERSCRIPT 2 italic_H end_POSTSUPERSCRIPT , ∀ ( italic_t , italic_s ) ∈ ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (7)

Let (𝐮𝟏,𝐮𝟐)subscript𝐮1subscript𝐮2(\mathbf{u_{1}},\mathbf{u_{2}})( bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) be a basis of 2superscript2\mathbb{R}^{2}blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, (H1,H2)(0,1)2subscript𝐻1subscript𝐻2superscript012(H_{1},H_{2})\in(0,1)^{2}( italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∈ ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, and B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT,B2subscript𝐵2B_{2}italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT be two independent fBms with Hurst parameters H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT respectively. Set

X1(𝐭)=B1(t1)+B(t2), for any 𝐭(0,1)2,and 𝐭=t1𝐮𝟏+t2𝐮𝟐.formulae-sequencesubscript𝑋1𝐭subscript𝐵1subscript𝑡1𝐵subscript𝑡2formulae-sequence for any 𝐭superscript012and 𝐭subscript𝑡1subscript𝐮1subscript𝑡2subscript𝐮2X_{1}(\mathbf{t})=B_{1}(t_{1})+B(t_{2}),\qquad\text{ for any }\mathbf{t}\in(0,% 1)^{2},\quad\text{and }\mathbf{t}=t_{1}\mathbf{u_{1}}+t_{2}\mathbf{u_{2}}.italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) = italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_B ( italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , for any bold_t ∈ ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , and bold_t = italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT . (8)

The independence of B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and B2subscript𝐵2B_{2}italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, together with (7), implies that for any Δ>0Δ0\Delta>0roman_Δ > 0 sufficiently small, we have

𝔼[{X1(𝐭Δ2𝐮𝐢)X1(𝐭+Δ2𝐮𝐢)}2]=Δ2Hi,i=1,2.formulae-sequence𝔼delimited-[]superscriptsubscript𝑋1𝐭Δ2subscript𝐮𝐢subscript𝑋1𝐭Δ2subscript𝐮𝐢2superscriptΔ2subscript𝐻𝑖𝑖12\mathbb{E}\left[\left\{X_{1}\left(\mathbf{t}-\frac{\Delta}{2}\mathbf{u_{i}}% \right)-X_{1}\left(\mathbf{t}+\frac{\Delta}{2}\mathbf{u_{i}}\right)\right\}^{2% }\right]=\Delta^{2H_{i}},\quad i=1,2.blackboard_E [ { italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t - divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) - italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t + divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_i = 1 , 2 . (9)

By Definition 1, for i=1,2𝑖12i=1,2italic_i = 1 , 2, the process X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT has a local regularity Hisubscript𝐻𝑖H_{i}italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT along the direction 𝐮𝐢subscript𝐮𝐢\mathbf{u_{i}}bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT with G(𝐭,Δ)=0𝐺𝐭Δ0G(\mathbf{t},\Delta)=0italic_G ( bold_t , roman_Δ ) = 0 (ζ=)𝜁(\zeta=\infty)( italic_ζ = ∞ ), and L𝐮𝐢1subscript𝐿subscript𝐮𝐢1L_{\mathbf{u_{i}}}\equiv 1italic_L start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≡ 1.

Similarly, we can define the tensor product of B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and B2subscript𝐵2B_{2}italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT w.r.t the basis (𝐮𝟏,𝐮𝟐)subscript𝐮1subscript𝐮2(\mathbf{u_{1}},\mathbf{u_{2}})( bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ):

X2(𝐭)=B1(t1)×B(t2), for any 𝐭(0,1)2,and 𝐭=t1𝐮𝟏+t2𝐮𝟐.formulae-sequencesubscript𝑋2𝐭subscript𝐵1subscript𝑡1𝐵subscript𝑡2formulae-sequence for any 𝐭superscript012and 𝐭subscript𝑡1subscript𝐮1subscript𝑡2subscript𝐮2X_{2}(\mathbf{t})=B_{1}(t_{1})\times B(t_{2}),\qquad\text{ for any }\mathbf{t}% \in(0,1)^{2},\quad\text{and }\mathbf{t}=t_{1}\mathbf{u_{1}}+t_{2}\mathbf{u_{2}}.italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) = italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) × italic_B ( italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , for any bold_t ∈ ( 0 , 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , and bold_t = italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT . (10)

For any Δ>0Δ0\Delta>0roman_Δ > 0 sufficiently small, we analogously have

𝔼[{X2(𝐭Δ2𝐮𝐢)X2(𝐭+Δ2𝐮𝐢)}2]=|tj|2HjΔ2Hi,i,j{1,2} and ij,formulae-sequence𝔼delimited-[]superscriptsubscript𝑋2𝐭Δ2subscript𝐮𝐢subscript𝑋2𝐭Δ2subscript𝐮𝐢2superscriptsubscript𝑡𝑗2subscript𝐻𝑗superscriptΔ2subscript𝐻𝑖𝑖𝑗12 and 𝑖𝑗\mathbb{E}\left[\left\{X_{2}\left(\mathbf{t}-\frac{\Delta}{2}\mathbf{u_{i}}% \right)-X_{2}\left(\mathbf{t}+\frac{\Delta}{2}\mathbf{u_{i}}\right)\right\}^{2% }\right]=|t_{j}|^{2H_{j}}\Delta^{2H_{i}},\quad i,j\in\{1,2\}\text{ and }i\neq j,blackboard_E [ { italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t - divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) - italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t + divide start_ARG roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = | italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_i , italic_j ∈ { 1 , 2 } and italic_i ≠ italic_j , (11)

so the process X2subscript𝑋2X_{2}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT has a local regularity Hisubscript𝐻𝑖H_{i}italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT along the direction 𝐮𝐢subscript𝐮𝐢\mathbf{u_{i}}bold_u start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT.

Example 2 (Sum and product of two independent Ornstein-Uhlenbeck processes).

Another example that is regularly used in practice is the stationary fractional Ornstein-Uhlenbeck process {U(t),t(0,1)}𝑈𝑡𝑡01\{U(t),t\in(0,1)\}{ italic_U ( italic_t ) , italic_t ∈ ( 0 , 1 ) }, with index ρ(0,2)𝜌02\rho\in(0,2)italic_ρ ∈ ( 0 , 2 ). It is a centered Gaussian process with a covariance function given by

𝔼[U(t)U(s)]=exp(a|ts|ρ), for some a>0 and t,s(0,1).formulae-sequence𝔼delimited-[]𝑈𝑡𝑈𝑠𝑎superscript𝑡𝑠𝜌formulae-sequence for some 𝑎0 and 𝑡𝑠01\mathbb{E}\left[U(t)U(s)\right]=\exp\left(-a|t-s|^{\rho}\right),\quad\text{ % for some }a>0\text{ and }t,s\in(0,1).blackboard_E [ italic_U ( italic_t ) italic_U ( italic_s ) ] = roman_exp ( - italic_a | italic_t - italic_s | start_POSTSUPERSCRIPT italic_ρ end_POSTSUPERSCRIPT ) , for some italic_a > 0 and italic_t , italic_s ∈ ( 0 , 1 ) . (12)

The covariance structure exhibits the following property:

𝔼[{U(t)U(s)}2]=2a|ts|ρ+O(|ts|2ρ),for any t,s(0,1).formulae-sequence𝔼delimited-[]superscript𝑈𝑡𝑈𝑠22𝑎superscript𝑡𝑠𝜌𝑂superscript𝑡𝑠2𝜌for any 𝑡𝑠01\mathbb{E}\left[\{U(t)-U(s)\}^{2}\right]=2a|t-s|^{\rho}+O(|t-s|^{2\rho}),% \qquad\text{for any }t,s\in(0,1).blackboard_E [ { italic_U ( italic_t ) - italic_U ( italic_s ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = 2 italic_a | italic_t - italic_s | start_POSTSUPERSCRIPT italic_ρ end_POSTSUPERSCRIPT + italic_O ( | italic_t - italic_s | start_POSTSUPERSCRIPT 2 italic_ρ end_POSTSUPERSCRIPT ) , for any italic_t , italic_s ∈ ( 0 , 1 ) . (13)

Its bi-dimensional counterpart can be constructed by considering either the sum or the tensor product of two independent Ornstein-Uhlenbeck processes, similar to the fBm in (8) and (10).

Example 3 (Multi-fractional Brownian sheet).

The local regularity in previous examples is constant along the domain 𝒯𝒯\mathcal{T}caligraphic_T. The multifractional Brownian sheet (MfBs) is a generalization of the standard fractional Brownian sheet, where the Hurst parameter is allowed to vary along the domain. See Herbin, (2006) among others. (See Kassi et al., , 2023, Proposition 6) for an illustration of the directional regularity property in the case of MfBs.

3 Methodology

3.1 Estimating equations

Let (𝐞𝟏,𝐞𝟐)subscript𝐞1subscript𝐞2(\mathbf{e_{1}},\mathbf{e_{2}})( bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) be the canonical basis of 𝒯𝒯\mathcal{T}caligraphic_T, and (𝐮𝟏,𝐮𝟐)subscript𝐮1subscript𝐮2(\mathbf{u_{1}},\mathbf{u_{2}})( bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) be orthonormal basis vectors such that |H𝐮1(𝐭)H𝐮2(𝐭)|>0subscript𝐻subscript𝐮1𝐭subscript𝐻subscript𝐮2𝐭0|H_{\mathbf{u}_{1}}(\mathbf{t})-H_{\mathbf{u}_{2}}(\mathbf{t})|>0| italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ) - italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ) | > 0. Let α(𝐭)[0,π)𝛼𝐭0𝜋\alpha(\mathbf{t})\in[0,\pi)italic_α ( bold_t ) ∈ [ 0 , italic_π ) be the solution of the problem (5). Denote H1(𝐭)=H𝐮𝟏(𝐭)subscript𝐻1𝐭subscript𝐻subscript𝐮1𝐭H_{1}(\mathbf{t})=H_{\mathbf{u_{1}}}(\mathbf{t})italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) = italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ) and H2(𝐭)=H𝐮𝟐(𝐭)subscript𝐻2𝐭subscript𝐻subscript𝐮2𝐭H_{2}(\mathbf{t})=H_{\mathbf{u_{2}}}(\mathbf{t})italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) = italic_H start_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t ). The estimating equation for α𝛼\alphaitalic_α is found in Proposition 1. Denote ab=min{a,b}𝑎𝑏𝑎𝑏a\wedge b=\min\{a,b\}italic_a ∧ italic_b = roman_min { italic_a , italic_b }, for a,b𝑎𝑏a,b\in\mathbb{R}italic_a , italic_b ∈ blackboard_R, and ab=max{a,b}𝑎𝑏𝑎𝑏a\vee b=\max\{a,b\}italic_a ∨ italic_b = roman_max { italic_a , italic_b }. Finally, we use #𝒮#𝒮\#\mathcal{S}# caligraphic_S to denote the cardinality of a finite set 𝒮𝒮\mathcal{S}caligraphic_S.

Proposition 1.

Suppose that 𝐮𝟏±𝐞𝐢subscript𝐮1plus-or-minussubscript𝐞𝐢\mathbf{u_{1}}\neq\pm\mathbf{e_{i}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT ≠ ± bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT, for i=1,2𝑖12i=1,2italic_i = 1 , 2. If a process X𝑋Xitalic_X satisfies (3) for any 𝐭𝒯𝐭𝒯\mathbf{t}\in\mathcal{T}bold_t ∈ caligraphic_T, then we have

g(α(𝐭))=(θ𝐞𝟐(𝐭,Δ)θ𝐞𝟏(𝐭,Δ))12H¯(𝐭){1+O(Δζ|H1(𝐭)H2(𝐭)|)},𝑔𝛼𝐭superscriptsubscript𝜃subscript𝐞2𝐭Δsubscript𝜃subscript𝐞1𝐭Δ12¯𝐻𝐭1𝑂superscriptΔ𝜁subscript𝐻1𝐭subscript𝐻2𝐭g\left(\alpha(\mathbf{t})\right)=\left(\frac{\theta_{\mathbf{e_{2}}}(\mathbf{t% },\Delta)}{\theta_{\mathbf{e_{1}}}(\mathbf{t},\Delta)}\right)^{\frac{1}{2% \underline{H}(\mathbf{t})}}\left\{1+O\left(\Delta^{\zeta\wedge|H_{1}(\mathbf{t% })-H_{2}(\mathbf{t})|}\right)\right\},italic_g ( italic_α ( bold_t ) ) = ( divide start_ARG italic_θ start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t , roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t , roman_Δ ) end_ARG ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 under¯ start_ARG italic_H end_ARG ( bold_t ) end_ARG end_POSTSUPERSCRIPT { 1 + italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ | italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) - italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) | end_POSTSUPERSCRIPT ) } , (14)

where g(𝐭)=|tan|𝟏{H1(𝐭)<H2(𝐭)}+|cot|𝟏{H1(𝐭)>H2(𝐭)}𝑔𝐭1subscript𝐻1𝐭subscript𝐻2𝐭1subscript𝐻1𝐭subscript𝐻2𝐭g(\mathbf{t})=|\tan|\mathbf{1}\{H_{1}(\mathbf{t})<H_{2}(\mathbf{t})\}+|\cot|% \mathbf{1}\{H_{1}(\mathbf{t})>H_{2}(\mathbf{t})\}italic_g ( bold_t ) = | roman_tan | bold_1 { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) } + | roman_cot | bold_1 { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) }, and H¯(𝐭)=H1(𝐭)H2(𝐭)¯𝐻𝐭subscript𝐻1𝐭subscript𝐻2𝐭\underline{H}(\mathbf{t})=H_{1}(\mathbf{t})\wedge H_{2}(\mathbf{t})under¯ start_ARG italic_H end_ARG ( bold_t ) = italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) ∧ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ).

The remainder term O(Δζ|H1(𝐭)H2(𝐭)|)𝑂superscriptΔ𝜁subscript𝐻1𝐭subscript𝐻2𝐭O\left(\Delta^{\zeta\wedge|H_{1}(\mathbf{t})-H_{2}(\mathbf{t})|}\right)italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ | italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_t ) - italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_t ) | end_POSTSUPERSCRIPT ) will be discussed in Section 3.3. Proposition 1 allows us to find a function of the angle by examining the ratios of mean-squared variations, as in (3), by searching solely along the canonical basis. In order to focus on the main idea of directional regularity, we will consider the case where H𝐻Hitalic_H and α𝛼\alphaitalic_α are constant over 𝒯𝒯\mathcal{T}caligraphic_T. In theory, the general case can be established by considering a local, pointwise study of our methodology. Hereafter, the notational dependence of H𝐻Hitalic_H and α𝛼\alphaitalic_α on 𝐭𝐭\mathbf{t}bold_t will sometimes be suppressed. When the directional regularity is constant over 𝒯𝒯\mathcal{T}caligraphic_T, more stable estimates can be obtained by averaging over the design points.

Due to the unique replication nature of function data, the quantities in (14) can be easily estimated. A natural plug-in estimator for the mean-squared variations is its empirical counterpart, given by

θˇ𝐞𝐢(𝐭,Δ)=1Nj=1N{X~(j)(𝐭(Δ/2)𝐞𝐢)X~(j)(𝐭+(Δ/2)𝐞𝐢)}2,subscriptˇ𝜃subscript𝐞𝐢𝐭Δ1𝑁superscriptsubscript𝑗1𝑁superscriptsuperscript~𝑋𝑗𝐭Δ2subscript𝐞𝐢superscript~𝑋𝑗𝐭Δ2subscript𝐞𝐢2\widecheck{\theta}_{\mathbf{e_{i}}}(\mathbf{t},\Delta)=\frac{1}{N}\sum_{j=1}^{% N}\left\{\widetilde{X}^{(j)}\left(\mathbf{t}-(\Delta/2)\mathbf{e_{i}}\right)-% \widetilde{X}^{(j)}\left(\mathbf{t}+(\Delta/2)\mathbf{e_{i}}\right)\right\}^{2},overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t , roman_Δ ) = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT { over~ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t - ( roman_Δ / 2 ) bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) - over~ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t + ( roman_Δ / 2 ) bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (15)

where X~(j)superscript~𝑋𝑗\widetilde{X}^{(j)}over~ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT denotes an observable approximation of X(j)superscript𝑋𝑗X^{(j)}italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT. Let 𝒯~~𝒯\widetilde{\mathcal{T}}over~ start_ARG caligraphic_T end_ARG be a finite approximation of 𝒯𝒯\mathcal{T}caligraphic_T with a deterministic number of grid points. Averaging over the design points, we obtain

θ^𝐞𝐢(Δ)=1#𝒯~𝐭𝒯~θˇ𝐞𝐢(𝐭,Δ)2σ^2,i=1,2,formulae-sequencesubscript^𝜃subscript𝐞𝐢Δ1#~𝒯subscript𝐭~𝒯subscriptˇ𝜃subscript𝐞𝐢𝐭Δ2superscript^𝜎2𝑖12\widehat{\theta}_{\mathbf{e_{i}}}(\Delta)=\frac{1}{\#\widetilde{\mathcal{T}}}% \sum_{\mathbf{t}\in\widetilde{\mathcal{T}}}\widecheck{\theta}_{\mathbf{e_{i}}}% (\mathbf{t},\Delta)-2\widehat{\sigma}^{2},\qquad i=1,2,over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) = divide start_ARG 1 end_ARG start_ARG # over~ start_ARG caligraphic_T end_ARG end_ARG ∑ start_POSTSUBSCRIPT bold_t ∈ over~ start_ARG caligraphic_T end_ARG end_POSTSUBSCRIPT overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_t , roman_Δ ) - 2 over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_i = 1 , 2 , (16)

where

σ^2=1M0m=1M012Nj=1N(Y(j)(𝐭m)Y(j)(𝐭m,1))2,superscript^𝜎21subscript𝑀0superscriptsubscript𝑚1subscript𝑀012𝑁superscriptsubscript𝑗1𝑁superscriptsuperscript𝑌𝑗subscript𝐭𝑚superscript𝑌𝑗subscript𝐭𝑚12\widehat{\sigma}^{2}=\frac{1}{M_{0}}\sum_{m=1}^{M_{0}}\frac{1}{2N}\sum_{j=1}^{% N}\left(Y^{(j)}(\mathbf{t}_{m})-Y^{(j)}(\mathbf{t}_{m,1})\right)^{2},over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m , 1 end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (17)

is an estimator of the noise term in (2), with 𝐭m,1subscript𝐭𝑚1\mathbf{t}_{m,1}bold_t start_POSTSUBSCRIPT italic_m , 1 end_POSTSUBSCRIPT denoting the closest observed point to 𝐭𝐦subscript𝐭𝐦\mathbf{t_{m}}bold_t start_POSTSUBSCRIPT bold_m end_POSTSUBSCRIPT. Since the mean-squared variations are associated to the true realizations of the process X𝑋Xitalic_X and not the observed, noisy version, denoising is important to obtain an estimator with good rates of convergence. Theoretical properties for the estimator of σ^2superscript^𝜎2\widehat{\sigma}^{2}over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT in (17) are provided in Section 4.

In principle, one can choose between a variety of methods to construct the approximations X~(j)superscript~𝑋𝑗\widetilde{X}^{(j)}over~ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT. Only a mild moment condition of the form R2(M0)M0νless-than-or-similar-tosubscript𝑅2subscript𝑀0superscriptsubscript𝑀0𝜈R_{2}(M_{0})\lesssim M_{0}^{-\nu}italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≲ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_ν end_POSTSUPERSCRIPT, for some ν>0𝜈0\nu>0italic_ν > 0 is required, where Rp(M0)=sup𝐭𝒯𝔼[|X~j(𝐭)Xj(𝐭)|p]subscript𝑅𝑝subscript𝑀0subscriptsupremum𝐭𝒯𝔼delimited-[]superscriptsubscript~𝑋𝑗𝐭subscript𝑋𝑗𝐭𝑝R_{p}(M_{0})=\sup_{\mathbf{t}\in\mathcal{T}}\mathbb{E}[|\widetilde{X}_{j}(% \mathbf{t})-X_{j}(\mathbf{t})|^{p}]italic_R start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = roman_sup start_POSTSUBSCRIPT bold_t ∈ caligraphic_T end_POSTSUBSCRIPT blackboard_E [ | over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_t ) - italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_t ) | start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ]. This is satisfied by many non-parametric smoothers, such as kernel smoothers and series expansions. See for instance Fan and Guerre, (2016) and Belloni et al., (2015). We will use nearest-neighbor interpolation, breaking ties by the lexicographic order, to construct X~(j)superscript~𝑋𝑗\widetilde{X}^{(j)}over~ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT since it is sufficient for our purposes, when coupled with a denoising step as in (16). Nearest-neighbor interpolation has the added advantage of requiring no tuning parameters, and is computationally efficient.

A recent line of work in regularity estimation within the context of FDA has emerged, which we can exploit for our purposes. See for example (Golovkine et al., , 2022), Golovkine et al., (2023), Wang et al., (2023), Maissoro et al., (2024), Kassi et al., (2023), and Shen and Hsing, (2020). We will adopt the multivariate version of Kassi et al., (2023), where the minimum regularity can be estimated as

H¯^(Δ)={mini=1,2log(θ^𝐞𝐢(2Δ))log(θ^𝐞𝐢(Δ))2log(2)ifθ^𝐞𝐢(2Δ),θ^𝐞𝐢(Δ)>0,1otherwise.^¯𝐻Δcasessubscript𝑖12subscript^𝜃subscript𝐞𝐢2Δsubscript^𝜃subscript𝐞𝐢Δ22ifsubscript^𝜃subscript𝐞𝐢2Δsubscript^𝜃subscript𝐞𝐢Δ0otherwise1otherwiseotherwise\widehat{\underline{H}}(\Delta)=\begin{cases}\min_{i=1,2}\frac{\log(\widehat{% \theta}_{\mathbf{e_{i}}}(2\Delta))-\log(\widehat{\theta}_{\mathbf{e_{i}}}(% \Delta))}{2\log(2)}\qquad\text{if}\qquad\widehat{\theta}_{\mathbf{e_{i}}}(2% \Delta),\widehat{\theta}_{\mathbf{e_{i}}}(\Delta)>0,\\ 1\qquad\text{otherwise}.\end{cases}over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG ( roman_Δ ) = { start_ROW start_CELL roman_min start_POSTSUBSCRIPT italic_i = 1 , 2 end_POSTSUBSCRIPT divide start_ARG roman_log ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 2 roman_Δ ) ) - roman_log ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) ) end_ARG start_ARG 2 roman_log ( 2 ) end_ARG if over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 2 roman_Δ ) , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) > 0 , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 1 otherwise . end_CELL start_CELL end_CELL end_ROW (18)

The regularity estimator in (18) is a slight adaptation of Kassi et al., (2023) by taking the minimum over the index of the basis vectors. This is motivated by Lemma 1, which says that at least one of the two vectors gives us the smallest regularity for the process X𝑋Xitalic_X. Collecting (16), (17) and (18), a plug-in estimator of (14) is given by

g^(α;Δ)=(θ^𝐞𝟐(Δ)𝟙θ^𝐞𝟐(Δ)>0+𝟙θ^𝐞𝟐(Δ)0θ^𝐞𝟏(Δ)𝟙θ^𝐞𝟏(Δ)>0+𝟙θ^𝐞𝟏(Δ)0)1/(2H¯^(Δ)).^𝑔𝛼Δsuperscriptsubscript^𝜃subscript𝐞2Δsubscript1subscript^𝜃subscript𝐞2Δ0subscript1subscript^𝜃subscript𝐞2Δ0subscript^𝜃subscript𝐞1Δsubscript1subscript^𝜃subscript𝐞1Δ0subscript1subscript^𝜃subscript𝐞1Δ012^¯𝐻Δ\widehat{g}(\alpha;\Delta)=\left(\frac{\widehat{\theta}_{\mathbf{e_{2}}}(% \Delta)\mathbbm{1}_{\widehat{\theta}_{\mathbf{e_{2}}}(\Delta)>0}+\mathbbm{1}_{% \widehat{\theta}_{\mathbf{e_{2}}}(\Delta)\leq 0}}{\widehat{\theta}_{\mathbf{e_% {1}}}(\Delta)\mathbbm{1}_{\widehat{\theta}_{\mathbf{e_{1}}}(\Delta)>0}+% \mathbbm{1}_{\widehat{\theta}_{\mathbf{e_{1}}}(\Delta)\leq 0}}\right)^{1/(2% \widehat{\underline{H}}(\Delta))}.over^ start_ARG italic_g end_ARG ( italic_α ; roman_Δ ) = ( divide start_ARG over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) blackboard_1 start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) > 0 end_POSTSUBSCRIPT + blackboard_1 start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) ≤ 0 end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) blackboard_1 start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) > 0 end_POSTSUBSCRIPT + blackboard_1 start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) ≤ 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / ( 2 over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG ( roman_Δ ) ) end_POSTSUPERSCRIPT . (19)

By construction, the only relevant tuning parameter in (19) is the spacing ΔΔ\Deltaroman_Δ. We address this issue in Section 3.2, immediately after Algorithm 1 is presented.

3.2 Identification issues

The estimating equation in (14) reveals two identification problems. The first issue is related to the function g𝑔gitalic_g, which can either be the tangent (tan\tanroman_tan) or cotangent (cot\cotroman_cot) function depending on the ordinal nature of H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. This reflects the phenomenon that the dominating term arising from the ratio of mean squared variations differ, depending on whether H1>H2subscript𝐻1subscript𝐻2H_{1}>H_{2}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. However, this ordinality is unknown a priori.

The second issue is associated to the absolute value on the LHS of (14). By using the estimator in (19), one obtains either g(α)𝑔𝛼g(\alpha)italic_g ( italic_α ) or g(πα)𝑔𝜋𝛼g(\pi-\alpha)italic_g ( italic_π - italic_α ). Similarly, the sign is unknown a prior to the statistician. Fortunately, an identification procedure can be constructed to resolve these two issues.

Let 𝐮𝐮\mathbf{u}bold_u be a unit vector represented in the canonical basis as 𝐮(β)=cos(β)𝐞𝟏+sin(β)𝐞𝟐𝐮𝛽𝛽subscript𝐞1𝛽subscript𝐞2\mathbf{u}(\beta)=\cos(\beta)\mathbf{e_{1}}+\sin(\beta)\mathbf{e_{2}}bold_u ( italic_β ) = roman_cos ( italic_β ) bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + roman_sin ( italic_β ) bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT. By construction, the regularity H𝐮(α)subscript𝐻𝐮𝛼H_{\mathbf{u}(\alpha)}italic_H start_POSTSUBSCRIPT bold_u ( italic_α ) end_POSTSUBSCRIPT is maximal when built using the angle α𝛼\alphaitalic_α in (14). The true value of α𝛼\alphaitalic_α can thus be identified as

α=argmaxβ{γ,πγ,π/2γ,π/2+γ}H𝐮(β),𝛼subscript𝛽𝛾𝜋𝛾𝜋2𝛾𝜋2𝛾subscript𝐻𝐮𝛽\alpha={\arg\max}_{\beta\in\left\{\gamma,\pi-\gamma,\pi/2-\gamma,\pi/2+\gamma% \right\}}H_{\mathbf{u(\beta)}},italic_α = roman_arg roman_max start_POSTSUBSCRIPT italic_β ∈ { italic_γ , italic_π - italic_γ , italic_π / 2 - italic_γ , italic_π / 2 + italic_γ } end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT , (20)

where

γγ(Δ)=arccot(θ𝐞𝟐(Δ)θ𝐞𝟏(Δ))12H¯(Δ).𝛾𝛾Δarccotsuperscriptsubscript𝜃subscript𝐞2Δsubscript𝜃subscript𝐞1Δ12¯𝐻Δ\gamma\approx\gamma(\Delta)=\text{arccot}\left(\frac{\theta_{\mathbf{e_{2}}}(% \Delta)}{\theta_{\mathbf{e_{1}}}(\Delta)}\right)^{\frac{1}{2\underline{H}(% \Delta)}}.italic_γ ≈ italic_γ ( roman_Δ ) = arccot ( divide start_ARG italic_θ start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ ) end_ARG ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 under¯ start_ARG italic_H end_ARG ( roman_Δ ) end_ARG end_POSTSUPERSCRIPT . (21)

The angles γ𝛾\gammaitalic_γ are computed by applying the relevant inverse maps to (19). An estimate of the regularity H𝐻Hitalic_H along an arbitrary direction 𝐮(β)𝐮𝛽\mathbf{u}(\beta)bold_u ( italic_β ) can be obtained by

H^𝐮(β)(Δ)={log(θ^𝐮(β)(2Δ))log(θ^𝐮(β)(Δ))2log(2)ifθ^𝐮(β)(2Δ)θ^𝐮(β)(Δ)>0,1otherwise.subscript^𝐻𝐮𝛽Δcasessubscript^𝜃𝐮𝛽2Δsubscript^𝜃𝐮𝛽Δ22ifsubscript^𝜃𝐮𝛽2Δsubscript^𝜃𝐮𝛽Δ0otherwise1otherwiseotherwise\widehat{H}_{\mathbf{u}(\beta)}(\Delta)=\begin{cases}\frac{\log(\widehat{% \theta}_{\mathbf{u}(\beta)}(2\Delta))-\log(\widehat{\theta}_{\mathbf{u}(\beta)% }(\Delta))}{2\log(2)}\qquad\text{if}\qquad\widehat{\theta}_{\mathbf{u}(\beta)}% (2\Delta)\geq\widehat{\theta}_{\mathbf{u}(\beta)}(\Delta)>0,\\ 1\qquad\text{otherwise}.\end{cases}over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) = { start_ROW start_CELL divide start_ARG roman_log ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( 2 roman_Δ ) ) - roman_log ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) ) end_ARG start_ARG 2 roman_log ( 2 ) end_ARG if over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( 2 roman_Δ ) ≥ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) > 0 , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 1 otherwise . end_CELL start_CELL end_CELL end_ROW (22)

In order to obtain more robust estimates H^𝐮(β)subscript^𝐻𝐮𝛽\widehat{H}_{\mathbf{u}(\beta)}over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT with respect to the spacing ΔΔ\Deltaroman_Δ, we propose to compute H^𝐮(β)subscript^𝐻𝐮𝛽\widehat{H}_{\mathbf{u}(\beta)}over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT over a grid of points 𝚫=(Δ1,,ΔK0)𝚫superscriptsubscriptΔ1subscriptΔsubscript𝐾0top\mathbf{\Delta}=(\Delta_{1},\dots,\Delta_{K_{0}})^{\top}bold_Δ = ( roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_Δ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. The angle which maximizes the sum over 𝚫𝚫\mathbf{\Delta}bold_Δ is then selected, yielding the angle

α^=argmaxβ{γ^,πγ^,π/2γ^,π/2+γ^}k=1K0H^𝐮(β)(Δk).^𝛼subscript𝛽^𝛾𝜋^𝛾𝜋2^𝛾𝜋2^𝛾superscriptsubscript𝑘1subscript𝐾0subscript^𝐻𝐮𝛽subscriptΔ𝑘\widehat{\alpha}=\arg\max_{\beta\in\left\{\widehat{\gamma},\pi-\widehat{\gamma% },\pi/2-\widehat{\gamma},\pi/2+\widehat{\gamma}\right\}}\sum_{k=1}^{K_{0}}% \widehat{H}_{\mathbf{u}(\beta)}(\Delta_{k}).over^ start_ARG italic_α end_ARG = roman_arg roman_max start_POSTSUBSCRIPT italic_β ∈ { over^ start_ARG italic_γ end_ARG , italic_π - over^ start_ARG italic_γ end_ARG , italic_π / 2 - over^ start_ARG italic_γ end_ARG , italic_π / 2 + over^ start_ARG italic_γ end_ARG } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (23)

As seen from Theorem 1, the spacing ΔΔ\Deltaroman_Δ used for identification should be chosen such that it is slower in rate compared to the one used for estimation, providing additional justification for (23). The estimation and identification procedure is summarized in Algorithm 1.

Algorithm 1 Estimation and identification of α𝛼\alphaitalic_α
1:Data (Y(j)(𝐭m),𝐭m)superscript𝑌𝑗subscript𝐭𝑚subscript𝐭𝑚(Y^{(j)}(\mathbf{t}_{m}),\mathbf{t}_{m})( italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ), Grid 𝒯~={𝐭𝟏,,𝐭𝐩}~𝒯subscript𝐭1subscript𝐭𝐩\widetilde{\mathcal{T}}=\left\{\mathbf{t_{1}},\dots,\mathbf{t_{p}}\right\}over~ start_ARG caligraphic_T end_ARG = { bold_t start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , … , bold_t start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT }; Initialize H^𝐮(β)subscript^𝐻𝐮𝛽\widehat{H}_{\mathbf{u}(\beta)}\leftarrow\emptysetover^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ← ∅;
2:for ΔksubscriptΔ𝑘\Delta_{k}roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in 𝚫𝚫\mathbf{\Delta}bold_Δ do
3:     Compute g^(α,Δk)^𝑔𝛼subscriptΔ𝑘\widehat{g}(\alpha,\Delta_{k})over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) according to (19);
4:     α^tanarctan(g^(α,Δk))superscript^𝛼𝑡𝑎𝑛^𝑔𝛼subscriptΔ𝑘\widehat{\alpha}^{tan}\leftarrow\arctan\left(\widehat{g}(\alpha,\Delta_{k})\right)over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_t italic_a italic_n end_POSTSUPERSCRIPT ← roman_arctan ( over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) );
5:     α^cotarccot(g^(α,Δk))superscript^𝛼𝑐𝑜𝑡arccot^𝑔𝛼subscriptΔ𝑘\widehat{\alpha}^{cot}\leftarrow\text{arccot}\left(\widehat{g}(\alpha,\Delta_{% k})\right)over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_c italic_o italic_t end_POSTSUPERSCRIPT ← arccot ( over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) );
6:     𝐮(β)(cos(β),sin(β))𝐮𝛽superscript𝛽𝛽top\mathbf{u}(\beta)\leftarrow(\cos(\beta),\sin(\beta))^{\top}bold_u ( italic_β ) ← ( roman_cos ( italic_β ) , roman_sin ( italic_β ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT;\triangleright β{α^tan,α^cot,πα^tan,πα^cot}for-all𝛽superscript^𝛼𝑡𝑎𝑛superscript^𝛼𝑐𝑜𝑡𝜋superscript^𝛼𝑡𝑎𝑛𝜋superscript^𝛼𝑐𝑜𝑡\forall\beta\in\{\widehat{\alpha}^{tan},\widehat{\alpha}^{cot},\pi-\widehat{% \alpha}^{tan},\pi-\widehat{\alpha}^{cot}\}∀ italic_β ∈ { over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_t italic_a italic_n end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_c italic_o italic_t end_POSTSUPERSCRIPT , italic_π - over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_t italic_a italic_n end_POSTSUPERSCRIPT , italic_π - over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT italic_c italic_o italic_t end_POSTSUPERSCRIPT };
7:     Compute H^𝐮(β)(Δk)subscript^𝐻𝐮𝛽subscriptΔ𝑘\widehat{H}_{\mathbf{u}(\beta)}(\Delta_{k})over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) according to (22);
8:     H^𝐮(β)H^𝐮(β)H^𝐮(β)(Δ)subscript^𝐻𝐮𝛽subscript^𝐻𝐮𝛽subscript^𝐻𝐮𝛽Δ\widehat{H}_{\mathbf{u}(\beta)}\leftarrow\widehat{H}_{\mathbf{u}(\beta)}% \bigcup\widehat{H}_{\mathbf{u}(\beta)}(\Delta)over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ← over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ⋃ over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ );
9:end forCompute α^^𝛼\widehat{\alpha}over^ start_ARG italic_α end_ARG according to (23);
10:return α^^𝛼\widehat{\alpha}over^ start_ARG italic_α end_ARG

In the estimation of g^(α,Δ)^𝑔𝛼Δ\widehat{g}(\alpha,\Delta)over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ ), it is sufficient to choose a fixed spacing ΔΔ\Deltaroman_Δ. In order to ensure that the nearest-neighbor of 𝐭+(Δ/2)𝐞𝐢𝐭Δ2subscript𝐞𝐢\mathbf{t}+(\Delta/2)\mathbf{e_{i}}bold_t + ( roman_Δ / 2 ) bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT is distinct to the nearest-neighbor of 𝐭(Δ/2)𝐞𝐢𝐭Δ2subscript𝐞𝐢\mathbf{t}-(\Delta/2)\mathbf{e_{i}}bold_t - ( roman_Δ / 2 ) bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT when computing θ𝐞𝐢subscript𝜃subscript𝐞𝐢\theta_{\mathbf{e_{i}}}italic_θ start_POSTSUBSCRIPT bold_e start_POSTSUBSCRIPT bold_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT in g^(α,Δ)^𝑔𝛼Δ\widehat{g}(\alpha,\Delta)over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ ), ΔΔ\Deltaroman_Δ should be chosen at least as large as (2M0)1/2superscript2subscript𝑀012(2M_{0})^{-1/2}( 2 italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT. However, as seen from the theoretical results in Section 4, we need ΔΔ\Deltaroman_Δ to be strictly larger than the minimum size required to capture one point in order to obtain asymptotic concentration. In the non-asymptotic setting, we propose a conservative choice of Δ=M01/4Δsuperscriptsubscript𝑀014\Delta=M_{0}^{-1/4}roman_Δ = italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT. Simulation results in Section 5 confirm that this choice works well in practice.

3.3 Correction for the remainder term

Let ~(α)=~(α,ζ)=O(Δζ|H1H2|)~𝛼~𝛼𝜁𝑂superscriptΔ𝜁subscript𝐻1subscript𝐻2\widetilde{\mathcal{F}}(\alpha)=\widetilde{\mathcal{F}}(\alpha,\zeta)=O\left(% \Delta^{\zeta\wedge|H_{1}-H_{2}|}\right)over~ start_ARG caligraphic_F end_ARG ( italic_α ) = over~ start_ARG caligraphic_F end_ARG ( italic_α , italic_ζ ) = italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ | italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT ) denote the remainder term in (14). It turns out that \mathcal{F}caligraphic_F can become big enough to affect the estimation of g(α)𝑔𝛼g(\alpha)italic_g ( italic_α ) through the constants, which is more pronounced for certain angles α𝛼\alphaitalic_α. Let L𝐮¯subscript𝐿¯𝐮L_{\overline{\mathbf{u}}}italic_L start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT be the local Hölder constant in the direction of the maximizing regularity H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG, and let L𝐮¯subscript𝐿¯𝐮L_{\underline{\mathbf{u}}}italic_L start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT be the local Hölder constant in the orthogonal direction of 𝐮¯¯𝐮\overline{\mathbf{u}}over¯ start_ARG bold_u end_ARG. The explicit form of the remainder term is given by

~(α)=(α)+O(Δζ1)=(1+num1+denom)12H¯+O(Δζ1),~𝛼𝛼𝑂superscriptΔ𝜁1superscript1subscript𝑛𝑢𝑚1subscript𝑑𝑒𝑛𝑜𝑚12¯𝐻𝑂superscriptΔ𝜁1\widetilde{\mathcal{F}}(\alpha)=\mathcal{F}(\alpha)+O\left(\Delta^{\zeta\wedge 1% }\right)=\left(\frac{1+\mathcal{F}_{num}}{1+\mathcal{F}_{denom}}\right)^{\frac% {1}{2\underline{H}}}+O\left(\Delta^{\zeta\wedge 1}\right),over~ start_ARG caligraphic_F end_ARG ( italic_α ) = caligraphic_F ( italic_α ) + italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ 1 end_POSTSUPERSCRIPT ) = ( divide start_ARG 1 + caligraphic_F start_POSTSUBSCRIPT italic_n italic_u italic_m end_POSTSUBSCRIPT end_ARG start_ARG 1 + caligraphic_F start_POSTSUBSCRIPT italic_d italic_e italic_n italic_o italic_m end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 under¯ start_ARG italic_H end_ARG end_ARG end_POSTSUPERSCRIPT + italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ 1 end_POSTSUPERSCRIPT ) , (24)

where

num=1L𝐮¯(Δ)L𝐮¯(Δ)|cos(α)𝟙{H1<H2}+sin(α)𝟙{H1>H2}|2H¯Δ2H¯+R(Δ)|sin(α)𝟙{H1<H2}+cos(α)𝟙{H1>H2}|2H¯Δ2H¯,subscript𝑛𝑢𝑚1subscript𝐿¯𝐮Δsubscript𝐿¯𝐮Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscriptΔ2¯𝐻𝑅Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscriptΔ2¯𝐻\mathcal{F}_{num}=\frac{1}{L_{\underline{\mathbf{u}}}(\Delta)}\frac{L_{% \overline{\mathbf{u}}}(\Delta)\left|\cos(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+% \sin(\alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\overline{H}}\Delta^{2% \overline{H}}+R(\Delta)}{\left|\sin(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+\cos(% \alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\underline{H}}\Delta^{2% \underline{H}}},caligraphic_F start_POSTSUBSCRIPT italic_n italic_u italic_m end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG divide start_ARG italic_L start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) | roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT + italic_R ( roman_Δ ) end_ARG start_ARG | roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG , (25)

and

denom=1L𝐮¯(Δ)L𝐮¯(Δ)|sin(α)𝟙{H1<H2}+cos(α)𝟙{H1>H2}|2H¯Δ2H¯+R(Δ)|cos(α)𝟙{H1<H2}+sin(α)𝟙{H1>H2}|2H¯Δ2H¯,subscript𝑑𝑒𝑛𝑜𝑚1subscript𝐿¯𝐮Δsubscript𝐿¯𝐮Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscriptΔ2¯𝐻𝑅Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscriptΔ2¯𝐻\mathcal{F}_{denom}=\frac{1}{L_{\underline{\mathbf{u}}}(\Delta)}\frac{L_{% \overline{\mathbf{u}}}(\Delta)\left|\sin(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+% \cos(\alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\overline{H}}\Delta^{2% \overline{H}}+R(\Delta)}{\left|\cos(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+\sin(% \alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\underline{H}}\Delta^{2% \underline{H}}},caligraphic_F start_POSTSUBSCRIPT italic_d italic_e italic_n italic_o italic_m end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG divide start_ARG italic_L start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) | roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT + italic_R ( roman_Δ ) end_ARG start_ARG | roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG , (26)

and

R(Δ)=𝔼[{X(𝐭a1Δ2𝐮𝟏a2Δ2𝐮𝟐)X(𝐭+a1Δ2𝐮𝟏a2Δ2𝐮𝟐)}×{X(𝐭+a1Δ2𝐮𝟏a2Δ2𝐮𝟐)X(𝐭+a1Δ2𝐮𝟏+a2Δ2𝐮𝟐)}].𝑅Δ𝔼delimited-[]𝑋𝐭subscript𝑎1Δ2subscript𝐮1subscript𝑎2Δ2subscript𝐮2𝑋𝐭subscript𝑎1Δ2subscript𝐮1subscript𝑎2Δ2subscript𝐮2𝑋𝐭subscript𝑎1Δ2subscript𝐮1subscript𝑎2Δ2subscript𝐮2𝑋𝐭subscript𝑎1Δ2subscript𝐮1subscript𝑎2Δ2subscript𝐮2R(\Delta)=\mathbb{E}\left[\left\{X\left(\mathbf{t}-\frac{a_{1}\Delta}{2}% \mathbf{u_{1}}-\frac{a_{2}\Delta}{2}\mathbf{u_{2}}\right)-X\left(\mathbf{t}+% \frac{a_{1}\Delta}{2}\mathbf{u_{1}}-\frac{a_{2}\Delta}{2}\mathbf{u_{2}}\right)% \right\}\right.\\ \left.\times\left\{X\left(\mathbf{t}+\frac{a_{1}\Delta}{2}\mathbf{u_{1}}-\frac% {a_{2}\Delta}{2}\mathbf{u_{2}}\right)-X\left(\mathbf{t}+\frac{a_{1}\Delta}{2}% \mathbf{u_{1}}+\frac{a_{2}\Delta}{2}\mathbf{u_{2}}\right)\right\}\right].start_ROW start_CELL italic_R ( roman_Δ ) = blackboard_E [ { italic_X ( bold_t - divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) - italic_X ( bold_t + divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) } end_CELL end_ROW start_ROW start_CELL × { italic_X ( bold_t + divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) - italic_X ( bold_t + divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_Δ end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT ) } ] . end_CELL end_ROW (27)

It is easy to see from (24) that (π/4)=1𝜋41\mathcal{F}(\pi/4)=1caligraphic_F ( italic_π / 4 ) = 1, regardless of the ordinal nature of H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. This implies that when α=π/4𝛼𝜋4\alpha=\pi/4italic_α = italic_π / 4, the estimating equations in (14) are most accurate. However as α0𝛼0\alpha\rightarrow 0italic_α → 0 or απ/2𝛼𝜋2\alpha\rightarrow\pi/2italic_α → italic_π / 2, the estimating equations can be dramatically affected by this remainder term. A plot of (α)𝛼\mathcal{F}(\alpha)caligraphic_F ( italic_α ) can be seen in Figure 2.

Refer to caption
Figure 2: Plot showing the remainder term for different angles α𝛼\alphaitalic_α. (α)𝛼\mathcal{F}(\alpha)\rightarrow\inftycaligraphic_F ( italic_α ) → ∞ as απ/2𝛼𝜋2\alpha\rightarrow\pi/2italic_α → italic_π / 2. Going closer to the boundary 00 and π/2𝜋2\pi/2italic_π / 2 can dramatically affect the estimation of the true angles.

The remainder term R𝑅Ritalic_R is intrinsic to the process X𝑋Xitalic_X. It can be bounded by Cauchy-Schwarz inequality to obtain R=O(Δ|H1H2|)𝑅𝑂superscriptΔsubscript𝐻1subscript𝐻2R=O\left(\Delta^{|H_{1}-H_{2}|}\right)italic_R = italic_O ( roman_Δ start_POSTSUPERSCRIPT | italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT ). However, this is a pessimistic approach, since R𝑅Ritalic_R can be equal to zero, for processes such as (8). In order to adjust for this possibly non-negligible remainder term, the remainder terms R𝑅Ritalic_R and (α)𝛼\mathcal{F}(\alpha)caligraphic_F ( italic_α ) can be estimated, provided that 2(H¯H¯)<ζ12¯𝐻¯𝐻𝜁12(\overline{H}-\underline{H})<\zeta\wedge 12 ( over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG ) < italic_ζ ∧ 1. Since R𝑅Ritalic_R only involves expectations of the process, it can be estimated by (16). We now focus on the estimation of \mathcal{F}caligraphic_F.

Let θ𝐮¯subscript𝜃¯𝐮\theta_{\overline{\mathbf{u}}}italic_θ start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT and θ𝐮¯subscript𝜃¯𝐮\theta_{\underline{\mathbf{u}}}italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT be the mean-squared variations such that H𝐮¯=H¯subscript𝐻¯𝐮¯𝐻H_{\overline{\mathbf{u}}}=\overline{H}italic_H start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT = over¯ start_ARG italic_H end_ARG, and H𝐮¯=H¯subscript𝐻¯𝐮¯𝐻H_{\underline{\mathbf{u}}}=\underline{H}italic_H start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT = under¯ start_ARG italic_H end_ARG respectively. Since

L𝐮¯(Δ)L𝐮¯(Δ)ΔH¯H¯θ𝐮¯(Δ)θ𝐮¯(Δ),subscript𝐿¯𝐮Δsubscript𝐿¯𝐮ΔsuperscriptΔ¯𝐻¯𝐻subscript𝜃¯𝐮Δsubscript𝜃¯𝐮Δ\frac{L_{\overline{\mathbf{u}}}(\Delta)}{L_{\underline{\mathbf{u}}}(\Delta)}% \Delta^{\overline{H}-\underline{H}}\approx\frac{\theta_{\overline{\mathbf{u}}}% (\Delta)}{\theta_{\underline{\mathbf{u}}}(\Delta)},divide start_ARG italic_L start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG start_ARG italic_L start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG roman_Δ start_POSTSUPERSCRIPT over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT ≈ divide start_ARG italic_θ start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG , (28)

we have

numθ𝐮¯(Δ)θ𝐮¯(Δ)|cos(α)𝟙{H1<H2}+sin(α)𝟙{H1>H2}|2H¯|sin(α)𝟙{H1<H2}+cos(α)𝟙{H1>H2}|2H¯+R(Δ)θ𝐮¯(Δ),subscript𝑛𝑢𝑚subscript𝜃¯𝐮Δsubscript𝜃¯𝐮Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻𝑅Δsubscript𝜃¯𝐮Δ\mathcal{F}_{num}\approx\frac{\theta_{\overline{\mathbf{u}}}(\Delta)}{\theta_{% \underline{\mathbf{u}}}(\Delta)}\frac{\left|\cos(\alpha)\mathbbm{1}_{\{H_{1}<H% _{2}\}}+\sin(\alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\overline{H}}}{% \left|\sin(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+\cos(\alpha)\mathbbm{1}_{\{H_{% 1}>H_{2}\}}\right|^{2\underline{H}}}+\frac{R(\Delta)}{\theta_{\underline{% \mathbf{u}}}(\Delta)},caligraphic_F start_POSTSUBSCRIPT italic_n italic_u italic_m end_POSTSUBSCRIPT ≈ divide start_ARG italic_θ start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG divide start_ARG | roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG | roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_R ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG , (29)

and

denomθ𝐮¯(Δ)θ𝐮¯(Δ)|sin(α)𝟙{H1<H2}+cos(α)𝟙{H1>H2}|2H¯|cos(α)𝟙{H1<H2}+sin(α)𝟙{H1>H2}|2H¯+R(Δ)θ𝐮¯(Δ),subscript𝑑𝑒𝑛𝑜𝑚subscript𝜃¯𝐮Δsubscript𝜃¯𝐮Δsuperscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻superscript𝛼subscript1subscript𝐻1subscript𝐻2𝛼subscript1subscript𝐻1subscript𝐻22¯𝐻𝑅Δsubscript𝜃¯𝐮Δ\mathcal{F}_{denom}\approx\frac{\theta_{\overline{\mathbf{u}}}(\Delta)}{\theta% _{\underline{\mathbf{u}}}(\Delta)}\frac{\left|\sin(\alpha)\mathbbm{1}_{\{H_{1}% <H_{2}\}}+\cos(\alpha)\mathbbm{1}_{\{H_{1}>H_{2}\}}\right|^{2\overline{H}}}{% \left|\cos(\alpha)\mathbbm{1}_{\{H_{1}<H_{2}\}}+\sin(\alpha)\mathbbm{1}_{\{H_{% 1}>H_{2}\}}\right|^{2\underline{H}}}+\frac{R(\Delta)}{\theta_{\underline{% \mathbf{u}}}(\Delta)},caligraphic_F start_POSTSUBSCRIPT italic_d italic_e italic_n italic_o italic_m end_POSTSUBSCRIPT ≈ divide start_ARG italic_θ start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG divide start_ARG | roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG | roman_cos ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT + roman_sin ( italic_α ) blackboard_1 start_POSTSUBSCRIPT { italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_R ( roman_Δ ) end_ARG start_ARG italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT ( roman_Δ ) end_ARG , (30)

in the sense that the ratios of the LHS and RHS in (30) goes to 1 in the limit as Δ0Δ0\Delta\rightarrow 0roman_Δ → 0. ^(α)^𝛼\widehat{\mathcal{F}}(\alpha)over^ start_ARG caligraphic_F end_ARG ( italic_α ) can be computed by replacing θ𝐮¯subscript𝜃¯𝐮\theta_{\underline{\mathbf{u}}}italic_θ start_POSTSUBSCRIPT under¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT, θ𝐮¯subscript𝜃¯𝐮\theta_{\overline{\mathbf{u}}}italic_θ start_POSTSUBSCRIPT over¯ start_ARG bold_u end_ARG end_POSTSUBSCRIPT, and α𝛼\alphaitalic_α with their estimates obtained by Algorithm 1. An adjusted estimate of α𝛼\alphaitalic_α will then be given by

g^^(α;Δ)=g^(α;Δ)/(α^),^^𝑔𝛼Δ^𝑔𝛼Δ^𝛼\widehat{\widehat{g}}(\alpha;\Delta)=\widehat{g}(\alpha;\Delta)/\mathcal{F}(% \widehat{\alpha}),over^ start_ARG over^ start_ARG italic_g end_ARG end_ARG ( italic_α ; roman_Δ ) = over^ start_ARG italic_g end_ARG ( italic_α ; roman_Δ ) / caligraphic_F ( over^ start_ARG italic_α end_ARG ) , (31)

where g^(α,Δ)^𝑔𝛼Δ\widehat{g}(\alpha,\Delta)over^ start_ARG italic_g end_ARG ( italic_α , roman_Δ ) is computed by (19). The simulation results in Section 5 suggest that using the adjusted estimates in (31) yields significant better results when α𝛼\alphaitalic_α gets further away from π/4𝜋4\pi/4italic_π / 4. In order to avoid introducing greater dependence between quantities, we suggest to only compute the adjusted estimates once. Moreover, we suggest to set R=0𝑅0R=0italic_R = 0 in practice to decrease the computational load. This seems to produce good results , even when R0𝑅0R\neq 0italic_R ≠ 0, as seen from the simulation results in the Supplementary Material. Likewise, the final estimate α^^^^𝛼\widehat{\widehat{\alpha}}over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG can be obtained by applying the appropriate inverse function (either arctan\arctanroman_arctan or arccot) obtained by the identification process when computing the initial estimates α^^𝛼\widehat{\alpha}over^ start_ARG italic_α end_ARG. This not only saves computational time, but also results in valid inference, as opposed to repeating the identification process using α^^^^𝛼\widehat{\widehat{\alpha}}over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG.

4 Theoretical Properties

In the following, we provide non-asymptotic bounds for our main algorithms, as well as the auxiliary estimates. The proofs are provided in detail in the Supplementary Material. The following mild assumptions are imposed.

Assumptions.

  1. (H1)

    Let X𝑋Xitalic_X be anisotropic process with the two regularities (H1,H2)subscript𝐻1subscript𝐻2(H_{1},H_{2})( italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), where X(j)superscript𝑋𝑗X^{(j)}italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT, 1jN1𝑗𝑁1\leq j\leq N1 ≤ italic_j ≤ italic_N, are independent realizations of X𝑋Xitalic_X.

  2. (H2)

    The errror terms em(j)subscriptsuperscript𝑒𝑗𝑚e^{(j)}_{m}italic_e start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT in the data model equation (2) are iid. The random variables em(j)subscriptsuperscript𝑒𝑗𝑚e^{(j)}_{m}italic_e start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and X are independents

  3. (H3)

    Three positive constants 𝔞𝔞\mathfrak{a}fraktur_a, 𝔄𝔄\mathfrak{A}fraktur_A and r𝑟ritalic_r exist such that, for any 𝒕𝒯𝒕𝒯\boldsymbol{t}\in\mathcal{T}bold_italic_t ∈ caligraphic_T,

    𝔼|X(j)(𝒕)X(j)(𝒔)|2pp!2𝔞𝔄p2𝒕𝒔2pH¯𝒔B(𝒕;r),p1.formulae-sequence𝔼superscriptsuperscript𝑋𝑗𝒕superscript𝑋𝑗𝒔2𝑝𝑝2𝔞superscript𝔄𝑝2superscriptnorm𝒕𝒔2𝑝¯𝐻formulae-sequencefor-all𝒔𝐵𝒕𝑟for-all𝑝1\mathbb{E}\left|X^{(j)}\left(\boldsymbol{t}\right)-X^{(j)}\left(\boldsymbol{s}% \right)\right|^{2p}\leq\frac{p!}{2}\mathfrak{a}\mathfrak{A}^{p-2}\|\boldsymbol% {t}-\boldsymbol{s}\|^{2p\underline{H}}\qquad\forall\boldsymbol{s}\in B(% \boldsymbol{t};r),\;\forall p\geq 1.blackboard_E | italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_t ) - italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_s ) | start_POSTSUPERSCRIPT 2 italic_p end_POSTSUPERSCRIPT ≤ divide start_ARG italic_p ! end_ARG start_ARG 2 end_ARG fraktur_a fraktur_A start_POSTSUPERSCRIPT italic_p - 2 end_POSTSUPERSCRIPT ∥ bold_italic_t - bold_italic_s ∥ start_POSTSUPERSCRIPT 2 italic_p under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT ∀ bold_italic_s ∈ italic_B ( bold_italic_t ; italic_r ) , ∀ italic_p ≥ 1 .
  4. (H4)

    A constant 𝔊𝔊\mathfrak{G}fraktur_G exists such that

    𝔼(ε2p)p!2𝔊p2σ2,p1.formulae-sequence𝔼superscript𝜀2𝑝𝑝2superscript𝔊𝑝2superscript𝜎2for-all𝑝1\mathbb{E}(\varepsilon^{2p})\leq\frac{p!}{2}\mathfrak{G}^{p-2}\sigma^{2},% \qquad\forall p\geq 1.blackboard_E ( italic_ε start_POSTSUPERSCRIPT 2 italic_p end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_p ! end_ARG start_ARG 2 end_ARG fraktur_G start_POSTSUPERSCRIPT italic_p - 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∀ italic_p ≥ 1 . (32)
Remark 1.

Assumption (H3) states that X𝑋Xitalic_X has sub-Gaussian increments in a local sense, for every point 𝐭𝒯𝐭𝒯\mathbf{t}\in\mathcal{T}bold_t ∈ caligraphic_T. It is satisfied by the processes considered in the simulations, as well as those presented in Section 2.2. In fact, under Definition 1, Assumption (H3) is equivalent to

𝔼|X(j)(𝒕)X(j)(𝒔)|2pp!2𝔞𝔄p2𝔼[|X(j)(𝒕)X(j)(𝒔)|2]p,𝔼superscriptsuperscript𝑋𝑗𝒕superscript𝑋𝑗𝒔2𝑝𝑝2𝔞superscript𝔄𝑝2𝔼superscriptdelimited-[]superscriptsuperscript𝑋𝑗𝒕superscript𝑋𝑗𝒔2𝑝\mathbb{E}\left|X^{(j)}\left(\boldsymbol{t}\right)-X^{(j)}\left(\boldsymbol{s}% \right)\right|^{2p}\leq\frac{p!}{2}\mathfrak{a}\mathfrak{A}^{p-2}\mathbb{E}% \left[\left|X^{(j)}\left(\boldsymbol{t}\right)-X^{(j)}\left(\boldsymbol{s}% \right)\right|^{2}\right]^{p},blackboard_E | italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_t ) - italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_s ) | start_POSTSUPERSCRIPT 2 italic_p end_POSTSUPERSCRIPT ≤ divide start_ARG italic_p ! end_ARG start_ARG 2 end_ARG fraktur_a fraktur_A start_POSTSUPERSCRIPT italic_p - 2 end_POSTSUPERSCRIPT blackboard_E [ | italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_t ) - italic_X start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_italic_s ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ,

an assumption which is more familiar and widely used in the literature.

Let H𝐮(β)(Δ)subscript𝐻𝐮𝛽ΔH_{\mathbf{u}(\beta)}(\Delta)italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) denote a “proxy" of the directional regularity in (22), given by

H𝐮(β)(Δ)=log(θ𝐮(β)(2Δ))log(θ𝐮(β)(Δ))2log(2).subscript𝐻𝐮𝛽Δsubscript𝜃𝐮𝛽2Δsubscript𝜃𝐮𝛽Δ22H_{\mathbf{u}(\beta)}(\Delta)=\frac{\log\left(\theta_{\mathbf{u}(\beta)}(2% \Delta)\right)-\log\left(\theta_{\mathbf{u}(\beta)}(\Delta)\right)}{2\log(2)}.italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) = divide start_ARG roman_log ( italic_θ start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( 2 roman_Δ ) ) - roman_log ( italic_θ start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) ) end_ARG start_ARG 2 roman_log ( 2 ) end_ARG . (33)

Whenever we write H𝐮(β)(Δ)subscript𝐻𝐮𝛽ΔH_{\mathbf{u}(\beta)}(\Delta)italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) with an explicit dependence on ΔΔ\Deltaroman_Δ, or mention the proxy, we are referring to the quantity in (33). We start by providing a bound for this quantity for different angles, which provides us with important insight into the behavior of the directional regularity in the “real world", where ΔΔ\Deltaroman_Δ cannot be infinitesimally small. See Section 6 for a more detailed discussion. Recall that we are working with G(𝐭,Δ)=O(ΔH𝐮(β)+ζ)𝐺𝐭Δ𝑂superscriptΔsubscript𝐻𝐮𝛽𝜁G(\mathbf{t},\Delta)=O(\Delta^{H_{\mathbf{u}(\beta)}+\zeta})italic_G ( bold_t , roman_Δ ) = italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT + italic_ζ end_POSTSUPERSCRIPT ); see Definition 1.

Proposition 2.

We have

|H𝐮(β)H𝐮(β)(Δ)|=O(Δζ).subscript𝐻𝐮𝛽subscript𝐻𝐮𝛽Δ𝑂superscriptΔ𝜁\left|H_{\mathbf{u}(\beta)}-H_{\mathbf{u}(\beta)}(\Delta)\right|=O\left(\Delta% ^{\zeta}\right).| italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) | = italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT ) . (34)

Suppose that Assumption (H3) holds. Then for any ΔΔ\Deltaroman_Δ sufficiently small, and for any pair of angles (β1,β2)[0,2π]subscript𝛽1subscript𝛽202𝜋(\beta_{1},\beta_{2})\in[0,2\pi]( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∈ [ 0 , 2 italic_π ], the following bound holds:

|H𝐮(β1)(Δ)H𝐮(β2)(Δ)|subscript𝐻𝐮subscript𝛽1Δsubscript𝐻𝐮subscript𝛽2Δabsent\displaystyle\left|H_{\mathbf{u}(\beta_{1})}(\Delta)-H_{\mathbf{u}(\beta_{2})}% (\Delta)\right|\leq| italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( roman_Δ ) - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( roman_Δ ) | ≤ 𝔞{21+H¯2H𝐮(β2)L𝐮(β2)(𝐭)Δ2(H¯H𝐮(β2))+21H¯L𝐮(β1)(𝐭)Δ2(H¯H𝐮(β1))}𝔞superscript21¯𝐻2subscript𝐻𝐮subscript𝛽2subscript𝐿𝐮subscript𝛽2𝐭superscriptΔ2¯𝐻subscript𝐻𝐮subscript𝛽2superscript21¯𝐻subscript𝐿𝐮subscript𝛽1𝐭superscriptΔ2¯𝐻subscript𝐻𝐮subscript𝛽1\displaystyle\mathfrak{a}\left\{\frac{2^{1+\underline{H}-2H_{\mathbf{u}(\beta_% {2})}}}{L_{\mathbf{u}(\beta_{2})}(\mathbf{t})}\Delta^{2(\underline{H}-H_{% \mathbf{u}(\beta_{2})})}+\frac{2^{1-\underline{H}}}{L_{\mathbf{u}(\beta_{1})}(% \mathbf{t})}\Delta^{2(\underline{H}-H_{\mathbf{u}(\beta_{1})})}\right\}fraktur_a { divide start_ARG 2 start_POSTSUPERSCRIPT 1 + under¯ start_ARG italic_H end_ARG - 2 italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_L start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( bold_t ) end_ARG roman_Δ start_POSTSUPERSCRIPT 2 ( under¯ start_ARG italic_H end_ARG - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + divide start_ARG 2 start_POSTSUPERSCRIPT 1 - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG italic_L start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( bold_t ) end_ARG roman_Δ start_POSTSUPERSCRIPT 2 ( under¯ start_ARG italic_H end_ARG - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT } (35)
×\displaystyle\times× (β1β2)2H¯×{1+O(Δζ)}.superscriptsubscript𝛽1subscript𝛽22¯𝐻1𝑂superscriptΔ𝜁\displaystyle(\beta_{1}-\beta_{2})^{2\underline{H}}\times\{1+O\left(\Delta^{% \zeta}\right)\}.( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT × { 1 + italic_O ( roman_Δ start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT ) } . (36)

Proposition 3 is a critical ingredient in allowing us to derive rates of convergence associated to the identification process in Algorithm 1.

Proposition 3.

Assume that (H1), (H2), (H3) and (H32) hold true. Recall that H¯=H1H2¯𝐻subscript𝐻1subscript𝐻2\underline{H}=H_{1}\wedge H_{2}under¯ start_ARG italic_H end_ARG = italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and H¯=H1H2¯𝐻subscript𝐻1subscript𝐻2\overline{H}=H_{1}\vee H_{2}over¯ start_ARG italic_H end_ARG = italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∨ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. For any η𝜂\etaitalic_η which satisfies

η4Δ2H¯𝔞L¯×{(2)H¯{(2)2H¯M012H¯+2ΔH¯}M012H¯+M0H¯},𝜂4superscriptΔ2¯𝐻𝔞¯𝐿superscript2¯𝐻superscript22¯𝐻superscriptsubscript𝑀012¯𝐻2superscriptΔ¯𝐻superscriptsubscript𝑀012¯𝐻superscriptsubscript𝑀0¯𝐻\eta\geq\frac{4\Delta^{-2\overline{H}}\mathfrak{a}}{\underline{L}}\times\left% \{\left(\sqrt{2}\right)^{-\underline{H}}\left\{\left(\sqrt{2}\right)^{2-% \underline{H}}M_{0}^{-\frac{1}{2}\underline{H}}+2\Delta^{\underline{H}}\right% \}M_{0}^{-\frac{1}{2}\underline{H}}+M_{0}^{-\underline{H}}\right\},italic_η ≥ divide start_ARG 4 roman_Δ start_POSTSUPERSCRIPT - 2 over¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT fraktur_a end_ARG start_ARG under¯ start_ARG italic_L end_ARG end_ARG × { ( square-root start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT { ( square-root start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT 2 - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT + 2 roman_Δ start_POSTSUPERSCRIPT under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT } italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT + italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT } , (37)

we have for any r[0,H¯)𝑟0¯𝐻r\in[0,\underline{H})italic_r ∈ [ 0 , under¯ start_ARG italic_H end_ARG )

(supβ[0,2π)|H^𝐮(β)(Δ)H𝐮(β)(Δ)|2η)subscriptsupremum𝛽02𝜋subscript^𝐻𝐮𝛽Δsubscript𝐻𝐮𝛽Δ2𝜂absent\displaystyle\mathbb{P}\left(\sup_{\beta\in[0,2\pi)}|\widehat{H}_{\mathbf{u}(% \beta)}(\Delta)-H_{\mathbf{u}(\beta)}(\Delta)|\geq 2\eta\right)\leqblackboard_P ( roman_sup start_POSTSUBSCRIPT italic_β ∈ [ 0 , 2 italic_π ) end_POSTSUBSCRIPT | over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) - italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) | ≥ 2 italic_η ) ≤ 4exp(14×η2NΔ4H¯2𝔟+𝔅ηΔ2H¯)414superscript𝜂2𝑁superscriptΔ4¯𝐻2𝔟𝔅𝜂superscriptΔ2¯𝐻\displaystyle 4\exp\left(-\frac{1}{4}\times\frac{\eta^{2}N\Delta^{4\underline{% H}}}{2\mathfrak{b}+\mathfrak{B}\eta\Delta^{2\underline{H}}}\right)4 roman_exp ( - divide start_ARG 1 end_ARG start_ARG 4 end_ARG × divide start_ARG italic_η start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N roman_Δ start_POSTSUPERSCRIPT 4 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG 2 fraktur_b + fraktur_B italic_η roman_Δ start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG ) (38)
+\displaystyle++ 4exp(18×η2Δ4H¯N8𝔞1+𝔄1ηΔ2H¯),418superscript𝜂2superscriptΔ4¯𝐻𝑁8subscript𝔞1subscript𝔄1𝜂superscriptΔ2¯𝐻\displaystyle 4\exp\left(-\frac{1}{8}\times\frac{\eta^{2}\Delta^{4\underline{H% }}N}{8\mathfrak{a}_{1}+\mathfrak{A}_{1}\eta\Delta^{2\underline{H}}}\right),4 roman_exp ( - divide start_ARG 1 end_ARG start_ARG 8 end_ARG × divide start_ARG italic_η start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 4 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT italic_N end_ARG start_ARG 8 fraktur_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + fraktur_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_η roman_Δ start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG ) , (39)

where

𝔄1=4max{𝔄(2M0)H¯,4𝔊},𝔞1=𝔞(2M0)2H¯+24σ2,formulae-sequencesubscript𝔄14𝔄superscript2subscript𝑀0¯𝐻4𝔊subscript𝔞1𝔞superscript2subscript𝑀02¯𝐻superscript24superscript𝜎2\mathfrak{A}_{1}=4\max\left\{\frac{\mathfrak{A}}{(2M_{0})^{\underline{H}}},4% \mathfrak{G}\right\},\qquad\mathfrak{a}_{1}=\frac{\mathfrak{a}}{(2M_{0})^{2% \underline{H}}}+2^{4}\sigma^{2},fraktur_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 4 roman_max { divide start_ARG fraktur_A end_ARG start_ARG ( 2 italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG , 4 fraktur_G } , fraktur_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG fraktur_a end_ARG start_ARG ( 2 italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG + 2 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

and

𝔅=(22H¯𝔄log(2))2(H¯r)3{2M01/2+Δ}2r,𝔟=𝔅2(𝔞2𝔄2)H¯r1H¯+r.formulae-sequence𝔅superscript2superscript2¯𝐻𝔄22superscript¯𝐻𝑟3superscript2superscriptsubscript𝑀012Δ2𝑟𝔟superscript𝔅2superscript𝔞2superscript𝔄2¯𝐻𝑟1¯𝐻𝑟\mathfrak{B}=\left(\frac{\sqrt{2}2^{\underline{H}}\mathfrak{A}}{\log(2)}\right% )^{2}(\underline{H}-r)^{-3}\left\{\sqrt{2}M_{0}^{-1/2}+\Delta\right\}^{2r},% \qquad\mathfrak{b}=\mathfrak{B}^{2}\left(\frac{\mathfrak{a}}{2\mathfrak{A}^{2}% }\right)^{\frac{\underline{H}-r}{1-\underline{H}+r}}.fraktur_B = ( divide start_ARG square-root start_ARG 2 end_ARG 2 start_POSTSUPERSCRIPT under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT fraktur_A end_ARG start_ARG roman_log ( 2 ) end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( under¯ start_ARG italic_H end_ARG - italic_r ) start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT { square-root start_ARG 2 end_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT + roman_Δ } start_POSTSUPERSCRIPT 2 italic_r end_POSTSUPERSCRIPT , fraktur_b = fraktur_B start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( divide start_ARG fraktur_a end_ARG start_ARG 2 fraktur_A start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG under¯ start_ARG italic_H end_ARG - italic_r end_ARG start_ARG 1 - under¯ start_ARG italic_H end_ARG + italic_r end_ARG end_POSTSUPERSCRIPT . (40)

The condition on η𝜂\etaitalic_η in (37) represents the bias term that arises from performing denoised interpolation in order to compute the mean-squared variations. This bias term is an intrinsic feature of the “discretely observed" setup, where the surfaces are observed only at a finite number of discrete points, instead of in continuous time.

Corollary 1.

Suppose that assumptions of Proposition 3 hold true, and set

Δ=exp(log(M0)ξ),for 0<ξ<1.\Delta=\exp\left(-\log(M_{0})^{\xi}\right),\qquad\text{for }0<\xi<1.roman_Δ = roman_exp ( - roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_ξ end_POSTSUPERSCRIPT ) , for 0 < italic_ξ < 1 . (41)

Moreover, we assume that a non-negative constant 𝔢>0𝔢0\mathfrak{e}>0fraktur_e > 0 (independent of M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and N𝑁Nitalic_N) exist such that

𝔢1log(M0)log(N)𝔢.superscript𝔢1subscript𝑀0𝑁𝔢\mathfrak{e}^{-1}\leq\frac{\log(M_{0})}{\log(N)}\leq\mathfrak{e}.fraktur_e start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ divide start_ARG roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG roman_log ( italic_N ) end_ARG ≤ fraktur_e . (42)

Then

supβ[0,2π)|H^𝐮(β)(Δ)H𝐮(β)(Δ)|=O(max{M012H¯exp((2H¯H¯)log(M0ξ)),N12exp(2H¯log(M0ξ))}).subscriptsupremum𝛽02𝜋subscript^𝐻𝐮𝛽Δsubscript𝐻𝐮𝛽Δsubscript𝑂superscriptsubscript𝑀012¯𝐻2¯𝐻¯𝐻superscriptsubscript𝑀0𝜉superscript𝑁122¯𝐻superscriptsubscript𝑀0𝜉\sup_{\beta\in[0,2\pi)}|\widehat{H}_{\mathbf{u}(\beta)}(\Delta)-H_{\mathbf{u}(% \beta)}(\Delta)|\\ =O_{\mathbb{P}}\left(\max\left\{M_{0}^{-\frac{1}{2}\underline{H}}\exp\left((2% \overline{H}-\underline{H})\log(M_{0}^{\xi})\right),N^{-\frac{1}{2}}\exp\left(% 2\underline{H}\log(M_{0}^{\xi})\right)\right\}\right).start_ROW start_CELL roman_sup start_POSTSUBSCRIPT italic_β ∈ [ 0 , 2 italic_π ) end_POSTSUBSCRIPT | over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) - italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) | end_CELL end_ROW start_ROW start_CELL = italic_O start_POSTSUBSCRIPT blackboard_P end_POSTSUBSCRIPT ( roman_max { italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT roman_exp ( ( 2 over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG ) roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ξ end_POSTSUPERSCRIPT ) ) , italic_N start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT roman_exp ( 2 under¯ start_ARG italic_H end_ARG roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ξ end_POSTSUPERSCRIPT ) ) } ) . end_CELL end_ROW (43)

The condition (42) requires that M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT falls within two power of N𝑁Nitalic_N, equivalently, the converse hold true. The choice of ΔΔ\Deltaroman_Δ in (41) is done is such way that log(M0)bΔ(M0)M0b\log(M_{0})^{-b}\gg\Delta(M_{0})\gg M_{0}^{-b}roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - italic_b end_POSTSUPERSCRIPT ≫ roman_Δ ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≫ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_b end_POSTSUPERSCRIPT for any fixed b>0𝑏0b>0italic_b > 0, this choice can be found also in Golovkine et al., (2023) with ξ=1/3𝜉13\xi=1/3italic_ξ = 1 / 3. the rate in (43) converge to 00 since ΔΔ\Deltaroman_Δ is chosen to be larger than any negative power of M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and keeping (42) in mind.

Proposition 4.

Under the assumptions of Corollary 1, three positive constants C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and 𝔲𝔲\mathfrak{u}fraktur_u exist such that for any ε𝜀\varepsilonitalic_ε satisfying

|log(Δ)|𝔲Δ2H¯ε𝔲max{Δ3H¯|log(Δ)|M012H¯,Δζ|H1H2|},Δ𝔲superscriptΔ2¯𝐻𝜀𝔲superscriptΔ3¯𝐻Δsuperscriptsubscript𝑀012¯𝐻superscriptΔ𝜁subscript𝐻1subscript𝐻2\frac{|\log(\Delta)|}{\mathfrak{u}\Delta^{2\underline{H}}}\geq\varepsilon\geq% \mathfrak{u}\max\{\Delta^{-3\underline{H}}|\log(\Delta)|M_{0}^{-\frac{1}{2}% \underline{H}},\Delta^{\zeta\wedge|H_{1}-H_{2}|}\},divide start_ARG | roman_log ( roman_Δ ) | end_ARG start_ARG fraktur_u roman_Δ start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG ≥ italic_ε ≥ fraktur_u roman_max { roman_Δ start_POSTSUPERSCRIPT - 3 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT | roman_log ( roman_Δ ) | italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT , roman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ | italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT } , (44)

we obtain

(|g(α,Δ)^g(α)|ε)C1exp(C2ε2NΔ8H¯log2(Δ)).^𝑔𝛼Δ𝑔𝛼𝜀subscript𝐶1subscript𝐶2superscript𝜀2𝑁superscriptΔ8¯𝐻superscript2Δ\mathbb{P}\left(|\widehat{g(\alpha,\Delta)}-g(\alpha)|\geq\varepsilon\right)% \leq C_{1}\exp\left(-C_{2}\varepsilon^{2}N\frac{\Delta^{8\underline{H}}}{\log^% {2}(\Delta)}\right).blackboard_P ( | over^ start_ARG italic_g ( italic_α , roman_Δ ) end_ARG - italic_g ( italic_α ) | ≥ italic_ε ) ≤ italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_exp ( - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N divide start_ARG roman_Δ start_POSTSUPERSCRIPT 8 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Δ ) end_ARG ) . (45)

where g𝑔gitalic_g is defined in Proposition 1.

The condition on ε𝜀\varepsilonitalic_ε in (44) represent the reminder term in Proposition 1 combined with the condition (37) on η𝜂\etaitalic_η in Proposition 3, such condition on ε𝜀\varepsilonitalic_ε can be satisfied by choosing ΔΔ\Deltaroman_Δ as in (41) provided that M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is large enough. The probability bound (45) converge to zero as long as we choose ΔΔ\Deltaroman_Δ as in (41) and the condition (42) is satisfied.

Theorem 1.

Suppose that assumptions of Proposition 3 hold true. Moreover assume that for any k{1,,K0}𝑘1subscript𝐾0k\in\{1,\dots,K_{0}\}italic_k ∈ { 1 , … , italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } we have

Δk=Δ0o(Δζ{H¯H¯}H¯H¯H¯).subscriptΔ𝑘Δ0𝑜superscriptΔ𝜁¯𝐻¯𝐻¯𝐻¯𝐻¯𝐻\Delta_{k}\underset{\Delta\rightarrow 0}{=}o\left(\Delta^{\frac{\zeta\wedge\{% \overline{H}-\underline{H}\}}{\overline{H}-\underline{H}}\underline{H}}\right).roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_UNDERACCENT roman_Δ → 0 end_UNDERACCENT start_ARG = end_ARG italic_o ( roman_Δ start_POSTSUPERSCRIPT divide start_ARG italic_ζ ∧ { over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG } end_ARG start_ARG over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG end_ARG under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT ) . (46)

Then for any ε𝜀\varepsilonitalic_ε such that Δζ{H¯H¯}εmuch-less-thansuperscriptΔ𝜁¯𝐻¯𝐻𝜀\Delta^{\zeta\wedge\{\overline{H}-\underline{H}\}}\ll\varepsilonroman_Δ start_POSTSUPERSCRIPT italic_ζ ∧ { over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG } end_POSTSUPERSCRIPT ≪ italic_ε, we have for M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT sufficiently large

(|α^α|2ε)(|g(α,Δ)^g(α)|ε)+(|g(α,Δ)^g(α)|2H¯L¯(H¯H¯)4𝔞23H¯K0)+(supβ[0,2π)|H^𝐮(β)(Δ)H𝐮(β)(Δ)|H¯H¯8K0).^𝛼𝛼2𝜀^𝑔𝛼Δ𝑔𝛼𝜀superscript^𝑔𝛼Δ𝑔𝛼2¯𝐻¯𝐿¯𝐻¯𝐻4𝔞superscript23¯𝐻subscript𝐾0subscriptsupremum𝛽02𝜋subscript^𝐻𝐮𝛽Δsubscript𝐻𝐮𝛽Δ¯𝐻¯𝐻8subscript𝐾0\mathbb{P}\left(|\widehat{\alpha}-\alpha|\geq 2\varepsilon\right)\leq\mathbb{P% }\left(|\widehat{g(\alpha,\Delta)}-g(\alpha)|\geq\varepsilon\right)\\ +\mathbb{P}\left(|\widehat{g(\alpha,\Delta)}-g(\alpha)|^{2\underline{H}}\geq% \frac{\underline{L}(\overline{H}-\underline{H})}{4\mathfrak{a}2^{3-\underline{% H}}K_{0}}\right)\\ +\mathbb{P}\left(\sup_{\beta\in[0,2\pi)}|\widehat{H}_{\mathbf{u}(\beta)}(% \Delta)-H_{\mathbf{u}(\beta)}(\Delta)|\geq\frac{\overline{H}-\underline{H}}{8K% _{0}}\right).start_ROW start_CELL blackboard_P ( | over^ start_ARG italic_α end_ARG - italic_α | ≥ 2 italic_ε ) ≤ blackboard_P ( | over^ start_ARG italic_g ( italic_α , roman_Δ ) end_ARG - italic_g ( italic_α ) | ≥ italic_ε ) end_CELL end_ROW start_ROW start_CELL + blackboard_P ( | over^ start_ARG italic_g ( italic_α , roman_Δ ) end_ARG - italic_g ( italic_α ) | start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT ≥ divide start_ARG under¯ start_ARG italic_L end_ARG ( over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG ) end_ARG start_ARG 4 fraktur_a 2 start_POSTSUPERSCRIPT 3 - under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) end_CELL end_ROW start_ROW start_CELL + blackboard_P ( roman_sup start_POSTSUBSCRIPT italic_β ∈ [ 0 , 2 italic_π ) end_POSTSUBSCRIPT | over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) - italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ) | ≥ divide start_ARG over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG end_ARG start_ARG 8 italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) . end_CELL end_ROW (47)

The convergence of the RHS in (47) can be guaranteed following the discussion provided for Proposition 3, and Proposition 4 and Corollary 1. The main message of Theorem 1 is that, in order to identify the right angle that gives the maximal regularity, we need to choose the values in the grid 𝚫𝚫\mathbf{\Delta}bold_Δ introduced in (23) to have a slower rate of decrease than the ΔΔ\Deltaroman_Δ used to estimate the function g𝑔gitalic_g in (19). This is because, in (46), we have (ζ{H¯H¯}){H¯H¯}1H¯<1𝜁¯𝐻¯𝐻superscript¯𝐻¯𝐻1¯𝐻1(\zeta\wedge\{\overline{H}-\underline{H}\})\{\overline{H}-\underline{H}\}^{-1}% \underline{H}<1( italic_ζ ∧ { over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG } ) { over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG } start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT under¯ start_ARG italic_H end_ARG < 1.

5 Numerical Properties

We begin this section by briefly describing a simple, novel simulator for a class of bivariate anisotropic processes. The simulator, along with the other algorithms mentioned in this paper, are implemented in the R package direg333Freely accessible at https://github.com/sunnywang93/direg..

5.1 Anisotropic simulator

Let B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, B2subscript𝐵2B_{2}italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT be two processes with Hurst exponents H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT respectively. Suppose without loss of generality that H1H2subscript𝐻1subscript𝐻2H_{1}\neq H_{2}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. In the following, our exposition will be structured assuming that B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and B2subscript𝐵2B_{2}italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are fractional brownian motions (fBms). In principle, our simulation approach can be generalized to other processes that exhibit the stationary increments property.

Many methods are available to simulate these types of processes, and we do not attempt to provide an exhaustive list. For examples of some papers, see Wood and Chan, (1994), Stein, (2002) and Coeurjolly and Porcu, (2018). A survey can be found in Dieker, (2004). In particular, the circulant embedding method described in Wood and Chan, (1994) is attractive due to its speed, and can be adapted to simulate processes such as the fBm with stationary increments.

The anisotropic component of the simulator will be our main focus, since there is already a wide array of methods available for simulating the individual processes. Let 𝐮𝟏subscript𝐮1\mathbf{u_{1}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT and 𝐮𝟐subscript𝐮2\mathbf{u_{2}}bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT be unit vectors with associated regularities H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. The vectors can be represented in the canonical basis as 𝐮𝟏=cos(α)𝐞𝟏+sin(α)𝐞𝟐subscript𝐮1𝛼subscript𝐞1𝛼subscript𝐞2\mathbf{u_{1}}=\cos(\alpha)\mathbf{e_{1}}+\sin(\alpha)\mathbf{e_{2}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT = roman_cos ( italic_α ) bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + roman_sin ( italic_α ) bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT and 𝐮𝟐=sin(α)𝐞𝟏+cos(α)𝐞𝟐subscript𝐮2𝛼subscript𝐞1𝛼subscript𝐞2\mathbf{u_{2}}=-\sin(\alpha)\mathbf{e_{1}}+\cos(\alpha)\mathbf{e_{2}}bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT = - roman_sin ( italic_α ) bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + roman_cos ( italic_α ) bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT. The main difficulty of simulating processes on an equally spaced grid along 𝐮𝟏subscript𝐮1\mathbf{u_{1}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT, 𝐮𝟐subscript𝐮2\mathbf{u_{2}}bold_u start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT is the existence of negative values when α[0,3π/2]𝛼03𝜋2\alpha\in[0,3\pi/2]italic_α ∈ [ 0 , 3 italic_π / 2 ], while the fBm has a domain in +subscript\mathbb{R}_{+}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT.

This problem can be resolved by exploiting the stationary increments property of the fBm, given by B(t)B(s)=dB(ts)superscript𝑑𝐵𝑡𝐵𝑠𝐵𝑡𝑠B(t)-B(s)\stackrel{{\scriptstyle d}}{{=}}B(t-s)italic_B ( italic_t ) - italic_B ( italic_s ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP italic_B ( italic_t - italic_s ), where =dsuperscript𝑑\stackrel{{\scriptstyle d}}{{=}}start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP denotes equality in distribution. By using the reference point t=0𝑡0t=0italic_t = 0, we obtain

B(s)=dB(s).superscript𝑑𝐵𝑠𝐵𝑠-B(s)\stackrel{{\scriptstyle d}}{{=}}B(-s).- italic_B ( italic_s ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP italic_B ( - italic_s ) . (48)

An anisotropic fBm can thus be simulated on any α[0,π]𝛼0𝜋\alpha\in[0,\pi]italic_α ∈ [ 0 , italic_π ] by first simulating a fBm on an equally spaced grid in [0,|cos(α)|+sin(α)]0𝛼𝛼[0,|\cos(\alpha)|+\sin(\alpha)][ 0 , | roman_cos ( italic_α ) | + roman_sin ( italic_α ) ], before transforming the negative part using (48). A bivariate process can finally be constructed by applying a function f𝑓fitalic_f to the individual processes, for example f(B1,B2)=B1+B2𝑓subscript𝐵1subscript𝐵2subscript𝐵1subscript𝐵2f(B_{1},B_{2})=B_{1}+B_{2}italic_f ( italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Our simulator is similar in spirit to the Turning bands approach; see for example Matheron, (1973). Psuedocode for our simulator can be found in Algorithm SM.1 of the Supplementary Material.

5.2 Parameter settings and error measures

Observations (Y(i)(𝐭m),𝐭m),1iN,1mM0formulae-sequencesuperscript𝑌𝑖subscript𝐭𝑚subscript𝐭𝑚1𝑖𝑁1𝑚subscript𝑀0(Y^{(i)}(\mathbf{t}_{m}),\mathbf{t}_{m}),1\leq i\leq N,1\leq m\leq M_{0}( italic_Y start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , 1 ≤ italic_i ≤ italic_N , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT were simulated using our algorithm for the sum of two fBms. Other processes (e.g product of two fBms) can be found in the Supplementary Material.

A total of 60 different parameter configurations were explored, consisting of all possible combinations of the following parameter sets: number of curves N{100,150}𝑁100150N\in\{100,150\}italic_N ∈ { 100 , 150 }, number of points along each curve M0{512,1012}subscript𝑀0superscript512superscript1012M_{0}\in\{51^{2},101^{2}\}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ { 51 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , 101 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT }, noise level σ{0.1,0.5,1}𝜎0.10.51\sigma\in\{0.1,0.5,1\}italic_σ ∈ { 0.1 , 0.5 , 1 }, and angles α{π/30,π/5,π/4,π/3,π/2π/30}𝛼𝜋30𝜋5𝜋4𝜋3𝜋2𝜋30\alpha\in\{\pi/30,\pi/5,\pi/4,\pi/3,\pi/2-\pi/30\}italic_α ∈ { italic_π / 30 , italic_π / 5 , italic_π / 4 , italic_π / 3 , italic_π / 2 - italic_π / 30 }. In both processes, the regularities were fixed to be H1=0.8subscript𝐻10.8H_{1}=0.8italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.8 and H2=0.5subscript𝐻20.5H_{2}=0.5italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0.5. In line with our discussion in Section 3.2, the parameter ΔΔ\Deltaroman_Δ was set to be Δ=M01/4Δsuperscriptsubscript𝑀014\Delta=M_{0}^{-1/4}roman_Δ = italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT. The grid of spacings 𝚫𝚫\mathbf{\Delta}bold_Δ involved in the identification process was set to 𝚫={M1/4,Δ1,,ΔK01,0.4}𝚫superscript𝑀14subscriptΔ1subscriptΔsubscript𝐾010.4\mathbf{\Delta}=\{M^{-1/4},\Delta_{1},\dots,\Delta_{K_{0}-1},0.4\}bold_Δ = { italic_M start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT , roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_Δ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , 0.4 }, an evenly spaced grid consisting of K0=15subscript𝐾015K_{0}=15italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 15 points. The absolute error is used as a risk measure for each experiment, given by

α=|α^^α|,subscript𝛼^^𝛼𝛼\mathcal{R}_{\alpha}=|\widehat{\widehat{\alpha}}-\alpha|,caligraphic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT = | over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG - italic_α | , (49)

where α^^^^𝛼\widehat{\widehat{\alpha}}over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG is the adjusted estimate given by (31). Parameter settings for anisotropic detection can be found in Table 1.

5.3 Empirical Results

Boxplots for the simulation results can be seen in Figure 3. The risk can be seen to be very small, below 0.1 for all different configurations, even in the extremely high noise setting σ=1𝜎1\sigma=1italic_σ = 1. In fact, we see that our Algorithms are fairly robust to noise, in the sense that the risk increase less than proportionately to the noise levels. As seen in Section 7, the levels of accuracy observed in Figure 3 is sufficient to achieve lower risk levels for applications such as smoothing.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 3: Results for risk of estimated angles α𝛼\alphaitalic_α (with correction). N𝑁Nitalic_N is the number of curves, M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT the number of observed points along each curve, and σ𝜎\sigmaitalic_σ is the noise level.

6 Discussion

In this section we discuss some key aspects of our methodology. Extensions of our approach to more general settings, such as the random design framework with heteroscedastic noise, can be found in the Supplementary Material.

6.1 Computational Considerations

The computational complexity of Algorithm 1 is driven primarily by the identification process, due to the additional loop over the grid of spacings 𝚫𝚫\mathbf{\Delta}bold_Δ. Since we are performing a linear grid search in each basis vector containing M0subscript𝑀0\sqrt{M_{0}}square-root start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG points, the complexity of nearest neighbor interpolation for each surface is O(M0)𝑂subscript𝑀0O(M_{0})italic_O ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), resulting in O(N×M0)𝑂𝑁subscript𝑀0O(N\times M_{0})italic_O ( italic_N × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) for N𝑁Nitalic_N sample paths. By searching over M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT points to determine the closest point to each 𝐭msubscript𝐭𝑚\mathbf{t}_{m}bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT in terms of 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norm, the complexity of computing σ^2superscript^𝜎2\widehat{\sigma}^{2}over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is O(N×M0)𝑂𝑁subscript𝑀0O(N\times M_{0})italic_O ( italic_N × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), so the complexity of H^𝐯(β)(𝐭,Δ)subscript^𝐻𝐯𝛽𝐭Δ\widehat{H}_{\mathbf{v}(\beta)}(\mathbf{t},\Delta)over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_v ( italic_β ) end_POSTSUBSCRIPT ( bold_t , roman_Δ ) is O(N×M0)𝑂𝑁subscript𝑀0O(N\times M_{0})italic_O ( italic_N × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) for each 𝐭𝐭\mathbf{t}bold_t, ΔΔ\Deltaroman_Δ. The complexity of the full identification process is therefore O(N×#𝒯~×K0×M0)𝑂𝑁#~𝒯subscript𝐾0subscript𝑀0O(N\times\#\widetilde{\mathcal{T}}\times K_{0}\times M_{0})italic_O ( italic_N × # over~ start_ARG caligraphic_T end_ARG × italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). In the context of fda, a complexity of at least O(N×#𝒯~×M0)𝑂𝑁#~𝒯subscript𝑀0O(N\times\#\widetilde{\mathcal{T}}\times M_{0})italic_O ( italic_N × # over~ start_ARG caligraphic_T end_ARG × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) is to be expected. Since K0subscript𝐾0K_{0}italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is relatively small (e.g 15 points), our algorithm is reasonably good in terms of computational complexity, considering the possible gains in the rates of convergence.

Clearly, the simulation described in Section 5.1 depends on the method used to generate the individual fBms. Once the fBms are given, only a linear search over #𝒯~#~𝒯\#\widetilde{\mathcal{T}}# over~ start_ARG caligraphic_T end_ARG grid points is required. Using a fast simulation method such as the circulant embedding method in Wood and Chan, (1994), individual one dimensional curves can be simulated in O({#𝒯~×log(#𝒯~)}1/2)𝑂superscript#~𝒯#~𝒯12O(\{\#\widetilde{\mathcal{T}}\times\log(\#\widetilde{\mathcal{T}})\}^{1/2})italic_O ( { # over~ start_ARG caligraphic_T end_ARG × roman_log ( # over~ start_ARG caligraphic_T end_ARG ) } start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ) time. Thus the simulation of N𝑁Nitalic_N surfaces can be done in O(N×#𝒯~×log(#𝒯~))𝑂𝑁#~𝒯#~𝒯O(N\times\#\widetilde{\mathcal{T}}\times\log(\#\widetilde{\mathcal{T}}))italic_O ( italic_N × # over~ start_ARG caligraphic_T end_ARG × roman_log ( # over~ start_ARG caligraphic_T end_ARG ) ). To the best of our knowledge, we do not know of any simulation algorithm in the context of fda that has better computational complexity.

6.2 Singularity along the maximizing direction

Lemma 1 states that there is a singularity present in the concept of directional regularity, in the sense that there exists only one direction for which the regularity is maximal. Since the angles α𝛼\alphaitalic_α are not expected to be estimated perfectly, why can there be a gain in convergence rates, as seen in the empirical results in Sections 5.3?

The answer lies in the intrinsic nature of directional regularity, which is itself intimately related to the mean-squared variations of a process. It is an “infinitesimal" concept, in the sense that

limΔ0θ𝐮(𝐭)L𝐮(𝐭)Δ2H𝐮=1.subscriptΔ0subscript𝜃𝐮𝐭subscript𝐿𝐮𝐭superscriptΔ2subscript𝐻𝐮1\lim_{\Delta\rightarrow 0}\frac{\theta_{\mathbf{u}}(\mathbf{t})}{L_{\mathbf{u}% }(\mathbf{t})\Delta^{2H_{\mathbf{u}}}}=1.roman_lim start_POSTSUBSCRIPT roman_Δ → 0 end_POSTSUBSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) end_ARG start_ARG italic_L start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT ( bold_t ) roman_Δ start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG = 1 . (50)

However, the limiting case cannot be obtained with finite precision computers. In reality, the “true" directional regularity" that one observes is given by the proxy H𝐮(β)(Δ)subscript𝐻𝐮𝛽ΔH_{\mathbf{u}(\beta)}(\Delta)italic_H start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ ). This partly explains the observed “continuity" associated to the directional regularity in practice, so that the estimate of α𝛼\alphaitalic_α only needs to be sufficiently small in order for gains in the rate to be achieved.

A more precise, formal perspective can be taken through the lens of Proposition 2, found in the Supplementary Material. In the upper bound of H𝐮(β1)H𝐮(β2)subscript𝐻𝐮subscript𝛽1subscript𝐻𝐮subscript𝛽2H_{\mathbf{u}(\beta_{1})}-H_{\mathbf{u}(\beta_{2})}italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT, we observe two competing terms, in the form of Δ2(H¯H𝐮(β2))superscriptΔ2¯𝐻subscript𝐻𝐮subscript𝛽2\Delta^{2(\underline{H}-H_{\mathbf{u}(\beta_{2})})}roman_Δ start_POSTSUPERSCRIPT 2 ( under¯ start_ARG italic_H end_ARG - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT and (β1β2)2H¯superscriptsubscript𝛽1subscript𝛽22¯𝐻(\beta_{1}-\beta_{2})^{2\underline{H}}( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT. In order for H𝐮(β1)H𝐮(β2)subscript𝐻𝐮subscript𝛽1subscript𝐻𝐮subscript𝛽2H_{\mathbf{u}(\beta_{1})}-H_{\mathbf{u}(\beta_{2})}italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT - italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT to converge to zero, it is necessary that the condition

limΔ0(β1β2)2H¯Δ2(H𝐮(β2)H¯)=0subscriptΔ0superscriptsubscript𝛽1subscript𝛽22¯𝐻superscriptΔ2subscript𝐻𝐮subscript𝛽2¯𝐻0\lim_{\Delta\rightarrow 0}\frac{(\beta_{1}-\beta_{2})^{2\underline{H}}}{\Delta% ^{2(H_{\mathbf{u}(\beta_{2})}-\underline{H})}}=0roman_lim start_POSTSUBSCRIPT roman_Δ → 0 end_POSTSUBSCRIPT divide start_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 under¯ start_ARG italic_H end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG roman_Δ start_POSTSUPERSCRIPT 2 ( italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT - under¯ start_ARG italic_H end_ARG ) end_POSTSUPERSCRIPT end_ARG = 0 (51)

is satisfied. In other words, β1β2=o(ΔH𝐮(β2)/H¯1)=o(1)subscript𝛽1subscript𝛽2𝑜superscriptΔsubscript𝐻𝐮subscript𝛽2¯𝐻1𝑜1\beta_{1}-\beta_{2}=o(\Delta^{H_{\mathbf{u}(\beta_{2})}/\underline{H}-1})=o(1)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_o ( roman_Δ start_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT / under¯ start_ARG italic_H end_ARG - 1 end_POSTSUPERSCRIPT ) = italic_o ( 1 ), (51) can be loosely interpreted as saying “for directions that are close enough to each other, the proxies in (22), of the directional regularity stays somewhat close"; see Proposition 2.

7 Applications

In this section, we discuss concrete applications of our directional regularity methodology. They serve as a strong motivation for performing a change of basis using our methodology as a standard pre-processing step.

7.1 Smoothing Bivariate Functional Data

Let {X(𝐭),𝐭𝒯2}𝑋𝐭𝐭𝒯superscript2\{X(\mathbf{t}),\mathbf{t}\in\mathcal{T}\subset\mathbb{R}^{2}\}{ italic_X ( bold_t ) , bold_t ∈ caligraphic_T ⊂ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } be a bi-variate stochastic process satisfying (3), with maximizing regularity along the direction 𝐮𝟏=cos(α)𝐞𝟏+sin(α)𝐞𝟐subscript𝐮1𝛼subscript𝐞1𝛼subscript𝐞2\mathbf{u_{1}}=\cos(\alpha)\mathbf{e_{1}}+\sin(\alpha)\mathbf{e_{2}}bold_u start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT = roman_cos ( italic_α ) bold_e start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT + roman_sin ( italic_α ) bold_e start_POSTSUBSCRIPT bold_2 end_POSTSUBSCRIPT. Following the preceding sections, our exposition will be centered around the common design case, although this is not necessary. Observations come in the form of pairs 𝒟0=(Ym(i),𝐭m),1mM0,1iNformulae-sequenceformulae-sequencesubscript𝒟0superscriptsubscript𝑌𝑚𝑖subscript𝐭𝑚1𝑚subscript𝑀01𝑖𝑁\mathcal{D}_{0}=(Y_{m}^{(i)},\mathbf{t}_{m}),1\leq m\leq M_{0},1\leq i\leq Ncaligraphic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_Y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 1 ≤ italic_i ≤ italic_N, generated under

Ym(i)=X(i)(𝐭m)+σem(i),1mM0,1iN,formulae-sequenceformulae-sequencesubscriptsuperscript𝑌𝑖𝑚superscript𝑋𝑖subscript𝐭𝑚𝜎subscriptsuperscript𝑒𝑖𝑚1𝑚subscript𝑀01𝑖𝑁Y^{(i)}_{m}=X^{(i)}(\mathbf{t}_{m})+\sigma e^{(i)}_{m},\qquad 1\leq m\leq M_{0% },1\leq i\leq N,italic_Y start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_X start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) + italic_σ italic_e start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 1 ≤ italic_i ≤ italic_N , (52)

where em(i)superscriptsubscript𝑒𝑚𝑖e_{m}^{(i)}italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT are i.i.d centered random variables with unit variance. We call 𝒟0subscript𝒟0\mathcal{D}_{0}caligraphic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT the learning set.

Consider a new realization Xnewsuperscript𝑋𝑛𝑒𝑤X^{new}italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT of X𝑋Xitalic_X, where the observed pairs 𝒟1=(Ymnew,𝐭m),1mM0formulae-sequencesubscript𝒟1subscriptsuperscript𝑌𝑛𝑒𝑤𝑚subscript𝐭𝑚1𝑚subscript𝑀0\mathcal{D}_{1}=(Y^{new}_{m},\mathbf{t}_{m}),1\leq m\leq M_{0}caligraphic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_Y start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are generated from

Ymnew=Xnew(𝐭m)+σemnew,1mM0,formulae-sequencesubscriptsuperscript𝑌𝑛𝑒𝑤𝑚superscript𝑋𝑛𝑒𝑤subscript𝐭𝑚𝜎subscriptsuperscript𝑒𝑛𝑒𝑤𝑚1𝑚subscript𝑀0Y^{new}_{m}=X^{new}(\mathbf{t}_{m})+\sigma e^{new}_{m},\quad\quad 1\leq m\leq M% _{0},italic_Y start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) + italic_σ italic_e start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , 1 ≤ italic_m ≤ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (53)

where emnewsuperscriptsubscript𝑒𝑚𝑛𝑒𝑤e_{m}^{new}italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT are i.i.d centered random variables with unit variance and Xnewsuperscript𝑋𝑛𝑒𝑤X^{new}italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT and emnewsuperscriptsubscript𝑒𝑚𝑛𝑒𝑤e_{m}^{new}italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT are independent. We refer to 𝒟1subscript𝒟1\mathcal{D}_{1}caligraphic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT as the online set.

Our goal is the recovery of Xnew(𝐭m)superscript𝑋𝑛𝑒𝑤subscript𝐭𝑚X^{new}({\mathbf{t}_{m}})italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) with 𝒟0subscript𝒟0\mathcal{D}_{0}caligraphic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, using a suitable estimator X^new(𝐭m)superscript^𝑋𝑛𝑒𝑤subscript𝐭𝑚\widehat{X}^{new}({\mathbf{t}_{m}})over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ). Several methodologies for smoothing multivariate functional data currently exist. For example, Ramsay, (2002), Sangalli et al., (2013), Wood et al., (2008) consider spatial regression using a penalty that involves the Laplacian. These do not take the anisotropy of the process into account. Azzimonti et al., (2015) and Bernardi et al., (2018) extends these previous approaches by using a penalty term involving the partial different operator, and takes the anisotropy of the process into account. All these methods assume that the processes are differentiable for the penalty to be well defined. This assumption does not always hold in practice, and we propose an approach which works for non-differentiable processes.

The parameters of X^new(𝐭m)superscript^𝑋𝑛𝑒𝑤subscript𝐭𝑚\widehat{X}^{new}({\mathbf{t}_{m}})over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) are estimated from the learning set. Let Rαsubscript𝑅𝛼R_{\alpha}italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT be a clockwise rotation matrix given by

Rα=(cos(α)sin(α)sin(α)cos(α)).subscript𝑅𝛼matrix𝛼𝛼𝛼𝛼R_{\alpha}=\begin{pmatrix}\cos(\alpha)&\sin(\alpha)\\ -\sin(\alpha)&\cos(\alpha)\end{pmatrix}.italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL roman_cos ( italic_α ) end_CELL start_CELL roman_sin ( italic_α ) end_CELL end_ROW start_ROW start_CELL - roman_sin ( italic_α ) end_CELL start_CELL roman_cos ( italic_α ) end_CELL end_ROW end_ARG ) . (54)

Define a new process by applying the rotation matrix to the sampling points, denoted by

Z(𝐭):=X(Rα1𝐭),𝐭𝒯.formulae-sequenceassign𝑍𝐭𝑋superscriptsubscript𝑅𝛼1𝐭for-all𝐭𝒯Z(\mathbf{t}):=X(R_{\alpha}^{-1}\cdot\mathbf{t}),\qquad\forall\mathbf{t}\in% \mathcal{T}.italic_Z ( bold_t ) := italic_X ( italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ bold_t ) , ∀ bold_t ∈ caligraphic_T . (55)

It can be seen easily that Z(𝐭)𝑍𝐭Z(\mathbf{t})italic_Z ( bold_t ) admits a larger effective smoothness along the canonical basis. It is thus beneficial to work instead with Z𝑍Zitalic_Z instead of X𝑋Xitalic_X on the canonical basis. In practice, this transformed process can be obtained by performing a change-of-basis after estimating the rotation matrix Rαsubscript𝑅𝛼R_{\alpha}italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT using our methodology.

For simplicity, we fix the smoother to be the Nadaraya-Watson estimator with the multiplicative Epanechnikov kernel K:2+:𝐾superscript2subscriptK:\mathbb{R}^{2}\to\mathbb{R}_{+}italic_K : blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT, supported on [1,1]×[1,1]1111[-1,1]\times[-1,1][ - 1 , 1 ] × [ - 1 , 1 ], where

K(𝐬)=Kep(s1)×Kep(s2),𝐬=(s1,s2)2,and Kep(x)=34(1x2)𝟙{|x|1}.formulae-sequenceformulae-sequence𝐾𝐬subscript𝐾𝑒𝑝subscript𝑠1subscript𝐾𝑒𝑝subscript𝑠2for-all𝐬subscript𝑠1subscript𝑠2superscript2and subscript𝐾𝑒𝑝𝑥341superscript𝑥2subscript1𝑥1K(\mathbf{s})=K_{ep}(s_{1})\times K_{ep}(s_{2}),\quad\forall\mathbf{s}=(s_{1},% s_{2})\in\mathbb{R}^{2},\quad\text{and }K_{ep}(x)=\frac{3}{4}(1-x^{2})\mathbbm% {1}_{\{|x|\leq 1\}}.italic_K ( bold_s ) = italic_K start_POSTSUBSCRIPT italic_e italic_p end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) × italic_K start_POSTSUBSCRIPT italic_e italic_p end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , ∀ bold_s = ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , and italic_K start_POSTSUBSCRIPT italic_e italic_p end_POSTSUBSCRIPT ( italic_x ) = divide start_ARG 3 end_ARG start_ARG 4 end_ARG ( 1 - italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) blackboard_1 start_POSTSUBSCRIPT { | italic_x | ≤ 1 } end_POSTSUBSCRIPT . (56)

Furthermore, 𝐁=diag(h11,h21)𝐁diagsuperscriptsubscript11superscriptsubscript21\mathbf{B}=\operatorname{diag}(h_{1}^{-1},h_{2}^{-1})bold_B = roman_diag ( italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) is a positive definite bandwidth matrix. Using the rule 0/0=00000/0=00 / 0 = 0, the Nadaraya-Watson estimator is given by

Z^new(𝐭;𝐁)=m=1M0YmnewK(𝐁(Rα𝐭mnew𝐭))m=1M0K(𝐁(Rα𝐭mnew𝐭)).superscript^𝑍𝑛𝑒𝑤𝐭𝐁superscriptsubscript𝑚1subscript𝑀0subscriptsuperscript𝑌𝑛𝑒𝑤𝑚𝐾𝐁subscript𝑅𝛼subscriptsuperscript𝐭𝑛𝑒𝑤𝑚𝐭superscriptsubscript𝑚1subscript𝑀0𝐾𝐁subscript𝑅𝛼subscriptsuperscript𝐭𝑛𝑒𝑤𝑚𝐭\widehat{Z}^{new}(\mathbf{t};\mathbf{B})=\sum_{m=1}^{M_{0}}Y^{new}_{m}\frac{K% \left(\mathbf{B}(R_{\alpha}\mathbf{t}^{new}_{m}-\mathbf{t})\right)}{\sum_{m=1}% ^{M_{0}}K\left(\mathbf{B}(R_{\alpha}\mathbf{t}^{new}_{m}-\mathbf{t})\right)}.over^ start_ARG italic_Z end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t ; bold_B ) = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_Y start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT divide start_ARG italic_K ( bold_B ( italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT bold_t start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - bold_t ) ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K ( bold_B ( italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT bold_t start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - bold_t ) ) end_ARG . (57)

The bandwidths should adapt to the regularity of the process Z𝑍Zitalic_Z to obtain the optimal rate of convergence. In particular, the bandwidths should be selected adaptively to the intrinsic anisotropy of the process. This can be achieved through a plug-in bandwidth rule when explicit risk bounds are available and directly estimable from the data. The estimate of Xnewsuperscript𝑋𝑛𝑒𝑤X^{new}italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT is then obtained by

X^new(𝐭,𝐁)=Z^new(Rα𝐭,𝐁),𝐭𝒯.formulae-sequencesuperscript^𝑋𝑛𝑒𝑤𝐭𝐁superscript^𝑍𝑛𝑒𝑤subscript𝑅𝛼𝐭𝐁for-all𝐭𝒯\widehat{X}^{new}(\mathbf{t},\mathbf{B})=\widehat{Z}^{new}(R_{\alpha}\mathbf{t% },\mathbf{B}),\qquad\forall\mathbf{t}\in\mathcal{T}.over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t , bold_B ) = over^ start_ARG italic_Z end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT bold_t , bold_B ) , ∀ bold_t ∈ caligraphic_T . (58)

Let B(𝟎,r)𝐵0𝑟B(\mathbf{0},r)italic_B ( bold_0 , italic_r ) denote the ball centered at the origin of 2superscript2\mathbb{R}^{2}blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT with radius r𝑟ritalic_r. Consider the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT risk of Z^newsuperscript^𝑍𝑛𝑒𝑤\widehat{Z}^{new}over^ start_ARG italic_Z end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT, given by

(B,M0)=𝔼[Z^new(;B)Znew()2]=𝔼[X^new(;B)Xnew()2],Bsubscript𝑀0𝔼delimited-[]subscriptnormsuperscript^𝑍𝑛𝑒𝑤Bsuperscript𝑍𝑛𝑒𝑤2𝔼delimited-[]subscriptnormsuperscript^𝑋𝑛𝑒𝑤Bsuperscript𝑋𝑛𝑒𝑤2\mathcal{R}\left(\textbf{B},M_{0}\right)=\mathbb{E}\left[\|\widehat{Z}^{new}(% \cdot\hskip 2.84544pt;\textbf{B})-Z^{new}(\cdot)\|_{2}\right]=\mathbb{E}\left[% \|\widehat{X}^{new}(\cdot\hskip 2.84544pt;\textbf{B})-X^{new}(\cdot)\|_{2}% \right],caligraphic_R ( B , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = blackboard_E [ ∥ over^ start_ARG italic_Z end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( ⋅ ; B ) - italic_Z start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( ⋅ ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] = blackboard_E [ ∥ over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( ⋅ ; B ) - italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( ⋅ ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] , (59)

where the second inequality is given by substitution and the fact that the matrix Rαsubscript𝑅𝛼R_{\alpha}italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT is with determinant 1. The pointwise risk is given by

(𝐭,B,M0)=𝔼[|X^new(𝐭;B)Xnew(𝐭)|2],𝐭Bsubscript𝑀0𝔼delimited-[]superscriptsuperscript^𝑋𝑛𝑒𝑤𝐭Bsuperscript𝑋𝑛𝑒𝑤𝐭2\mathcal{R}\left(\mathbf{t},\textbf{B},M_{0}\right)=\mathbb{E}\left[\left|% \widehat{X}^{new}(\mathbf{t};\textbf{B})-X^{new}(\mathbf{t})\right|^{2}\right],caligraphic_R ( bold_t , B , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = blackboard_E [ | over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t ; B ) - italic_X start_POSTSUPERSCRIPT italic_n italic_e italic_w end_POSTSUPERSCRIPT ( bold_t ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] , (60)

Observe that Fubini’s theorem implies (B,M0)=(,B,M0),Bsubscript𝑀0normBsubscript𝑀0\mathcal{R}\left(\textbf{B},M_{0}\right)=\|\mathcal{R}\left(\cdot,\textbf{B},M% _{0}\right)\|,caligraphic_R ( B , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ∥ caligraphic_R ( ⋅ , B , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ , so the integrated risk can readily be recovered from the pointwise risk. Proposition 5 provides a pointwise risk bound. Assumptions can be found in the Appendix.

Proposition 5.

Let h1>0subscript10h_{1}>0italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 and h2>0subscript20h_{2}>0italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 in a bandwidth range satisfying

max{h1,h2}0,andM0×min{h1,h2}+.formulae-sequencesubscript1subscript20andsubscript𝑀0subscript1subscript2\max\{h_{1},h_{2}\}\rightarrow 0,\qquad\text{and}\quad\sqrt{M_{0}}\times\min\{% h_{1},h_{2}\}\rightarrow+\infty.roman_max { italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } → 0 , and square-root start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG × roman_min { italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } → + ∞ . (61)

Then the following risk bounds hold:

(B,M0){1M0h1h2+h12H1+h22H2}.less-than-or-similar-toBsubscript𝑀01subscript𝑀0subscript1subscript2superscriptsubscript12subscript𝐻1superscriptsubscript22subscript𝐻2\mathcal{R}(\textbf{B},M_{0})\lesssim\left\{\frac{1}{M_{0}h_{1}h_{2}}+h_{1}^{2% H_{1}}+h_{2}^{2H_{2}}\right\}.caligraphic_R ( B , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≲ { divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG + italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT } . (62)

The proof is provided in the Supplementary Material. The following corollary provides the optimal bandwidth with respect to the L1superscript𝐿1L^{1}italic_L start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT norm.

Corollary 2.

Under the same assumptions of Proposition 5, the optimal bandwidths h1superscriptsubscript1h_{1}^{*}italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and h2superscriptsubscript2h_{2}^{*}italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT satisfies

h1(1M0)H22H1H2+H1+H2,h2(1M0)H12H1H2+H1+H2.formulae-sequenceasymptotically-equalssuperscriptsubscript1superscript1subscript𝑀0subscript𝐻22subscript𝐻1subscript𝐻2subscript𝐻1subscript𝐻2asymptotically-equalssuperscriptsubscript2superscript1subscript𝑀0subscript𝐻12subscript𝐻1subscript𝐻2subscript𝐻1subscript𝐻2h_{1}^{*}\asymp\left(\frac{1}{M_{0}}\right)^{\frac{H_{2}}{2H_{1}H_{2}+H_{1}+H_% {2}}},\quad h_{2}^{*}\asymp\left(\frac{1}{M_{0}}\right)^{\frac{H_{1}}{2H_{1}H_% {2}+H_{1}+H_{2}}}.italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ≍ ( divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT , italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ≍ ( divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT . (63)

Moreover, if h1superscriptsubscript1h_{1}^{*}italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and h2superscriptsubscript2h_{2}^{*}italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is used for smoothing, the following risk is obtained:

(B,M0)M02ω2ω+1,less-than-or-similar-tosuperscriptBsubscript𝑀0superscriptsubscript𝑀02𝜔2𝜔1\mathcal{R}(\textbf{B}^{*},M_{0})\lesssim M_{0}^{-\frac{2\omega}{2\omega+1}},caligraphic_R ( B start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≲ italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 2 italic_ω end_ARG start_ARG 2 italic_ω + 1 end_ARG end_POSTSUPERSCRIPT , (64)

where ω𝜔\omegaitalic_ω is the “effective smoothness", defined by the relation

1H1+1H2=1ω.1subscript𝐻11subscript𝐻21𝜔\frac{1}{H_{1}}+\frac{1}{H_{2}}=\frac{1}{\omega}.divide start_ARG 1 end_ARG start_ARG italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG = divide start_ARG 1 end_ARG start_ARG italic_ω end_ARG . (65)

All the quantities in Corollary 2 are estimable from the data using our methodology. Structural adaptation enables the optimal bandwidths to be chosen according to Corollary 2, obtaining the anisotropic rates of convergence, when the intrinsic anisotropy is not in the direction of the canonical basis. By working only on the canonical basis, the rates of convergence that is generally obtained corresponds to the isotropic rate. This is confirmed by our simulation study, which we discuss now.

Our simulation study aims to compare the L1superscript𝐿1L^{1}italic_L start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT risk of smoothing, with and without a change of basis. Surfaces corresponding to the sum of fBms with regularities H1=0.8subscript𝐻10.8H_{1}=0.8italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.8, H2=0.5subscript𝐻20.5H_{2}=0.5italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0.5 were simulated using the Algorithm described in Section 5.1. The angles α𝛼\alphaitalic_α were given by α{π/3,5π/6}𝛼𝜋35𝜋6\alpha\in\{\pi/3,5\pi/6\}italic_α ∈ { italic_π / 3 , 5 italic_π / 6 }. N=150𝑁150N=150italic_N = 150 surfaces were generated with M0=101subscript𝑀0101M_{0}=101italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 101 evenly spaced sampling points. Gaussian noise σem(i)𝜎superscriptsubscript𝑒𝑚𝑖\sigma e_{m}^{(i)}italic_σ italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT were added, where em(i)iid𝒩(0,1)superscriptsimilar-to𝑖𝑖𝑑superscriptsubscript𝑒𝑚𝑖𝒩01e_{m}^{(i)}\stackrel{{\scriptstyle iid}}{{\sim}}\mathcal{N}(0,1)italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_RELOP SUPERSCRIPTOP start_ARG ∼ end_ARG start_ARG italic_i italic_i italic_d end_ARG end_RELOP caligraphic_N ( 0 , 1 ) and σ=0.05𝜎0.05\sigma=0.05italic_σ = 0.05. Estimates of α𝛼\alphaitalic_α were obtained with Algorithm 1, with Δ=M01/4Δsuperscriptsubscript𝑀014\Delta=M_{0}^{-1/4}roman_Δ = italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT, 𝚫={M1/4,Δ1,,ΔK01,0.4}𝚫superscript𝑀14subscriptΔ1subscriptΔsubscript𝐾010.4\mathbf{\Delta}=\{M^{-1/4},\Delta_{1},\dots,\Delta_{K_{0}-1},0.4\}bold_Δ = { italic_M start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT , roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_Δ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , 0.4 }, and #𝚫=15#𝚫15\#\mathbf{\Delta}=15# bold_Δ = 15.

The online set, consisting of one surface per replication, were generated from the same process, for a total of 400 replications. The true surface was first generated on an equally spaced grid of 201 points on 𝒯𝒯\mathcal{T}caligraphic_T without noise. Nearest neighbor interpolation was then performed to discretize the process onto an grid of 101 points. The same Gaussian noise was added as the learning set.

For the anisotropic setp, a change of basis is performed on the online set. The minimum and maximum regularities were estimated using (18) and (22). The bandwidths were selected according to Corollary 2. In the isotropic setup, no change of basis is performed, and the bandwidths were similarly selected according to Corollary 2, with H1=H2subscript𝐻1subscript𝐻2H_{1}=H_{2}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. A multiplicative kernel was used, where the Epachenikov kernel was used in each dimension.

The risk measure was taken to be the relative risk, given by

rel=aniiso,subscript𝑟𝑒𝑙subscript𝑎𝑛𝑖subscript𝑖𝑠𝑜\mathcal{R}_{rel}=\frac{\mathcal{R}_{ani}}{\mathcal{R}_{iso}},caligraphic_R start_POSTSUBSCRIPT italic_r italic_e italic_l end_POSTSUBSCRIPT = divide start_ARG caligraphic_R start_POSTSUBSCRIPT italic_a italic_n italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_R start_POSTSUBSCRIPT italic_i italic_s italic_o end_POSTSUBSCRIPT end_ARG , (66)

where anisubscript𝑎𝑛𝑖\mathcal{R}_{ani}caligraphic_R start_POSTSUBSCRIPT italic_a italic_n italic_i end_POSTSUBSCRIPT and isosubscript𝑖𝑠𝑜\mathcal{R}_{iso}caligraphic_R start_POSTSUBSCRIPT italic_i italic_s italic_o end_POSTSUBSCRIPT correspond to the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT anisotropic and isotropic risk respectively. Simulation results can be seen in Figure 4. We see that with the exception of the angles near the boundary (i.e 0 or π/2𝜋2\pi/2italic_π / 2), performing a change of basis leads to a reduction in the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT risk, by levels as much as 10%. It is not surprising that at the boundary, the risk is worse, since one is already anisotropic along the canonical basis.

Refer to caption
Figure 4: Results for risk of estimated angles α𝛼\alphaitalic_α (with correction). N𝑁Nitalic_N is the number of curves, M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT the number of observed points along each curve, and σ𝜎\sigmaitalic_σ is the noise level.

7.2 Anisotropic detection

Our focus so far has been on estimating the direction of the maximizing regularity, which implicitly assumes that anisotropy is present. When the process is intrinsically isotropic, no gains can be made by structural adaptation since the regularity of the process X𝑋Xitalic_X is invariant to directions. However, there is no real loss in terms of statistical error either, since the underlying regularity remains the same, as implied by Lemma 1. In particular, the rates of convergence do not get degraded, and can only be improved with structural adaptation.

Nevertheless, there might be instances where knowing the presence of anisotropy is relevant for some applications. Some examples include fingerprint verification Jain et al., (1997), Jiang, (2005) and texture analysis in materials science Germain et al., (2003). This issue can be naturally addressed using the directional regularity approach, by constructing an anistropic detection procedure based on thresholding. The underlying idea is to consider an event

𝒜(τ):={|H¯^H¯^|>τ},assign𝒜𝜏^¯𝐻^¯𝐻𝜏\mathcal{A}(\tau):=\left\{\left|\widehat{\underline{H}}-\widehat{\overline{H}}% \right|>\tau\right\},caligraphic_A ( italic_τ ) := { | over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG - over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG | > italic_τ } , (67)

and determine anisotropy based on 𝟙𝒜(τ)subscript1𝒜𝜏\mathbbm{1}_{\mathcal{A}(\tau)}blackboard_1 start_POSTSUBSCRIPT caligraphic_A ( italic_τ ) end_POSTSUBSCRIPT, for some threshold τ𝜏\tauitalic_τ that is appropriately chosen. In particular, τ𝜏\tauitalic_τ should be selected to account for the estimation errors of H¯^^¯𝐻\widehat{\underline{H}}over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG and H¯^^¯𝐻\widehat{\overline{H}}over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG.

Let ε¯=|H¯^H¯|¯𝜀^¯𝐻¯𝐻\underline{\varepsilon}=|\widehat{\underline{H}}-\underline{H}|under¯ start_ARG italic_ε end_ARG = | over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG - under¯ start_ARG italic_H end_ARG | and ε¯=H¯H¯¯𝜀¯𝐻¯𝐻\overline{\varepsilon}=\overline{H}-\underline{H}over¯ start_ARG italic_ε end_ARG = over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG denote the estimation errors of the minimum and maximum regularities respectively. By construction, we have ε¯ε¯¯𝜀¯𝜀\overline{\varepsilon}\geq\underline{\varepsilon}over¯ start_ARG italic_ε end_ARG ≥ under¯ start_ARG italic_ε end_ARG, with strict inequality holding in general, since H¯^^¯𝐻\widehat{\underline{H}}over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG is used as an auxiliary quantity for estimating H¯^^¯𝐻\widehat{\overline{H}}over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG. A sensible approach is thus to choose τ𝜏\tauitalic_τ such that ε¯<τ<ε¯¯𝜀𝜏¯𝜀\underline{\varepsilon}<\tau<\overline{\varepsilon}under¯ start_ARG italic_ε end_ARG < italic_τ < over¯ start_ARG italic_ε end_ARG. On the one hand, when H¯=H¯¯𝐻¯𝐻\underline{H}=\overline{H}under¯ start_ARG italic_H end_ARG = over¯ start_ARG italic_H end_ARG (i.e isotropy), choosing τ>ε¯𝜏¯𝜀\tau>\underline{\varepsilon}italic_τ > under¯ start_ARG italic_ε end_ARG accounts for the estimation error so that 𝟙𝒜(τ)=0subscript1𝒜𝜏0\mathbbm{1}_{\mathcal{A}(\tau)}=0blackboard_1 start_POSTSUBSCRIPT caligraphic_A ( italic_τ ) end_POSTSUBSCRIPT = 0 with high probability. On the other hand, when H¯H¯¯𝐻¯𝐻\underline{H}\neq\overline{H}under¯ start_ARG italic_H end_ARG ≠ over¯ start_ARG italic_H end_ARG, choosing τ<ε¯𝜏¯𝜀\tau<\overline{\varepsilon}italic_τ < over¯ start_ARG italic_ε end_ARG is necessary to avoid falsely detecting isotropy. An illustration of this can be seen in Figure 5.

0ε¯¯𝜀\underline{\varepsilon}under¯ start_ARG italic_ε end_ARGτ𝜏\tauitalic_τε¯¯𝜀\overline{\varepsilon}over¯ start_ARG italic_ε end_ARG1Always detect anisotropyAlways detect isotropy
Figure 5: Illustration of the region in which the thresholding parameter τ𝜏\tauitalic_τ should fall.

Since ε¯¯𝜀\underline{\varepsilon}under¯ start_ARG italic_ε end_ARG is unknown in practice, we construct a data-driven procedure to estimate it, which we denote ε¯^^¯𝜀\widehat{\underline{\varepsilon}}over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG. Let β={β1,,βJ},0<J<formulae-sequence𝛽subscript𝛽1subscript𝛽𝐽0𝐽\beta=\{\beta_{1},\dots,\beta_{J}\},0<J<\inftyitalic_β = { italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_β start_POSTSUBSCRIPT italic_J end_POSTSUBSCRIPT } , 0 < italic_J < ∞ be a random set of angles, where J=J(N,M0)𝐽𝐽𝑁subscript𝑀0J=J(N,M_{0})italic_J = italic_J ( italic_N , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) is an integer that depends on the sample size. Let β={β1,,βJ}superscript𝛽perpendicular-tosuperscriptsubscript𝛽1perpendicular-tosuperscriptsubscript𝛽𝐽perpendicular-to\beta^{\perp}=\{\beta_{1}^{\perp},\dots,\beta_{J}^{\perp}\}italic_β start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT = { italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT , … , italic_β start_POSTSUBSCRIPT italic_J end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT } be the set of angles orthogonal to β𝛽\betaitalic_β, where βj=βj+π/2superscriptsubscript𝛽𝑗perpendicular-tosubscript𝛽𝑗𝜋2\beta_{j}^{\perp}=\beta_{j}+\pi/2italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT = italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_π / 2, j=1,,Jfor-all𝑗1𝐽\forall j=1,\dots,J∀ italic_j = 1 , … , italic_J. The main idea is to estimate the minimum regularity associated to each βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and βjsuperscriptsubscript𝛽𝑗perpendicular-to\beta_{j}^{\perp}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, and compute the average difference between these two estimates, given by

ε¯^=1Jj=1Jε¯j,^¯𝜀1𝐽superscriptsubscript𝑗1𝐽subscript¯𝜀𝑗\widehat{\underline{\varepsilon}}=\frac{1}{J}\sum_{j=1}^{J}\underline{% \varepsilon}_{j},over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG = divide start_ARG 1 end_ARG start_ARG italic_J end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_J end_POSTSUPERSCRIPT under¯ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , (68)

where for all j=1,,J𝑗1𝐽j=1,\dots,Jitalic_j = 1 , … , italic_J, we have

ε¯j=|Hˇ𝐮(βj)Hˇ𝐮(βj)|, and Hˇ𝐮(β)=1#𝚫k=1K0H^𝐮(β)(Δk).formulae-sequencesubscript¯𝜀𝑗subscriptˇ𝐻𝐮subscript𝛽𝑗subscriptˇ𝐻𝐮superscriptsubscript𝛽𝑗perpendicular-to and subscriptˇ𝐻𝐮𝛽1#𝚫superscriptsubscript𝑘1subscript𝐾0subscript^𝐻𝐮𝛽subscriptΔ𝑘\underline{\varepsilon}_{j}=\left|\widecheck{H}_{\mathbf{u}(\beta_{j})}-% \widecheck{H}_{\mathbf{u}(\beta_{j}^{\perp})}\right|,\text{ and }\widecheck{H}% _{\mathbf{u}(\beta)}=\frac{1}{\#\mathbf{\Delta}}\sum_{k=1}^{K_{0}}\widehat{H}_% {\mathbf{u}(\beta)}(\Delta_{k}).under¯ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = | overroman_ˇ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT - overroman_ˇ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT | , and overroman_ˇ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG # bold_Δ end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (69)

The principle for this approach is rooted in Lemma 1, which states that under anisotropy, there exists only one maximising direction 𝐮(α)𝐮𝛼\mathbf{u}(\alpha)bold_u ( italic_α ). Thus if we take any other random direction 𝐯(β)𝐯𝛽\mathbf{v}(\beta)bold_v ( italic_β ), βα𝛽𝛼\beta\neq\alphaitalic_β ≠ italic_α, then the regularity that we will “catch" corresponds to the minimum one.

The quantity ε¯^^¯𝜀\widehat{\underline{\varepsilon}}over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG in (68) estimates the “irreducible error" that arises in estimating the worst regularity H¯¯𝐻\underline{H}under¯ start_ARG italic_H end_ARG by simply changing directions, exemplified by Proposition 2. In order to avoid always detecting anisotropy, this error needs to be taken into account, so that τ>ε¯𝜏¯𝜀\tau>\underline{\varepsilon}italic_τ > under¯ start_ARG italic_ε end_ARG. Averaging the estimated regularities over a grid of 𝚫𝚫\mathbf{\Delta}bold_Δ’s provides added robustness since H^𝐮(β)(Δk)subscript^𝐻𝐮𝛽subscriptΔ𝑘\widehat{H}_{\mathbf{u}(\beta)}(\Delta_{k})over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( italic_β ) end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) is no longer simply an auxiliary quantity.

There are two reasons for computing the ε¯jsubscript¯𝜀𝑗\underline{\varepsilon}_{j}under¯ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT’s using different groups of estimates in β𝛽\betaitalic_β and βsuperscript𝛽perpendicular-to\beta^{\perp}italic_β start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. The first is to preserve the independence between summands when computing ε¯^^¯𝜀\widehat{\underline{\varepsilon}}over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG in (68). The second is to ensure that the angles are sufficiently well separated to ensure that the difference in the estimates are not driven by the proximity of angles; see Proposition 2. The threshold is then set to

τ=ε¯^+exp(log(M0)ξ),\tau=\widehat{\underline{\varepsilon}}+\exp(-\log(M_{0})^{\xi}),italic_τ = over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG + roman_exp ( - roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_ξ end_POSTSUPERSCRIPT ) , (70)

for some 0<ξ<00𝜉00<\xi<00 < italic_ξ < 0. The extra exp(log(M0)ξ)\exp(-\log(M_{0})^{\xi})roman_exp ( - roman_log ( italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_ξ end_POSTSUPERSCRIPT ) term in (70) converges to zero slower than the rate of convergence of H¯^^¯𝐻\widehat{\underline{H}}over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG, and ensures that the strict inequality ε¯<τ<ε¯¯𝜀𝜏¯𝜀\underline{\varepsilon}<\tau<\overline{\varepsilon}under¯ start_ARG italic_ε end_ARG < italic_τ < over¯ start_ARG italic_ε end_ARG is preserved for any M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT large enough. Algorithm 2 provides a summary of the anisotropic detection procedure.

Algorithm 2 Anisotropic Detection
1:Data (Y(j)(𝐭m),𝐭m)superscript𝑌𝑗subscript𝐭𝑚subscript𝐭𝑚(Y^{(j)}(\mathbf{t}_{m}),\mathbf{t}_{m})( italic_Y start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ( bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , bold_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ), Grid 𝒯~={𝐭𝟏,,𝐭𝐩}~𝒯subscript𝐭1subscript𝐭𝐩\widetilde{\mathcal{T}}=\left\{\mathbf{t_{1}},\dots,\mathbf{t_{p}}\right\}over~ start_ARG caligraphic_T end_ARG = { bold_t start_POSTSUBSCRIPT bold_1 end_POSTSUBSCRIPT , … , bold_t start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT }; Integer J𝐽Jitalic_J; Estimated Angle α^^^^𝛼\widehat{\widehat{\alpha}}over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG Initialize H𝐻H\leftarrow\emptysetitalic_H ← ∅;
2:for j=1,J𝑗1𝐽j=1,\dots Jitalic_j = 1 , … italic_J do
3:     Sample βjUnif([α^^+π/4,α^^+3π/4])similar-tosubscript𝛽𝑗Unif^^𝛼𝜋4^^𝛼3𝜋4\beta_{j}\sim\text{Unif}([\widehat{\widehat{\alpha}}+\pi/4,\widehat{\widehat{% \alpha}}+3\pi/4])italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∼ Unif ( [ over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG + italic_π / 4 , over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG + 3 italic_π / 4 ] );
4:     Estimate H^j=H^𝐯(βj)subscript^𝐻𝑗subscript^𝐻𝐯subscript𝛽𝑗\widehat{H}_{j}=\widehat{H}_{\mathbf{v}(\beta_{j})}over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_v ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT according to (22);
5:     HH^jH𝐻subscript^𝐻𝑗𝐻H\leftarrow\widehat{H}_{j}\bigcup Hitalic_H ← over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋃ italic_H;
6:end for
7:Compute ε¯^^¯𝜀\widehat{\underline{\varepsilon}}over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG according to (68);
8:Compute τ𝜏\tauitalic_τ according to (70);
9:return 𝟙𝒜(τ)subscript1𝒜𝜏\mathbbm{1}_{\mathcal{A}(\tau)}blackboard_1 start_POSTSUBSCRIPT caligraphic_A ( italic_τ ) end_POSTSUBSCRIPT; \triangleright 1 indicates anisotropy, isotropy otherwise

In theory, the set of angles β𝛽\mathbf{\beta}italic_β can be chosen randomly, as long as β±α^^𝛽plus-or-minus^^𝛼\mathbf{\beta}\neq\pm\widehat{\widehat{\alpha}}italic_β ≠ ± over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG. However, in order to avoid any “continuity" issues, each βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT should be sufficiently far away from α^^^^𝛼\widehat{\widehat{\alpha}}over^ start_ARG over^ start_ARG italic_α end_ARG end_ARG. We thus suggest to sample the angles from a uniform distribution, as seen in Algorithm 2. The power ξ𝜉\xiitalic_ξ in (70), which governs the rate of convergence, only needs to be ξ(0,1)𝜉01\xi\in(0,1)italic_ξ ∈ ( 0 , 1 ) in theory. Following Golovkine et al., (2023), we choose ξ=1/3𝜉13\xi=1/3italic_ξ = 1 / 3, a value that seems to work well in practice. The integer J(N,M0)𝐽𝑁subscript𝑀0J(N,M_{0})italic_J ( italic_N , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) should be increasing with N𝑁Nitalic_N and M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, so that ε¯^^¯𝜀\widehat{\underline{\varepsilon}}over^ start_ARG under¯ start_ARG italic_ε end_ARG end_ARG eventually converges. In view of computational efficiency, we suggest to select J(N,M0)=(N×M0)1/4𝐽𝑁subscript𝑀0superscript𝑁subscript𝑀014J(N,M_{0})=\lceil(N\times M_{0})^{1/4}\rceilitalic_J ( italic_N , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ⌈ ( italic_N × italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT ⌉, which is aligned with the rate of convergence of H^^𝐻\widehat{H}over^ start_ARG italic_H end_ARG and also works well in practice.

We establish the consistency of Algorithm 2 in the following proposition.

Proposition 6.

Let H¯^=H^𝐮(α^)^¯𝐻subscript^𝐻𝐮^𝛼\widehat{\overline{H}}=\widehat{H}_{\mathbf{u}(\widehat{\alpha})}over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG = over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( over^ start_ARG italic_α end_ARG ) end_POSTSUBSCRIPT, H¯^=H^𝐮(α^+π/2)^¯𝐻subscript^𝐻𝐮^𝛼𝜋2\widehat{\underline{H}}=\widehat{H}_{\mathbf{u}(\widehat{\alpha}+\pi/2)}over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG = over^ start_ARG italic_H end_ARG start_POSTSUBSCRIPT bold_u ( over^ start_ARG italic_α end_ARG + italic_π / 2 ) end_POSTSUBSCRIPT, and τ𝜏\tauitalic_τ be defined as in Algorithm 2. Suppose that the assumptions of Corollary 1 hold true. Then Algorithm 2 is consistent, in the sense that

limN,M0(|H¯^H¯^|>τ,H¯H¯)=1,subscript𝑁subscript𝑀0formulae-sequence^¯𝐻^¯𝐻𝜏¯𝐻¯𝐻1\lim_{N,M_{0}\rightarrow\infty}\mathbb{P}\left(\left|\widehat{\overline{H}}-% \widehat{\underline{H}}\right|>\tau,\underline{H}\neq\overline{H}\right)=1,roman_lim start_POSTSUBSCRIPT italic_N , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT → ∞ end_POSTSUBSCRIPT blackboard_P ( | over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG - over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG | > italic_τ , under¯ start_ARG italic_H end_ARG ≠ over¯ start_ARG italic_H end_ARG ) = 1 , (71)

and

limN,M0(|H¯^H¯^|>τ,H¯=H¯)=0.subscript𝑁subscript𝑀0formulae-sequence^¯𝐻^¯𝐻𝜏¯𝐻¯𝐻0\lim_{N,M_{0}\rightarrow\infty}\mathbb{P}\left(\left|\widehat{\overline{H}}-% \widehat{\underline{H}}\right|>\tau,\underline{H}=\overline{H}\right)=0.roman_lim start_POSTSUBSCRIPT italic_N , italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT → ∞ end_POSTSUBSCRIPT blackboard_P ( | over^ start_ARG over¯ start_ARG italic_H end_ARG end_ARG - over^ start_ARG under¯ start_ARG italic_H end_ARG end_ARG | > italic_τ , under¯ start_ARG italic_H end_ARG = over¯ start_ARG italic_H end_ARG ) = 0 . (72)

The proof can be found in the Supplementary Material.

Simulation results for the anisotropic detection procedure described in Section 7.2 can be seen in Table 1. The percentage column indicates the percentage of cases classified as anisotropic (𝟙𝒜=1subscript1𝒜1\mathbbm{1}_{\mathcal{A}}=1blackboard_1 start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT = 1). In the isotropic case, we achieve virtually perfect classification, even for relatively small values of M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. In the anisotropic case, we require either a sufficiently large number of points observed on each surface, or for the difference H¯H¯¯𝐻¯𝐻\overline{H}-\underline{H}over¯ start_ARG italic_H end_ARG - under¯ start_ARG italic_H end_ARG to be well-separated. When the difference in regularities is large enough, we can achieve almost perfect classification for M0=101×101subscript𝑀0101101M_{0}=101\times 101italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 101 × 101. In the context of regularity estimation and anisotropic detection, this is a relatively small number of points. For example, Richard, (2016) constructs a statistical test for isotropic detection, where the number of surfaces N𝑁Nitalic_N plays a more important role. In his simulations, N=6000𝑁6000N=6000italic_N = 6000, a much larger quantity.

Setup H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG σ𝜎\sigmaitalic_σ M0subscript𝑀0M_{0}italic_M start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT Percentage
Isotropic 0.5 0.1 51×51515151\times 5151 × 51 0
Isotropic 0.5 0.1 101×101101101101\times 101101 × 101 0
Anisotropic 0.8 0.1 51×51515151\times 5151 × 51 35.8
Anisotropic 0.8 0.1 101×101101101101\times 101101 × 101 42.4
Anisotropic 0.9 0.1 51×51515151\times 5151 × 51 80.6
Anisotropic 0.9 0.1 101×101101101101\times 101101 × 101 97
Table 1: Table showing the results of the anisotropic detection approach using thresholding. Percentage column indicates the percentage of the cases classified as anisotropic (i.e 𝟙𝒜=1subscript1𝒜1\mathbbm{1}_{\mathcal{A}}=1blackboard_1 start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT = 1) for the different setups.

Acknowledgements

We thank Valentin Patilea for a careful reading of this paper, and providing detailed suggestions and valuable feedback.

Funding

Sunny Wang gratefully acknowledge support from PIA EUR DIGISPORT project (ANR-18-EURE-0022).

Supplementary Material

In the supplement, we provide detailed proofs for the Propositions, Lemmas and Theorems in the main paper. Moreover, additional simulation results for different processes are available.

References

  • Ammous et al., (2024) Ammous, S., Dedecker, J., and Duval, C. (2024). Adaptive directional estimator of the density in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT for independent and mixing sequences. J. Multivariate Anal., 203:105332.
  • Azzimonti et al., (2015) Azzimonti, L., Sangalli, L. M., Secchi, P., Domanin, M., and Nobile, F. (2015). Blood flow velocity field estimation via spatial regression with PDE penalization. J. Amer. Statist. Assoc., 110(511):1057–1071.
  • Belloni et al., (2015) Belloni, A., Chernozhukov, V., Chetverikov, D., and Kato, K. (2015). Some new asymptotic theory for least squares series: Pointwise and uniform results. J. Econometrics, 186(2):345–366.
  • Bernardi et al., (2018) Bernardi, M. S., Carey, M., Ramsay, J. O., and Sangalli, L. M. (2018). Modeling spatial anisotropy via regression with partial differential regularization. J. Multivariate Anal., 167:15–30.
  • Coeurjolly and Porcu, (2018) Coeurjolly, J.-F. and Porcu, E. (2018). Fast and exact simulation of complex-valued stationary Gaussian processes through embedding circulant matrix. J. Comput. Graph. Statist., 27(2):278–290.
  • Davies and Hall, (1999) Davies, S. and Hall, P. (1999). Fractal analysis of surface roughness by using spatial data. J. R. Stat. Soc. Ser. B. Stat. Methodol., 61(1):3–37.
  • Dieker, (2004) Dieker, T. (2004). Simulation of fractional brownian motion.
  • Fan and Guerre, (2016) Fan, Y. and Guerre, E. (2016). Multivariate local polynomial estimators: Uniform boundary properties and asymptotic linear representation. In Essays in Honor of Aman Ullah, volume 36, pages 489–537. Emerald Group Publishing Limited.
  • Germain et al., (2003) Germain, C., Da Costa, J., Lavialle, O., and Baylou, P. (2003). Multiscale estimation of vector field anisotropy application to texture characterization. Signal Processing, 83(7):1487–1503.
  • Golovkine et al., (2022) Golovkine, S., Klutchnikoff, N., and Patilea, V. (2022). Learning the smoothness of noisy curves with application to online curve estimation. Electron. J. Stat., 16(1):1485–1560.
  • Golovkine et al., (2023) Golovkine, S., Klutchnikoff, N., and Patilea, V. (2023). Adaptive estimation of irregular mean and covariance functions. arxiv 2108.06507v2.
  • Herbin, (2006) Herbin, E. (2006). From N𝑁Nitalic_N parameter fractional Brownian motions to N𝑁Nitalic_N parameter multifractional Brownian motions. Rocky Mountain J. Math., 36(4):1249–1284.
  • Horváth and Kokoszka, (2012) Horváth, L. and Kokoszka, P. (2012). Inference for functional data with applications. Springer Science & Business Media.
  • Jain et al., (1997) Jain, A., Hong, L., and Bolle, R. (1997). On-line fingerprint verification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4):302–314.
  • Jiang, (2005) Jiang, X. (2005). On orientation and anisotropy estimation for online fingerprint authentication. IEEE Transactions on Signal Processing, 53(10):4038–4049.
  • Kassi et al., (2023) Kassi, O., Klutchnikoff, N., and Patilea, V. (2023). Learning the regularity of multivariate functional data.
  • Kokoszka and Reimherr, (2017) Kokoszka, P. and Reimherr, M. (2017). Introduction to functional data analysis. Texts in Statistical Science Series. CRC Press, Boca Raton, FL.
  • Lepski, (2015) Lepski, O. (2015). Adaptive estimation over anisotropic functional classes via oracle approach. Ann. Stat., 43(3):1178 – 1242.
  • Lepski and Rebelles, (2020) Lepski, O. V. and Rebelles, G. (2020). Structural adaptation in the density model. Math. Stat. Learn., 3(3-4):345–386.
  • Maissoro et al., (2024) Maissoro, H., Patilea, V., and Vimond, M. (2024). Adaptive estimation for weakly dependent functional times series.
  • Matheron, (1973) Matheron, G. (1973). The intrinsic random functions and their applications. Advances in Appl. Probability, 5:439–468.
  • Ramsay and Silverman, (2005) Ramsay, J. and Silverman, B. W. (2005). Functional Data Analysis. Springer Series in Statistics. Springer-Verlag, New York, 2 edition.
  • Ramsay, (2002) Ramsay, T. (2002). Spline smoothing over difficult regions. J. R. Stat. Soc. Ser. B Stat. Methodol., 64(2):307–319.
  • Richard, (2016) Richard, F. J. P. (2016). Tests of isotropy for rough textures of trended images. Statistica Sinica, 26(3):1279–1304.
  • Samarov and Tsybakov, (2004) Samarov, A. and Tsybakov, A. (2004). Nonparametric independent component analysis. Bernoulli, 10(4):565–582.
  • Sangalli et al., (2013) Sangalli, L. M., Ramsay, J. O., and Ramsay, T. O. (2013). Spatial spline regression models. J. R. Stat. Soc. Ser. B. Stat. Methodol., 75(4):681–703.
  • Shen and Hsing, (2020) Shen, J. and Hsing, T. (2020). Hurst function estimation. Ann. Stat., 48(2):838 – 862.
  • Stein, (2002) Stein, M. L. (2002). Fast and exact simulation of fractional Brownian surfaces. J. Comput. Graph. Statist., 11(3):587–599.
  • Wang et al., (2023) Wang, S., Patilea, V., and Klutchnikoff, N. (2023). Adaptive functional principal components analysis. arxiv 2306.16091.
  • Wood and Chan, (1994) Wood, A. T. A. and Chan, G. (1994). Simulation of stationary Gaussian processes in [0,1]dsuperscript01𝑑[0,1]^{d}[ 0 , 1 ] start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. J. Comput. Graph. Statist., 3(4):409–432.
  • Wood et al., (2008) Wood, S. N., Bravington, M. V., and Hedley, S. L. (2008). Soap film smoothing. J. R. Stat. Soc. Ser. B Stat. Methodol., 70(5):931–955.