Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -0,0 +1,336 @@
<!--

@license Apache-2.0

Copyright (c) 2026 The Stdlib Authors.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

-->

# dcorrelation

> Calculate the [correlation distance][correlation-distance] between two double-precision floating-point strided arrays.

<section class="intro">

The [correlation distance][correlation-distance] between random variables `X` and `Y` is defined as

<!-- <equation class="equation" label="eq:sample_correlation_distance" align="center" raw="D(X, Y) = 1 - \frac{\displaystyle\sum_{i=0}^{N-1} (x_i - \bar{x})(y_i - \bar{y})}{\displaystyle\sqrt{\sum_{i=0}^{N-1} (x_i - \bar{x})^2} \sqrt{\sum_{i=0}^{N-1} (y_i - \bar{y})^2}}" alt="Equation for the sample correlation distance."> -->

```math
D(X, Y) = 1 - \frac{\displaystyle\sum_{i=0}^{N-1} (x_i - \bar{x})(y_i - \bar{y})}{\displaystyle\sqrt{\sum_{i=0}^{N-1} (x_i - \bar{x})^2} \sqrt{\sum_{i=0}^{N-1} (y_i - \bar{y})^2}}
```

<!-- </equation> -->

where `x_i` and `y_i` are the _ith_ components of vectors **X** and **Y**, respectively.

<!-- </equation> -->


</section>

<!-- /.intro -->

<section class="usage">

## Usage

```javascript
var dcorrelation = require( '@stdlib/stats/strided/distances/dcorrelation' );
```

#### dcorrelation( N, x, strideX, y, strideY )

Computes the [correlation distance][correlation-distance] of two double-precision floating-point strided arrays.

```javascript
var Float64Array = require( '@stdlib/array/float64' );

var x = new Float64Array( [ 1.0, -2.0, 2.0 ] );
var y = new Float64Array( [ 2.0, -2.0, 1.0 ] );

var c = dcorrelation( x.length, x, 1, y, 1 );
// returns ~0.115
```

The function has the following parameters:

- **N**: number of indexed elements.
- **x**: first input [`Float64Array`][@stdlib/array/float64].
- **strideX**: stride length for `x`.
- **y**: second input [`Float64Array`][@stdlib/array/float64].
- **strideY**: stride length for `y`.

The `N` and stride parameters determine which elements in the strided arrays are accessed at runtime. For example, to compute the [correlation distance][correlation-distance] of every other element in `x` and `y`,

```javascript
var Float64Array = require( '@stdlib/array/float64' );

var x = new Float64Array( [ 1.0, 2.0, 2.0, -7.0, -2.0, 3.0, 4.0, 2.0 ] );
var y = new Float64Array( [ 2.0, 1.0, 2.0, 1.0, -2.0, 2.0, 3.0, 4.0 ] );

var c = dcorrelation( 4, x, 2, y, 2 );
// returns ~0.053
```

Note that indexing is relative to the first index. To introduce an offset, use [`typed array`][mdn-typed-array] views.

<!-- eslint-disable stdlib/capitalized-comments -->

```javascript
var Float64Array = require( '@stdlib/array/float64' );

var x0 = new Float64Array( [ 2.0, 1.0, 2.0, -2.0, -2.0, 2.0, 3.0, 4.0 ] );
var y0 = new Float64Array( [ 2.0, -2.0, 2.0, 1.0, -2.0, 4.0, 3.0, 2.0 ] );

var x1 = new Float64Array( x0.buffer, x0.BYTES_PER_ELEMENT*1 ); // start at 2nd element
var y1 = new Float64Array( y0.buffer, y0.BYTES_PER_ELEMENT*1 ); // start at 2nd element

var c = dcorrelation( 4, x1, 2, y1, 2 );
// returns ~0.693
```

#### dcorrelation.ndarray( N, x, strideX, offsetX, y, strideY, offsetY )

Computes the [correlation distance][correlation-distance] of two double-precision floating-point strided arrays using alternative indexing semantics.

```javascript
var Float64Array = require( '@stdlib/array/float64' );

var x = new Float64Array( [ 1.0, -2.0, 2.0 ] );
var y = new Float64Array( [ 2.0, -2.0, 1.0 ] );

var c = dcorrelation.ndarray( x.length, x, 1, 0, y, 1, 0 );
// returns ~0.115
```

The function has the following additional parameters:

- **offsetX**: starting index for `x`.
- **offsetY**: starting index for `y`.

While [`typed array`][mdn-typed-array] views mandate a view offset based on the underlying buffer, the offset parameters support indexing semantics based on starting indices. For example, to calculate the [correlation distance][correlation-distance] for every other element in `x` and `y` starting from the second element

```javascript
var Float64Array = require( '@stdlib/array/float64' );

var x = new Float64Array( [ 2.0, 1.0, 2.0, -2.0, -2.0, 2.0, 3.0, 4.0 ] );
var y = new Float64Array( [ -7.0, 2.0, 2.0, 1.0, -2.0, 2.0, 3.0, 4.0 ] );

var c = dcorrelation.ndarray( 4, x, 2, 1, y, 2, 1 );
// returns ~0.073
```

</section>

<!-- /.usage -->

<section class="notes">

## Notes

- If `N <= 1`, both functions return `NaN`.
- If all values in either `x` or `y` are constant, the [correlation distance][correlation-distance] is not defined.

</section>

<!-- /.notes -->

<section class="examples">

## Examples

<!-- eslint no-undef: "error" -->

```javascript
var discreteUniform = require( '@stdlib/random/array/discrete-uniform' );
var dcorrelation = require( '@stdlib/stats/strided/distances/dcorrelation' );

var opts = {
'dtype': 'float64'
};
var x = discreteUniform( 10, -50, 50, opts );
console.log( x );

var y = discreteUniform( 10, -50, 50, opts );
console.log( y );

var c = dcorrelation( x.length, x, 1, y, 1 );
console.log( c );
```

</section>

<!-- /.examples -->

<!-- C interface documentation. -->

* * *

<section class="c">

## C APIs

<!-- Section to include introductory text. Make sure to keep an empty line after the intro `section` element and another before the `/section` close. -->

<section class="intro">

</section>

<!-- /.intro -->

<!-- C usage documentation. -->

<section class="usage">

### Usage

```c
#include "stdlib/stats/strided/distances/dcorrelation.h"
```

#### stdlib_strided_dcorrelation( N, \*X, strideX, \*Y, strideY )

Computes the [correlation distance][correlation-distance] between two double-precision floating-point strided arrays.

```c
const double x[] = { 1.0, -2.0, 2.0 };
const double y[] = { 2.0, -2.0, 1.0 };

double c = stdlib_strided_dcorrelation( 3, x, 1, y, 1 );
// returns ~0.115
```

The function accepts the following arguments:

- **N**: `[in] CBLAS_INT` number of indexed elements.
- **X**: `[in] double*` first input array.
- **strideX**: `[in] CBLAS_INT` stride length for `X`.
- **Y**: `[in] double*` second input array.
- **strideY**: `[in] CBLAS_INT` stride length for `Y`.

```c
double stdlib_strided_dcorrelation( const CBLAS_INT N, const double *X, const CBLAS_INT strideX, const double *Y, const CBLAS_INT strideY );
```

#### stdlib_strided_dcorrelation_ndarray( N, \*X, sx, ox, \*Y, sy, oy )

Computes the [correlation distance][correlation-distance] between two double-precision floating-point strided arrays using alternative indexing semantics.

```c
const double x[] = { 1.0, -2.0, 2.0 };
const double y[] = { 2.0, -2.0, 1.0 };

double c = stdlib_strided_dcorrelation_ndarray( 3, x, 1, 0, y, 1, 0 );
// returns ~0.115
```

The function accepts the following arguments:

- **N**: `[in] CBLAS_INT` number of indexed elements.
- **X**: `[in] double*` first input array.
- **strideX**: `[in] CBLAS_INT` stride length for `X`.
- **offsetX**: `[in] CBLAS_INT` starting index for `X`.
- **Y**: `[in] double*` second input array.
- **strideY**: `[in] CBLAS_INT` stride length for `Y`.
- **offsetY**: `[in] CBLAS_INT` starting index for `Y`.

```c
double stdlib_strided_dcorrelation_ndarray( const CBLAS_INT N, const double *X, const CBLAS_INT strideX, const CBLAS_INT offsetX, const double *Y, const CBLAS_INT strideY, const CBLAS_INT offsetY );
```

</section>

<!-- /.usage -->

<!-- C API usage notes. Make sure to keep an empty line after the `section` element and another before the `/section` close. -->

<section class="notes">

</section>

<!-- /.notes -->

<!-- C API usage examples. -->

<section class="examples">

### Examples

```c
#include "stdlib/stats/strided/distances/dcorrelation.h"
#include <stdio.h>

int main( void ) {
// Create strided arrays:
const double x[] = { 1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0 };
const double y[] = { 1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0 };

// Specify the number of elements:
const int N = 8;

// Specify strides:
const int strideX = 1;
const int strideY = -1;

// Compute the correlation distance between `x` and `y`:
double d = stdlib_strided_dcorrelation( N, x, strideX, y, strideY );

// Print the result:
printf( "Correlation Distance: %lf\n", d );

// Compute the correlation distance between `x` and `y` with offsets:
d = stdlib_strided_dcorrelation_ndarray( N, x, strideX, 0, y, strideY, N-1 );

// Print the result:
printf( "Correlation Distance: %lf\n", d );
}
```

</section>

<!-- /.examples -->

</section>

<!-- /.c -->

<section class="references">

</section>

<!-- /.references -->

<!-- Section for related `stdlib` packages. Do not manually edit this section, as it is automatically populated. -->

<section class="related">

</section>

<!-- /.related -->

<!-- Section for all links. Make sure to keep an empty line after the `section` element and another before the `/section` close. -->

<section class="links">

[correlation-distance]: https://en.wikipedia.org/wiki/Pearson_correlation_coefficient#Pearson's_distance

[@stdlib/array/float64]: https://github.com/stdlib-js/stdlib/tree/develop/lib/node_modules/%40stdlib/array/float64

[mdn-typed-array]: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/TypedArray

</section>

<!-- /.links -->
Loading