|
Calling Sequence
|
|
ArrayInterpolation(xdata, ydata, xvalues, options)
ArrayInterpolation(xydata, xvalues, options)
|
|
Parameters
|
|
xdata
|
-
|
a list, Array, DataFrame, DataSeries, Vector, or Matrix containing the independent coordinate(s) of each of the data points, given in one of several possible forms
|
ydata
|
-
|
a list, Array, DataFrame, DataSeries, or Vector containing the dependent coordinate of each of the data points
|
xydata
|
-
|
alternate input; a list, Array, DataFrame, or Matrix containing both the dependent and independent coordinates of each of the data points
|
xvalues
|
-
|
a numeric value, list, Vector, or Array containing the independent coordinate(s) of one or more points whose dependent coordinate will be approximated using interpolation
|
options
|
-
|
(optional) equation(s) of the form keyword = value, where keyword is one of method, degree, endpoints, knots, uniform, verify, extrapolate, or container.
|
|
|
|
|
Description
|
|
•
|
The ArrayInterpolation command takes a finite set of distinct data points given by xdata and ydata (or xydata), and interpolates to approximate the y-values corresponding to the points given in xvalues. It considers an interpolant function such that for all respective pairs in xdata and ydata (or xydata). Such a function can be constructed using one of various methods (see below). It then computes and returns for all in xvalues.
|
•
|
The ArrayInterpolation function can interpolate numeric data in dimensions, where is any positive integer.
|
•
|
The list of independent coordinates of the data points, given in xdata, can be input in a number of different ways. xdata can be:
|
–
|
(preferred if ) a Vector, DataSeries, list, or one-dimensional Array of strictly increasing x-coordinates. The data set will then have size , where is the length of xdata.
|
–
|
(preferred if ) a list of Vectors, lists, or one-dimensional Arrays, one for each dimension of the data. The th Vector, list, or Array in the input must contain, in increasing order, all of the possible th coordinates of the data points. In this case, the block of data points will be assumed to lie on an by by ... by grid, where is the length of the th Vector or Array in the input. The th coordinate of the data point at index (where ) will be equal to the th element of the th Array in the input.
|
–
|
a list of Arrays of size by by ... by , where the th array contains the th independent coordinate of each of the by by ... by data points. The coordinates must form a proper "grid" of values, and must be sorted in strictly increasing order along each dimension. More formally, must be zero if , and must be positive if .
|
|
The preferred methods minimize memory usage and execution time by avoiding unnecessary storage and verification of redundant data. In all cases, xdata must contain real values of type numeric.
|
•
|
The list of dependent coordinates of the data points, given in ydata, must be input as an Array (or a Matrix, Vector, or list for appropriate values of ) of size by by ... by , so that the value of corresponds to the element in xdata of index . Values in ydata may be complex, but must be of type complex(extended_numeric). The output will not contain complex values unless ydata does.
|
•
|
As an alternate form of input, a single structure xydata containing all coordinates of the data points can be entered. It can be formatted in one of the following ways:
|
–
|
an Array, DataFrame, or Matrix of size by by ... by by (), giving the independent and dependent coordinate(s) of each of by by ... by data points as an ordered ()-tuple. The first n elements in each ()-tuple represent the independent coordinates of each point, and must adhere to the same restrictions as above (a proper "grid" must be formed, and the independent coordinates must be sorted in strictly increasing order along each dimension). The st coordinate in each ()-tuple then represents the dependent coordinate of the respective data point.
|
–
|
a list of Arrays, Vectors, Matrices, or lists of size by by ... by , where the th array contains the th independent coordinate of each of the by by ... by data points for , and the st Array contains the dependent coordinates of each point. As above, the independent coordinates must adhere to certain restrictions (a proper "grid" must be formed, and the independent coordinates must be sorted in strictly increasing order along each dimension).
|
|
For multidimensional data, these methods are not recommended, since space is wasted storing the full grid of independent coordinates instead of a list of all the possible coordinates in each dimension. In both cases, the values representing independent coordinates must be of type numeric, but the values representing dependent coordinates can be of type complex(extended_numeric).
|
•
|
The list of values to interpolate at, given in xvalues, may be input in one of the following formats:
|
–
|
for one-dimensional data, a single numeric value, or a Vector, list, or one-dimensional Array of numeric values can be input. The output will be returned in a format matching the format of the input.
|
–
|
for multidimensional data, an Array or Matrix of size by by ... by by of numeric values can be input. It must contain the coordinates of each of by by ... by values to interpolate at, with the value of giving the th coordinate of the respective point. The output will be returned in an array of size by by ... by containing the interpolated results.
|
–
|
alternatively, a list of Vectors, lists, or one-dimensional Arrays can be input. The th Vector, list, or Array in the input will be assumed to contain all of the possible th coordinates of the values to interpolate at. In this case, interpolation will be performed on an by by ... by block of points, where is the length of the th Vector or Array in the input. The output will then be returned in a Vector, Matrix, list, or Array of size by by ... by .
|
•
|
If any of the data points in xvalues lie outside the rectangular bounding box specified by the input, then extrapolation will be performed to approximate their corresponding y-values. The method by which extrapolation is performed can be controlled by using option extrapolate; see below.
|
•
|
This routine has separate numeric methods for handling hardware and software floats. The decision about which routine to use can be controlled by setting the UseHardwareFloats environment variable. If UseHardwareFloats remains unset, then hardware floats are used if and only if , in which case all software floats in the input will be converted to hardware floats.
|
•
|
Only computations involving numeric floating-point data are supported by this routine. If the input does not contain floating-point data, an error will be thrown.
|
•
|
For optimal performance, all rtables in the input should be Fortran order with rectangular storage (the default). Otherwise, a conversion will take place. All rtables in the output will be Fortran order rtables with rectangular storage.
|
•
|
This function is part of the CurveFitting package, so it can be used in the short form ArrayInterpolation(..) only after executing the command with(CurveFitting). However, it can always be accessed through the long form of the command by using CurveFitting[ArrayInterpolation](..).
|
|
|
Options
|
|
•
|
If the option method = <name> is given, then one of the following interpolation methods are used to compute the interpolant and evaluate for each point in xvalues:
|
–
|
method = nearest: Perform nearest neighbor interpolation. Given a point in xvalues, is defined to be , where is the data point such that the Euclidean distance is minimized.
|
–
|
method = lowest: Perform lowest neighbor interpolation. Given a point in xvalues, is defined to be , where is the data point such that is non-negative in all coordinates, but the Euclidean distance is minimized.
|
–
|
method = highest: Perform highest neighbor interpolation. Given a point in xvalues, is defined to be , where is the data point such that is non-negative in all coordinates, but the Euclidean distance is minimized.
|
–
|
method = linear: Perform n-dimensional linear interpolation (lerping). In the one-dimensional case, is a piecewise-linear function passing through each data point in the input. In the multidimensional case, is the tensor product of such piecewise linear functions, one for each dimension. is computed by performing linear interpolation along the first dimension, then along the second dimension, and so on.
|
–
|
method = cubic: Perform piecewise cubic Hermite interpolation. In the 1-dimensional case, is a piecewise-cubic function passing through each data point in the input. In this case, = if lies in the interval , where each is a cubic polynomial such that and for all data points in the input (where ranges from to ). The coefficients of the functions are determined locally by assigning slopes to each data point and solving for the unique cubic function determined by the additional constraints that and. This forces to be continuously differentiable (). The themselves are computed using Bessel's method: is the slope at of the parabola passing through , , and . In the multidimensional case, is the tensor product of such spline functions, one for each dimension.
|
–
|
method = spline: Perform spline interpolation. By default, natural cubic spline interpolation is used. In the 1-dimensional case, is a piecewise-cubic function passing through each data point in the input. In this case, = if lies in the interval , where each is a cubic polynomial such that and for all data points in the input (where ranges from to ). The coefficients of the functions are selected such that is twice continuously differentiable (), that is, and . In addition, the "natural" condition of the spline specifies that and . In the multidimensional case, is the tensor product of such spline functions, one for each dimension. Using method=spline will produce a smoother interpolant than method=cubic ( instead of ), but is more expensive to set up and more prone to numerical instability because each segment of the spline is determined globally by the positions of all other points in the data set.
|
|
method=linear is used by default.
|
•
|
If the options degree=d and endpoints=e are given, where d is a positive integer and e is one of natural, notaknot, or periodic, then spline interpolation will be performed using the provided degree and endpoint conditions. See Spline Continuity and End Conditions for details. These options only affect the result if method=spline is used. In the multidimensional case, the same degree and endpoint conditions are used for the splines generated in each dimension. The defaults are degree=3 and endpoints=natural, in which case natural cubic spline interpolation will be performed.
|
•
|
If splines of an even degree are being used, the option knots=data forces the use of a spline function where the spline knots are positioned on the nodes. See Spline Continuity and End Conditions for details. The default method, knots=default, defines the spline knots at the midpoints of the nodes when even degree splines are used. This option has no effect when other methods are used.
|
•
|
If the option uniform=true is given, then ArrayInterpolation assumes that the data points are sampled over a grid of uniformly spaced points in each dimension. In other words, if is the th possible coordinate in the th dimension, then is assumed to be constant over all possible , given any fixed value of . This gives a considerable speedup when the input contains uniform data, since it allows ArrayInterpolation to use a fast lookup algorithm when evaluating the interpolant at the specified points. The default is uniform=false, in which case ArrayInterpolation uses a slower but more general binary search algorithm to perform interpolation. Using the uniform=true option with non-uniform data may produce incorrect results.
|
•
|
If the option verify=false is given, then ArrayInterpolation skips the various checks it performs to ensure correctly formatted input. This can decrease the time required to solve large problems, but will prevent the function from detecting any errors in the input. If the input is improperly sorted, contains Arrays indexed from values other than 1, contains non-rectangular or C order rtables, or is otherwise formatted incorrectly, ArrayInterpolation may return incorrect results or throw an unexpected error.
|
•
|
If the option extrapolate=e is given, where e is of type extended_numeric or truefalse, then one of the following possible extrapolation methods will be used to compute if lies outside the bounding box specified by the input:
|
–
|
extrapolate = true: Perform extrapolation using the closest valid branch of the interpolating function. In the case of method=lowest and method=highest, this is not be defined for some points, in which case undefined will be returned.
|
–
|
extrapolate = false: Do not extrapolate. An error will be thrown if any point in xvalues lies outside the bounding box specified by the input.
|
–
|
extrapolate = e, where e is of type extended_numeric : Define to be e if lies outside the bounding box specified by the input. e is commonly zero or undefined.
|
|
extrapolate=true is used by default.
|
•
|
If the option container=c is given, where c is an appropriately sized and typed rtable, then the result is written into c. c must be of the correct size and datatype to match the output of the routine. If ydata (and thus c) have only real values, then with this option, no additional memory is allocated to store the result; this is a programmer-level feature that can be used to reduce memory usage and decrease the time spent by Maple's garbage collector. If nonreal values are present, then the output is still copied into c, but there are no memory savings. The default is container=false, in which case Maple creates and returns a new rtable containing the result.
|
|
|
Examples
|
|
An introductory example. Suppose a signal is sampled several times over a given interval of time:
ArrayInterpolation to resample the data at a higher sampling frequency:
Use a cubic spline to achieve a smoother, more realistic resampling of the data:
Try again, using a spline that assumes the data is sampled from a periodic waveform:
A two-dimensional example: a tiny grayscale image stored in a Matrix:
Upsample it to a larger image using bilinear interpolation:
Try again, using bicubic interpolation instead for a smoother fit:
A non-uniform multidimensional example. Create some 3-D mesh structures to pass through a given set of points defined by a mathematical function:
Define a non-uniform grid of points, and sample f over them:
Plot the data so far:
Create a finer mesh to interpolate over:
Linear interpolation produces a quick approximation to f:
Nearest-neighbor interpolation can also be used for quick lookup purposes:
Spline interpolation produces a smoother approximation to the original function f:
Increasing the degree of the spline approximation can increase the smoothness of the result, but results in a longer computation time, greater numerical instability, and can cause large oscillations around the edges of a data set:
Array interpolation of real x values and complex y values.
Plot of the points:
x points to interpolate over.
A complex multidimensional example. Create some 3-D mesh structures to pass through a given Set of points defined by a mathematical function:
Define a non-uniform grid of points, and sample f over them:
Create a finer mesh to interpolate over:
Linear interpolation produces a quick approximation to f:
Another format for the last example
A 5 dimensional example
Finally, a large example to illustrate a few tips for increasing the speed of computations:
On such a large one-dimensional example, a significant portion of the execution time is spent verifying the integrity of the input data. Disabling this verification will produce a significant speedup in the execution time of the routine, but will produce incorrect results if the input is not correctly formatted or sorted:
Asserting that the data is uniform allows a faster lookup method to be used:
Cubic interpolation takes longer than the default, linear method:
|
|
|