# Method for predicting the best and worst in a set of non-unique solutions

Imported: 17 Feb '17 | Published: 28 Feb '12

Rebecca L. Saltzer, Christopher J. Finn, Robert G. Keys

USPTO - Utility Patents

## Abstract

Method for determining best and worst cases for values of model parameters such as porosity and shale volume fraction generated by non-unique matrix inversion of physical data such as seismic reflection amplitudes. The matrix is diagonalized, and then orthonormal basis vectors associated with insignificant diagonal elements are used to generate upper and lower bounds on the solution. Best and worst case solutions are determined as linear combinations of the null basis vectors, where the expansion coefficients are determined by making a best fit to the upper and lower bounds.

## Description

This application claims the benefit of U.S. Provisional Application No. 60/698,760 filed on Jul. 13, 2005.

### FIELD OF THE INVENTION

This invention relates generally to the field of geophysical modeling, although the invention has broader application. Specifically, the invention is a method for predicting best and worst solutions when model inversion yields non-unique solutions.

### BACKGROUND OF THE INVENTION

In the oil industry, it is common to be faced with a set of data from which one wishes to infer some sort of information of interest. It is also fairly common that such inverse problems are non-unique, that is, different solutions explain the data equally well. While it is straightforward to obtain a single solution that the user considers “most likely”, it is often desirable to know the “best” and “worst” case solutions that fit the data in addition to the “most likely” one, to adequately understand the risks of a given course of action. An example of this sort of problem in the oil industry is the prediction of the sand and porosity distribution in a reservoir where one would like to know the largest and smallest hydrocarbon volumes (i.e., the best and worst case scenarios) possible in addition to the “most likely”. An accurate understanding of the potential risks involved in draining a potential reservoir should reduce total costs (correctly sized platforms, optimal draining strategy, etc.).

A common method for determining alternative scenarios is to do forward simulations of many different models in which the variables deemed to affect the final result are chosen at random from some pre-defined distribution. The forward models are then compared to the observed data to see which of the various forward models match. A distribution of the parameters fitting the data is then extracted from the set of models that are deemed to fit the data well. From this distribution, a best and worst case can, in principle, be determined. This method is time-consuming because it requires a large number of forward models. In addition, it suffers from user bias in that the only models tried are the ones that the user has thought of or deemed relevant.

Another method is to take the most likely model and simply scale it up and down by some amount and call that the best and worst case. This method produces results that generally do not match the observed data in a forward modeling sense and are not necessarily the “best” and “worst” case answers.

What is needed is a method in which the best and worst case scenarios are obtained as mathematical solutions to the inverse problem.

### SUMMARY OF THE INVENTION

In one embodiment, the invention is a computer-implemented method for (see box 1 of the flowchart of FIG. 1) determining the largest and smallest of the possible solutions of a matrix equation that can be expressed in the form

$[ G ] ⁡ [ m 1 m 2 ⋮ m N ] = [ data ] ,$
where m1 . . . mN are physical parameters to be solved for and G is a matrix based on a model of a physical system that relates the mi to measured data, wherein the equation may be non-uniquely inverted by numerical methods yielding an infinite number of possible solutions all of which fit the data substantially equally well and from which a most likely solution can be determined, said method comprising (a) finding (step 2 in FIG. 1) orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G; (b) selecting (step 3) a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation; (c) identifying (step 4) from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements; (d) choosing (step 5) an LP mathematical norm where pε[0, ∞]; (e) determining (step 6) an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums; (f) finding (step 7) a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and (g) adding (step 8) each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results.

The invention will be described in connection with its preferred embodiments. However, to the extent that the following description is specific to a particular embodiment or a particular use of the invention, this is intended to be illustrative only, and is not to be construed as limiting the scope of the invention. On the contrary, it is intended to cover all alternatives, modifications and equivalents that may be included within the spirit and scope of the invention, as defined by the appended claims.

### DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention is a method for obtaining the “best” and “worst” case solutions by solving a system of equations that relate the observations (data) to parameters of interest. For the lithology prediction problem mentioned above, these equations may, for example, include the convolutional equation, the Aki & Richards (1980) reflectivity equation, and a linearized, rock physics equation. (See U.S. patent application filed Jun. 24, 2005 by Saltzer, Finn and Lu.) In matrix notation, they take the following form:

$[ G ] ⁡ [ ϕ vsh ] = [ data ] ( 1 )$
where φ and vsh are the porosity and vshale (shale volume fraction) values as a function of time, data are the seismic traces associated with different source receiver apertures and G is a matrix that relates the model parameters (vshale and porosity in this example application) to the data parameters, typically seismic reflection data. However, the invention may be applied to any physical system where a physical model exists to provide a G matrix that relates model parameters m to the measured or otherwise obtained data, in which general case eqn. (1) can be written as Gm=Data. The G matrix may be partioned into two pieces: a first region characterized by sensitivity of the data to the model parameters and a second region with little sensitivity. These two regions are found by defining orthonormal bases that diagonalize the G matrix. Once these bases have been found, a cut-off value is chosen below which the elements of the diagonalized G matrix are insignificant. The orthonormal vectors associated with these insignificant components of the diagonalized G matrix are the “null” vectors. Thus if the basis vectors are uk and νk, they can be used to construct matrices U and V such that G=USV′ where S has non-zero elements only on its diagonal. Persons familiar with linear algebra will know that the matrix S, which may be called the diagonalized G matrix, can be found. Typically, G and hence S will not be a square matrix, but the elements Gi,j and Si,j are considered to be diagonal elements when i=j. If the diagonal elements of S are called λ1, λ2, . . . λN, and if λk is below the threshold selected for significance, then νk is a null vector. Mathematically, the null vectors correspond to the
Gm=0  (2)
solutions (as stated above, m is a column matrix or vector whose components are φ and vsh values in the embodiment represented by eqn. (1)). Consequently, they can be added to the “most likely” solution without changing the fit of that model to the measured data, because they do not project onto the data space. An underlying theory of the present invention is that the infinite number of solutions that fit the data almost equally well due to the non-uniqueness of the solution can be regarded as perturbations of the most likely solution, and that the perturbations are driven by the different possible linear combinations of the null vectors that can be constructed. This follows from the fact that the null vectors are a basis set of vectors spanning a portion of the relevant space. Thus, the part of the equation that does not affect the most likely solution in fact causes the differences between any given solution and the most likely solution.

The problem then becomes one of finding the combination of null vectors ({right arrow over (ν)}k) that will yield the greatest porosity and sand volumes ({right arrow over (m)}biggest). This can be expressed mathematically as

$m → biggest = ∑ k ⁢ α k ⁢ v → k null ( 3 )$
where (αk) is a vector of coefficients that weight the relative importance of each null vector. If the best possible model ({right arrow over (m)}biggest) of the subsurface is known a priori, it is a simple matter to obtain a least squares solution of the appropriate weighting factors (αk). The more difficult part is determining the upper bound on the porosity and vshale perturbations (i.e., knowing what model of {right arrow over (m)}biggest to solve for).

The upper bound of possible perturbations ({right arrow over (m)}biggest) can be determined by summing the elements or components of the orthonormal basis vectors that are associated with the portion of the diagonalized G matrix that have been determined to be insignificant (the null vectors νk). This summing operation is done according to the particular norm chosen to apply. An L1 norm is computed with the absolute values of the of the elements of the relevant orthonormal vectors

$m ⇀ biggest = ∑  v → k  ( 4 )$
(i.e. the absolute value of the first element of each null vector is added to make the first element of the perturbation vector and the absolute value of the second element of each null vector is added to make the second element, etc.) whereas an L2 norm is the computed using the square of the same elements. An L3 norm is computed using the cube of the absolute values of the components and so on for pε[0, ∞]. An L norm would use the maximum, absolute value of the same elements (i.e., the maximum of the first element of each null vector is taken as the first element of the perturbation vector and the maximum of the second element of each vector is taken as the second element, etc.). The lower bound is the negative of the sum computed for the upper bound. Next, one solves eqn. (3) for the combination of null vectors that most closely approaches that upper (lower) bound and finishes by scaling the resulting perturbation vector by a constant and adding that result to the most likely solution. Persons skilled in the art will know methods for finding a most likely, or best guess, solution to matrix equations of the form of eqn. (1). For example, Menke describes standard inversion methods in Geophysical Data Analysis: Discrete Inverse Theory, Academic Press (1984). The scaling serves to prevent unphysical results, and preferably is performed in increments in iterative fashion until just enough scaling has been applied to prevent physically unreal values of the parameters mi. A priori information may favor stronger scaling in particular instances.

This method is applicable to any problem where an appropriate physical model can be used to describe the relationship between what is observed and what is to be inferred. For example, if the differences in AVO behavior observed over time (time-lapse seismic) can be related to changes in pressure and water saturation in the reservoir, then the null space method can be used to solve for the “best” and “worst” case scenarios possible, given the observed differences between the seismic data. Another possible application is production history data from which the best and worst case reservoir permeability might be inferred. Typically well logs are processed and a single best answer (e.g., the vshale log) is produced. However, this null space methodology could be used in a joint inversion of different well log data for some property of interest (e.g. permeability, water saturation, etc.) to produce the best and worst case logs possible, given whatever data was actually recorded. In constructing the G matrix, anisotropy terms can be included in the reflectivity and rock physics equations when determining the porosity and sand distribution in a reservoir. Alternatively, the equations can be parameterized in terms of other properties of interest (e.g., elastic properties such as impedances or velocities).

In some embodiments of the invention, a singular value decomposition (SVD) is used to decompose the G matrix into the two subspaces of interest. Such a decomposition is described in various textbooks including G. Strang, Introduction to Linear Algebra, Wellesley-Cambridge Press, 321-333 (1993); C. Lanczos, Linear Differential Operators, Van Nostrand, (1961); the previously cited book by Menke; and Press, et al., Numerical Recipes in C: the art of scientific computing, Cambridge University Press, 1992. This operation produces two orthogonal matrices, U and V, that contain the data and model eigenvectors and a third matrix, S, that contains the singular values (eigenvalues). Then the following steps are performed:

• 1. sum the absolute values of the components of the null space eigenvalues (an L1 norm choice). This sum is made across corresponding elements of every eigenvector (along the rows of the eigenvector matrix) and is equivalent to determining an upper bound on the total perturbations that can be wrung from the data. The lower bound is the negative of this sum.
• 2. choose a shale cut-off value. Elements of the perturbation vector corresponding to values above this cut-off in the most likely solution are excluded from step 3.
• 3. solve for the combination of eigenvectors that most closely approaches that upper (lower) bound.
• 4. scale the resulting perturbation vector by a constant and add that result to the most likely solution. In the lithology problem, the scaling constant is chosen such that the final porosity and vshale values obtained are physically real (neither porosity or vshale is less than 0% and vshale is not greater than 100% and porosity is not greater than 40%).
The commercial software product “Matlab” (www.mathworks.com) readily decomposes, using singular value decomposition, a given matrix G into two orthonormal matrices U and V and a diagonal matrix S, i.e., G=USV′. S contains the singular values (the eigenvalues of G′G), U contains the orthonormal basis vectors that are associated with the data space, and V contains the orthonormal basis vectors that are associated with the model space. (Recall that G relates the model parameters to the data parameters.) The book by Press, et al., contains a source code listing of a similar computer program. Typically, an SVD algorithm will generate an S matrix having its diagonal elements ordered from largest to smallest. It is then a simple matter to select a cutoff value and to select the basis vectors from the V matrix associated with the values below the cutoff. Those vectors will appear in V as consecutive columns, and one simply sums the corresponding components of these column vectors using whatever norm has been selected. The invention will work with any norm.

### Example

The present inventive method was applied to some seismic data acquired over a potential oil field. FIGS. 2A-C show a 3-D image on a black background of, respectively, the “worst,” “most-likely,” and “best” inferred (1—vshale) sand channel winding through a inverted vshale volume (the shaly parts have been made invisible). The estimates of total reserves carried in the “best” and “worst” case scenarios varies by almost a factor of two. In addition, the difference in connectedness between the sand bodies in the “worst” and “best” cases are significantly different and imply different draining strategies that might be optimal for this field.

The foregoing application is directed to particular embodiments of the present invention for the purpose of illustrating it. It will be apparent, however, to one skilled in the art, that many modifications and variations to the embodiments described herein are possible. All such modifications and variations are intended to be within the scope of the present invention, as defined in the appended claims.

## Claims

1. In the problem of inverting a matrix equation that can be expressed in the form
$[ G ] ⁡ [ m 1 m 2 ⋮ m N ] = [ data ] ,$
where m1 . . . mN are physical parameters to be solved for and G is a matrix based on a model of a physical system that relates the mi to measured data, expressed as components of vector [data] in the equation, wherein the equation may be non-uniquely inverted by numerical methods yielding an infinite number of possible solutions all of which fit the data substantially equally well and from which a most likely solution can be determined, a computer implemented method for determining the largest and smallest of said possible solutions, said method comprising:
(a) finding orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G;
(b) selecting a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation;
(c) identifying from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements;
(d) choosing an LP mathematical norm where pε[0, ∞];
(e) determining an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums;
(f) finding a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and
(g) adding each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results;
wherein at least one of (a), (c), (e), and (f) is performed using a computer.
(a) finding orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G;
(b) selecting a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation;
(c) identifying from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements;
(d) choosing an LP mathematical norm where pε[0, ∞];
(e) determining an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums;
(f) finding a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and
(g) adding each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results;
2. The method of claim 1, wherein the mi are values of porosity and shale volume fraction, and the data are seismic amplitude values.
3. The method of claim 1, wherein singular value decomposition is used to find the orthonormal basis vectors and to diagonalize the G matrix.
4. The method of claim 3, wherein G is a non-square matrix, and said orthonormal basis vectors are two different sets of basis vectors, one set associated with the data vector's space and the other set associated with the m vector's space, and wherein the null vectors come from the basis vectors associated with the m vector's space.
5. The method of claim 1, wherein the absolute values of the corresponding components of the null vectors are summed, corresponding to choosing an L1 norm.
6. The method of claim 1, wherein finding said linear combination of null vectors is solving the equation
$m → biggest = ∑ k ⁢ α k ⁢ v → k null$
for weighting factors αk using a least squares method, where {right arrow over (ν)}knull is the kth null vector and {right arrow over (m)}biggest is a bound for possible m solutions.
7. A method of producing hydrocarbons from a subterranean region, comprising:
(a) obtaining measured data from a seismic survey of the subterranean region;
(b) obtaining a model of the subterranean region that relates physical parameters mi of the subsurface region to measured seismic data;
(c) obtaining a matrix equation of the form
$[ G ] ⁡ [ m 1 m 2 ⋮ m N ] = [ data ]$
relating the mi to the measured data, expressed as components of vector [data] in the equation, where G is a matrix based on said model;
(d) obtaining a largest solution and a smallest solution among possible non-unique solutions for the mi from inverting the matrix equation, said largest and smallest solutions having been obtained by:
(i) finding orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G;
(ii) selecting a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation;
(iii) identifying from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements;
(iv) choosing an LP mathematical norm where pε[0, ∞];
(v) determining an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums;
(vi) finding a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and
(vii) adding each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results;
wherein at least one of (i), (iii), (v), and (vi) is performed using a computer; and
(e) using said largest and smallest solutions to develop production of hydrocarbons from the subterranean region.
(a) obtaining measured data from a seismic survey of the subterranean region;
(b) obtaining a model of the subterranean region that relates physical parameters mi of the subsurface region to measured seismic data;
(c) obtaining a matrix equation of the form
(d) obtaining a largest solution and a smallest solution among possible non-unique solutions for the mi from inverting the matrix equation, said largest and smallest solutions having been obtained by:
(i) finding orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G;
(ii) selecting a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation;
(iii) identifying from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements;
(iv) choosing an LP mathematical norm where pε[0, ∞];
(v) determining an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums;
(vi) finding a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and
(vii) adding each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results;
wherein at least one of (i), (iii), (v), and (vi) is performed using a computer; and
(e) using said largest and smallest solutions to develop production of hydrocarbons from the subterranean region.
(i) finding orthonormal basis vectors that diagonalize the G matrix, and using said vectors to diagonalize G;
(ii) selecting a threshold below which the values of the elements of the diagonalized G are considered insignificant in terms of their effect on the most likely solution to said matrix equation;
(iii) identifying from the orthonormal basis vectors those vectors (the “null” vectors) associated with the insignificant diagonal elements;
(iv) choosing an LP mathematical norm where pε[0, ∞];
(v) determining an upper and lower bound for possible solutions m1, m2 . . . mN by summing corresponding components of said null vectors according to the chosen norm, said lower bound being given by the negative of said sums;
(vi) finding a linear combination of the null vectors that most closely approaches said upper bound, and repeating this finding for said lower bound; and
(vii) adding each of said two linear combinations in turn to the most likely solution to yield a largest and a smallest solution, scaling as needed to eliminate any physically unreal results;
8. The method of claim 7, wherein the mi are values of porosity and shale volume fraction, and the data are seismic amplitude values.