Probability Distributions of Phases I

Jan Brosius; Walter Brosius

doi:10.35248/1314-3344.24.14.226

Mini Review - (2024)Volume 14, Issue 3

View PDF Download PDF

Probability Distributions of Phases I

Jan Brosius^* and Walter Brosius

^*Correspondence: Jan Brosius, Department of Theoretical Chemistry, University of Valencia, Valencia, Spain, Email:

Author info »

Abstract

This article presents the mathematical foundation for calculating PD's (Probability Distributions) for some set of phases {ϕh} needed for the structure determination of a crystal. We can obtain PD's of the phases that can contain N or without N. A former paper could only obtain PD's of the phases containing N. Here we have the two possibilities.

Keywords

Random variable; Reciprocal vectors; Binomial distribution; Infinite number; Phase

Introduction

In a short review was given of the old probabilistic DM (Direct Methods) way for calculating phase distributions [1].

There were two mathematical approaches see (A) and (B) below (A): The basic R.V.’s (Random Variables) are the set of the Equation that are distributed independently and uniformly over the asymmetric unit (we consider in this paper only P1) and one studies the normalized structure factors And one calculates the probabilities of the phases (B): The basic R.V.’s are the reciprocal vectors h that are distributed uniformly and independently over reciprocal space and one keeps the X_iconstant. This method can give algebraic equations as follows: One can study the structure factors and we consider only h as the basic reciprocal vector and one keeps k fixed. The B₃,0 formula is an equation obtained this way. Although this equation gives the value of in theory, in practice this equation is wrong for high N, which is due to accidental overlap of the xi which invalidates the calculation of the joint probabilities of Even when one calculates the joint probabilities where h and k are the basic R.V.’s one must assume no accidental overlap of the xi (which becomes a problem for high N.). The calculation of joint probabilities gives then the same results as in (A) above.

(C): Using method (A) one can derive the probability of the cosine invariant Equation

It follows that this formula

Loses predictive power for high N.

Cannot predict negative cosines.

The probabilities of quartets, quintets, etc. are even worse since they are of order of 1/N (for quartets), of order Equation (for quintets), etc. (Although one can get a quartet formula that theoretically predicts negative cosines for the quartet (but again with too low predictive power)). At the end of the twentieth century nobody was busy anymore with calculating prob-abilistic phase distributions using one of the methods (A) or (B). For the calculations of structures with high N (N being here the number of independent non-H atoms in the asymmetric unit), one began to devise methods in direct space to solve crystal structures. One uses an automatic cyclical process: (a): Phase refinement (for instance with the use of the (modified) tangent formula) in reciprocal space and; (b): With the imposition in real space of physically meaningful constraints through an atomic interpretation of the electron density, with minimization of a well-chosen FOM (Figure Of Merit) of the phases. One of these methods in DM is known as the SnB (Shake and Bake) algorithm with N 1200 [2,3]; Another is the twin variables approach with Equation Sir2000 the successor of SIR97 and SIR99 although different from SnB: (e.g. triplet invariants via the P10 formula with Another interesting result is the solution of a crystal when a substructure is known where N may become higher [4-9]. For an overview of DM before the year 2000 we refer to Giacovazzo [10].

(D): In order to circumvent these problems one approach might be to consider R.V.’s (x_i)that are no longer independent neither uniformly distributed, say a dependence through a positive distribution Equation

One can give such distributions by using the functions Equation

But then one encounters insurmountable mathematical difficulties.

The solution is to not consider the x_i as R.V.’s anymore but to replace Equation by a field and to sample the field over the allowable function space. What we shall discuss here is a novel way for doing DM (Direct Methods).

(E): Differences with our approach

• We shall be able to solve any structure (any N) ab initio.

• Much lower CPU time.

• Let Equation then with our approach we can easily calculate the probability distribution of for any h. No need to compute all possible triplets.

• Easy to incorporate any given substructure.

• Easy to calculate the PD’s (Probability Distributions) of phases: One only needs to take derivatives.

In this paper we shall give the mathematical basis that is necessary for this completely new DM approach. This approach is not mathematically as simple as in (A) and (B) but it is perfectly doable. It consists in using the atomic distribution function (x) as the basic random variable. The method will also be based on a functional integration over the random variable and using a nonstandard fuzzy approach wherein Dirac delta functions (among which a novel delta function representation for angle variables) are replaced by nonstandard fuzzy delta functions. To show the strength of the method, a simple formula was given in Brosius for the distribution of the triplet phase formula of the form Equation

Where A is a function depending (not on N!) on the structure factors of the first neighborhood of the triplet [1].

In this paper a more profound mathematical foundation of our DM approach is given and this will be a major improvement compared to Brosius [1]. Recall that the sampling is done over positive functions Equation (in the space group P1) and that the R.V.’s that we study are the phases which are defined by the relation Where is a R.V. defined by

Equation

and from now on we shall use the notations

Equation

One then needs to define a probability density Equation on the sample space ρ's We build up by fuzzy Dirac delta functions in 4 steps

Through constraints of the form Equation by using fuzzy Dirac delta’s (ε_a positive infinitesimal).

Next through maximization: Adding obvious terms to Equation where that cannot be added by using a constraint, like e.g. the term.

Eventually we add fermionic terms to z, like e.g.

Equation

By imposing the mathematical requirement on the basic R.V. ρ that the different atoms in the unit cell of the crystal repel each other.

The idea is that if one would consider a function Equation for which it is known that whenever x_i equals some x_j, this can be done by requiring that is antisymmetric antisymmetric x_i, that is

Equation

Inspired by modern QFT (Quantum Field Theory) we replace Equation by an antisymmetric (fermionic) field with the property

giving thus

Equation

The added benefit is then that the different x_i will repel each other. Now one has two basic R.V.’s: ρ and ψ and we must integrate over ρ and ψ.

One can also sample over the set of Gaussian (normal) distributions by using the substitution

Equation

where Equation represents the true electronic distribution and is the laplacian of f at the point x.

As in QFT, D (x, y) is called the propagator from the point y to x. Using constraints we shall see that the first candidate for D (x,y) is Q(x–y) where Q is the origin-removed Patterson function defined here by

Equation

This propagator depends on N since Equation

Notations and formulas

Equation

The error function Equation [11,12].

Equation

A without subscript stands for some infinite positive number.

Equation

Equation where is the inverse of the kernel operator Q(x–y)

The phase random variable Equation is defined by where denotes the atomic distribution and the function ρ is our basic R.V.

Equation

The functional integral

Equation

The Equation constants. We define the constants by the series

Equation

The bn;m constants, defined by

Equation

Our representation of Equation for an angleis

Equation

We then define the fuzzy nonstandard Equation function by

Equation

For real x (not an angle) we define the nonstandard fuzzy Equation by

Equation for positive infinitesimal ε, and for complex

Equation

For some set H of reciprocal vectors we define

Equation

and sometimes we simply write Equation

We use the explicit definition of the functional derivative by

Equation

Where

Equation

where Equation

Equation ; Where

Some vector calculus: (f, g: vector valued functions, h a scalar function)

Equation

Recall that in three dimensions Equation

Equation

Preliminary knowledge

For an introduction on nonstandard theory we refer to Diener et al. and for a more advanced text see Nelson [13,14].

Nonstandard theory: Standard numbers are the known numbers: Equation the other numbers are the nonstandard real numbers which make up the field R. It is important to observe that there are an infinity of infinite numbers in R that are greater than any standard real number. Also there are an infinity of infinitesimals ε in R for which the absolute value |ε| is less than any positive standard number in R. From the axioms it follows that for every positive infinitesimal Equation is a positive infinite number and vice versa. Note that an infinite number is different from In this paper we use A to denote an infinite positive number and ε will always denote (unless explicitly noted otherwise) a positive infinitesimal will denote a function that associates a positive infinitesimal with every position x in the unit cell Equation We will use this function in our fuzzy Dirac delta. We shall use the notation when we deal with angle variables.

Anticommuting variables: In a detailed exposition of anticommuting numbers is given [1]. In this subsection we shall only expose the bare minimum needed to read this paper. For more information, we refer to Weinberg, Siegel, Kuzenko et al. and for a more mathematical treatment to Bruhat et al. and deWitt [15-19].

One starts with a set of anticommuting numbers θ_λ:

Equation

From this follows that every even product of such anticommuting numbers is commuting Equation Also one adds the axiom: Then the algebra is defined as the set of all finite sums of products.

When M is even, this is a commuting number (also called even) and when it is odd it is an anticommuting number (also called odd). Sums of such products with even M do commute and are called even, and with odd M these sums are anticommuting and are called odd. Every z∈C is also even. It follows that every Equation is a sum with β even and γ odd.

An involution Equation defined such that, and is odd when is odd and even when otherwise. One calls is odd when α is odd and even when otherwise. One calls ψ or ψ_x an odd function of x if ψ_x is odd for every x. It then follows that is even. Then the derivative with respect to the anticommuting variable θ is defined by Equation

Equation

Equation where

A function Equation of an odd variable θ has the simple form (Taylor expansion), (here a is odd when f is odd, and even otherwise, but b has the opposite statistics of f). This can be generalized for a function of N anti-commuting variables: The coefficients of even products in the expansion of the θ_ihave the same statistics as f, whereas the coefficients of uneven products have the opposite statistics. Next one defines the integration Equation as

Equation

and the multiple integration

Equation

It is also convenient to define θ as an odd element:

Equation

Also the following formulas are important

Equation

Note that the set of all odd numbers has vanishing volume

Equation and

Equation

Discussion

The four determinants are listed below. The following Theoremes are:

Theorem 1: Let M be an matrix n×n– matrix. Then

Equation

where by definition Equation

Proof develop Equation

Equation

Since,

Equation the theorem follows.

The continuous version is as follows. Let Equation be an anticommuting variable for every X in the unit cell. Then,

Equation

where one has defined Equation

Theorem 2: Suppose now that the inverse M^–1 exists and Equation be an anticommuting variable for every X. Then

Where

Equation

Proof let

Equation

Then transform

Equation

and substitute this in Equation Then using the relation

Equation

Thus

Equation

Also

Equation

The minus sign arises from the observation that Equation in

Equation

Indeed, note that

Equation

The probability functional Equation

We shall show that we can obtain the following probability function Equation (H is some set of reciprocal vectors) given by

where Equation is given, up to a phase unimportant constant, by

Equation

where Equation denotes chemical information or an intermediate iteration of ρ.

Equation

Equation will be the basic operator for all our First we need the following theorem:

Theorem 3: Let Equation be a functional of such that where A is a positive infinite number and p an integer ≥1. If we impose the constraint

where F has the property that Equation

Then if we define the action functional Equation (where c>0 is a constant) Then (where w_F>…. (10)

For a sequence of such Equation will become (if we drop the constant

Proof we impose this constraint by Equation

Equation

Since Equation is independent of the ϕ,h one can drop it in the above exponent. Next change and choose Since one obtains

Equation

since Equation (infinitesimal). Also under the change integral volume So finally ( after replacing

Equation

Equation (if we drop the constant

Theorem 4:One can write Equation

Where

Equation

where from now on Equation is included convenience, with parameters

Equation

Proof.

Equation and use the Dirac Then

Equation

where we defined Equation such that

• The R.V. Equation was defined by

Then the probability distribution of Equation is generated by the expression

But, (when A is infinite and positive)

Equation

After the transformation Equation obtain the result

Equation

where Equation For convenience, from now on, we shall include

• Next use Equation Then

Equation

where Equation

• For every Equation we impose the constraint

Equation

where

Equation

and

Equation

Then, according to theorem 1 above, one has

Equation

Next note that there is a phase unimportant peak Equation and define Q by

Then if one chooses the positive function Equation

Equation

One can also add other terms to Equation For example consider the triplet expression

Equation

Impose now the constraint Equation Since and is constant in the phases we can write according to the basic theorem

where Equation One can also do the same for quartets, quintets and so on. Next impose for the triplet, the constraint.

Equation

where

Equation

Note that Important, Equation from now on we shall treat all weights the same: We shall not distinguish between the different measurements

The same will be true for Equation The same is true for the But we shall not consider triplet terms of order in this paper. So now we have arrived at

Equation

This propagator Dx,y does not depend anymore on Equation In the sequel we shall simply say: “does not depend anymore on N”. It is better than Indeed to see this we can write as

Equation

This last expression becomes very low whenever x-y is not an interatomic vector since then Equation and thus and thus

That is Equation demoting such a ρ. We recall that we have also

Equation

Recall that Equation In order to see what this new propagator can offer let us look at Q_x.

Q_x is an N-sum of gaussian functions. Let us consider one of them, say Equation For sake of convenience we take now and we consider the one dimensional case ThenAnd Thus at x=0 we see that times larger than since which is very large since σ is very small. The function then drops very fast to zero at after which it remains negative, attains a negative minimum and then goes fast to Equation Also there is exactly one large negative minimum in the range Exactly as discussed for a ρ for which at one of these minima. For

we get Equation Because of the differentiationthis does not depend on N

Note, Equation This can also be used; Then there are no negative minima, but in order to make it N independent, one has to follow the procedure used in That is we must subtract the term in the Fourier expansion of to get a new propagator that is N-independent:

Improvements Equation

Let d be the maximum distance of all Equation where is the nearest neighbour ofThen we can obviously replace the is the characteristic function of the spherein the asymmetric unit of the crystal. Thus becomes If we know d we can then improve the phase densi- ties When ξ is a given chemical information (be it a submodel or an intermediate state ofduring iteration) then we can derive a new propagator, with notation Equation Indeed if we look at the term it is clear that we can consider an (improved) term and replace with the latter term. For instance if when and and are interatomic vectors; This is a stronger restriction on than merely the condition Now if is a submodel of then we can also replace

and obtain again a term of order Equation by replacingby the stronger condition (on ρ ) But now also changes to Indeed, in we can replace Then becomes where now (Remark thatis symmetric whenever and we replace b by another parameter f. Hence for a given submodel ξ we can now write a better

with Equation

Example: We can always place the origin of the asymmetric unit wherever we want, i.e. we can always suppose that one atomic vector, say a, is given. This means that at least we can always use the chemical information. Equation Then we get

Now we can show that with this we can directly calculate the density of the phase invariant.

Equation instead of simply Indeed consider the functional (where we

Equation

Next we do the functional change of variables: Equation where Then the Jacobian is the inverse of the determinant of the matrix which is not dependentThen

and Equation Defining the phase invariant

and considering the case that interests us most Equation we can write now

Where now Equation

Remark: Equation is indeed a phase invariant because under a translation of the origin also and thus under this translation which shows that is indeed a phase invariant. For the reciprocal vectors we can write where So we can write the phase invariant

The case for general ξ: Let Equation then and consider Then we apply the same functional change and we then get for

where Equation

Equation

Note: From now on we shall always write Equation instead of instead of resp.

A fermionic action functional and a new Equation

One knows that the different atoms in the unit cell repel each other. So, our random variable ρ should be chosen in such a way that the different peaks of ρ(x) spread over the unit cell and repel each other. This can be treated by considering ρ as an antisymmetric (fermionic) field written now as ψ. Then, following the treatment of QFT (Quantum Field Theory) [15], we replace.

Equation

Remark that Equation will be replaced by which must be even and hermitian. So

Equation

Next Equation

where I is the identity operator and we now replace Equation

We then get (where Equation is the inverse of the operator

Equation

since det Equation does not depend on the and since for a matrix

We can write

Equation

Then using Equation

Equation

A fermionic action functional and a new Equation

Since Equation does not depend on the

we can dismiss it in equation (38). Next continue with the case and and we define

Equation

Then for Equation

Equation

To get some idea let’s consider the simpler case Equation but still

Then the inverse of Q, i.e. Q^-1reads

Equation

Then

Equation

Where we omitted a term in equation (40) that does not depend on ϕ_h. In equation (40) we have used the identities

Equation

Finally, for Equation we get (omitting the terms that don’t depend on

Equation

The terms Equation are of higher order in f and c. So we see that we obtain in this way a probability of the form and thus

Equation

Equation (41) shows that for this model it is advantageous to choose f=c and then to use c for convergence considerations. For example, Equation (42) is then valid up to

We can extend the above model and study instead the model with action.

Equation

To calculate then the functional integral Equation we use the following trick.

If we then define

Equation

Then

Equation

where the choice Equation is clear and where we choose and invertible to make calculations easier Let us define

The it can be shown that Equation contains exactly all the connected diagrams of [1,15,16]. It is beyond the scope of this article to talk more about diagrams, but we shall discuss it together with the solution in a future paper.

Averaging over gaussian distributions ρ

So far we have been averaging over all positive Equation But what if we want to average only over gaussian ρ functions? The solution is the functional change of variables where is the true atomic distribution; This substitution is good if we don’t care about N-dependence, if we don’t want N-dependence we should instead consider

That is Equation

where Equation is a positive function, our new random variable.

Since Equation and thus also is about the true density they are completely determined by the phases In this way we will get a probability distribution of all Then the “volume” element , that is

This can be calculated but we can avoid this added complexity if we remark that we could have started from the very beginning by using instead of ρ the more complex form Equation that is we replace and so on. Replacing next the symbol by we then get etc. In this way the former is now describing “point” particles. However, the whole use of functional integrals in QFT is to describe interactions among point particles. So we do not know if it is worth doing averages over those Gaussian “point” particles.

We close this remark by giving two representations of the δ function. One is to represent Equation by a gaussian with infinitesimal variance. The other very interesting representation is In our case it reads

We can then first integrate over ρ and after that perform the integration over k, which is much easier.

Maximality with constraints

We saw in the foregoing sections that we had to maximize Equation Let us analyze this further. We shall now start with We will maximize this with the constraints for all Next observe that We then use the method of Langrangian multipliers. Put now

The minus signs in equation (51) have been chosen so as to use later on the more general “KKT- multipliers”) and find the solutions for which is maximal (critical), that is solve the equations

Equation

Next observe that

Equation

Since has now become redundant, we replace in equation (51). We can also add inequality constraints for ρ

Equation

and

Equation

In this case the multipliers Equation multipliers (KKT stands for Karush- Kuhn-Tucker). And we have a dependence now on

Equation

It follows from the above equation that we can impose (we suppose in this paper that friedel’s law is valid, that is and

Equation

We use the notation Equation to denote the transpose of A, and then We have to solve

Equation

We find

Equation

This gives (using ρ* instead of ρ)

Equation

and thus

Equation

Next we develop Equation [16]

Equation

Then,

Equation

Then we can write if we choose a to be great and b small (a>>b)

Equation

or if we choose a small and Equation

Equation

Since we prefer to use the easier instead of we shall in this paper proceed with the development of equation (64). Then for a>>b we find

Equation

Next Equation will give

Equation

From Equation follows

Equation

From the equations (67,68) we derive the values of α_pand β_p as functions of f and b. From these results and equation (69) we derive the value of f as a function b. We now see that are of order . If we would derive the value for b with the condition then we will also see that b is of order which gives a problem since we started with the assumption (a>>b)

For this reason, we shall not impose the condition Equation The bare minimum is the calculation of all the Lagrange multipliers and one or more Lagrange multipliers All the multipliers depend strongly on the phase invariants The situation becomes even more interesting if one now calculates and this is good news. We think that this last model is very exciting (perhaps it can even be used to construct the exact ρ from any given ξ). We will study all this in a separate paper.Now Equation can be written in a short way as

Since Equation and moreover one can verify easily that is a constant in ρ.

Conclusion

To calculate a probability distribution prob Equation for some phase one chooses one of the models discussed in this paper and also some of reciprocal vectors containing h. Then one calculates according to the chosen model. After that one calculates the marginal distribution Always choose structural information ξ e.g. the fixing of the origin Equation All models should lead to the solution of the phase problem.

In a future paper (II) we shall study in detail all different models but especially the fermionic model and the one of maximality with constraints. Especially we shall discuss the most general fermionic model Equation and we shall talk about the technique of the diagrams to calculate

For the very interesting model of maximality with constraints we shall also add the KKT condition Equation with some KKT multiplier Finally, in a last paper (III or IV) we shall test the theory on simulated crystal structures.

We shall also discuss which strategy to use in case of available space group information. Our paper treated only the space P1 (satisfying Friedel’s law). Our use of functional integration and calculus is much more powerful than the other methods of phase determination, be it probabilistic or direct space methods and is valid for any number N of atoms. We shall also try to discuss models for which the formulas will depend N.

References

Brosius J. Probability distributions of phases without N. Adv Mathematic Res. 2020;27.
Xu H, Weeks CM, Deacon AM, Miller R, Hauptman HA. Ill-conditioned shake-and-bake: The trap of the false minimum. Acta Crystallogr A. 2000;56(2):112-118.
[Crossref] [Google Scholar] [PubMed]
Langs DA, Hauptman HA. Relaxation of the resolution requirements for direct-methods phasing. Acta Crystallogr A. 2011;67(4):396-401.
[Crossref] [Google Scholar] [PubMed]
Bethanis K, Tzamalis P, Hountas A, Mishnev AF, Tsoucaris G. Upgrading the twin variables algorithm for large structures. Acta Crystallogr A. 2000;56(2):105-111.
[Crossref] [Google Scholar] [PubMed]
Burla MC, Camalli M, Carrozzini B, Cascarano GL, Giacovazzo C, Polidori G, et al. SIR2000, a program for the automatic ab initio crystal structure solution of proteins. Acta Crystallogr A. 2000;56(5):451-457.
[Crossref] [Google Scholar] [PubMed]
Altomare A, Burla MC, Camalli M, Cascarano GL, Giacovazzo C, Guagliardi A, et al. SIR97: A new tool for crystal structure determination and refinement. J Appl Cryst. 1999;32(1):115-119.
[Crossref] [Google Scholar]
Burla MC, Camalli M, Carrozzini B, Cascarano GL, Giacovazzo C, Polidori G, et al. SIR99, a program for the automatic solution of small and large crystal structures. Acta Crystallogr A. 1999;55(6):991-999.
[Crossref] [Google Scholar] [PubMed]
Cascarano G, Giacovazzo C, Camalli M, Spagna R, Burla MC, Nunzi A, et al. The method of representations of structure seminvariants. The strengthening of triplet relationships. Acta Crystallogr A. 1984;40(3):278-283.
[Crossref] [Google Scholar]
Burla MC, Carrozzini B, Cascarano GL, Comunale G, Giacovazzo C, Mazzone A, et al. Estimates of triplet invariants given a model structure. Acta Crystallogr A. 2012;68(4):513-520.
[Crossref] [Google Scholar]
Giacovazzo C. Direct phasing in crystallography: Fundamentals and applications. Oxford Univers Press. 1998.
[Google Scholar]
Gradshteyn IS, Ryzhik IM. Table of integrals, series, and products. Academic Press. 1980.
[Crossref]
Frank WO, Daniel WL, Ronald FB, Charles WC. NIST handbook of mathematical functions. Cambridge Univers Press. 2010.
[Google Scholar]
Diener F, Reeb G. Analyse non standard. 1989. [Google Scholar]
Nelson E. Internal set theory: A new approach to nonstandard analysis. Bull Amer Math Soc. 1977;83(6):1165-1198.
[Crossref] [Google Scholar]
Weinberg S. The quantum theory of fields. Cambridge University Press. 2005.
[Google Scholar]
Siegel W. Fields. 2005.
Kuzenko SM, Buchbinder IL. Ideas and methods of supersymmetry and supergravity or a walk through superspace. 1998.
[Crossref] [Google Scholar]
Bruhat YC, Bleick MD, Morette CD. Analysis, manifolds and physics. 2000.
[Crossref] [Google Scholar]
deWitt BS. Supermanifolds. 1992.
[Crossref] [Google Scholar]

Author Info

Jan Brosius^* and Walter Brosius

Department of Theoretical Chemistry, University of Valencia, Valencia, Spain

Citation: Brosius J, Brosius W (2024) Probability Distributions of Phases I. Math Eter. 14:226.

Received: 11-Jun-2024, Manuscript No. ME-24-31944 ; Editor assigned: 14-Jun-2024, Pre QC No. ME-24-31944 (PQ); Reviewed: 01-Jul-2024, QC No. ME-24-31944 ; Revised: 08-Jul-2024, Manuscript No. ME-24-31944 (R); Published: 15-Jul-2024 , DOI: 10.35248/1314-3344.24.14.226

Copyright: © 2024 Brosius J, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

References

Yacht Charter in Australia Blog | Mortgage info in New York

Mathematica EternaOpen Access

Probability Distributions of Phases I

Abstract

Keywords

Introduction

Discussion

Conclusion

References

Author Info

Mathematica Eterna
Open Access