Mathematica Eterna

Mathematica Eterna
Open Access

ISSN: 1314-3344

+44-77-2385-9429

Mini Review - (2024)Volume 14, Issue 3

Probability Distributions of Phases I

Jan Brosius* and Walter Brosius
 
*Correspondence: Jan Brosius, Department of Theoretical Chemistry, University of Valencia, Valencia, Spain, Email:

Author info »

Abstract

This article presents the mathematical foundation for calculating PD's (Probability Distributions) for some set of phases {ϕh} needed for the structure determination of a crystal. We can obtain PD's of the phases that can contain N or without N. A former paper could only obtain PD's of the phases containing N. Here we have the two possibilities.

Keywords

Random variable; Reciprocal vectors; Binomial distribution; Infinite number; Phase

Introduction

In a short review was given of the old probabilistic DM (Direct Methods) way for calculating phase distributions [1].

There were two mathematical approaches see (A) and (B) below (A): The basic R.V.’s (Random Variables) are the set of the Equation that are distributed independently and uniformly over the asymmetric unit (we consider in this paper only P1) and one studies the normalized structure factorsEquation And one calculates the probabilities of the phasesEquation (B): The basic R.V.’s are the reciprocal vectors h that are distributed uniformly and independently over reciprocal space and one keeps the Xiconstant. This method can give algebraic equations as follows: One can study the structure factorsEquation and we consider only h as the basic reciprocal vector and one keeps k fixed. The B3,0 formula is an equation obtained this way. Although this equation gives the value ofEquation in theory, in practice this equation is wrong for high N, which is due to accidental overlap of the xi which invalidates the calculation of the joint probabilities ofEquation Even when one calculates the joint probabilitiesEquation where h and k are the basic R.V.’s one must assume no accidental overlap of the xi (which becomes a problem for high N.). The calculation of joint probabilities gives then the same results as in (A) above.

(C): Using method (A) one can derive the probability of the cosine invariantEquationEquation

It follows that this formula

Loses predictive power for high N.

Cannot predict negative cosines.

The probabilities of quartets, quintets, etc. are even worse since they are of order of 1/N (for quartets), of orderEquation (for quintets), etc. (Although one can get a quartet formula that theoretically predicts negative cosines for the quartet (but again with too low predictive power)). At the end of the twentieth century nobody was busy anymore with calculating prob-abilistic phase distributions using one of the methods (A) or (B). For the calculations of structures with high N (N being here the number of independent non-H atoms in the asymmetric unit), one began to devise methods in direct space to solve crystal structures. One uses an automatic cyclical process: (a): Phase refinement (for instance with the use of the (modified) tangent formula) in reciprocal space and; (b): With the imposition in real space of physically meaningful constraints through an atomic interpretation of the electron density, with minimization of a well-chosen FOM (Figure Of Merit) of the phases. One of these methods in DM is known as the SnB (Shake and Bake) algorithm with N 1200 [2,3]; Another is the twin variables approach withEquation Sir2000 the successor of SIR97 and SIR99 although different from SnB: (e.g. triplet invariants via the P10 formula withEquation Another interesting result is the solution of a crystal when a substructure is known where N may become higher [4-9]. For an overview of DM before the year 2000 we refer to Giacovazzo [10].

(D): In order to circumvent these problems one approach might be to consider R.V.’s (xi)that are no longer independent neither uniformly distributed, say a dependence through a positive distributionEquation

One can give such distributions by using the functions Equation

But then one encounters insurmountable mathematical difficulties.

The solution is to not consider the xi as R.V.’s anymore but to replaceEquation by a fieldEquation and to sample the field over the allowable function space. What we shall discuss here is a novel way for doing DM (Direct Methods).

(E): Differences with our approach

• We shall be able to solve any structure (any N) ab initio.

• Much lower CPU time.

• LetEquation then with our approach we can easily calculate the probability distribution ofEquation for any h. No need to compute all possible triplets.

• Easy to incorporate any given substructure.

• Easy to calculate the PD’s (Probability Distributions) of phases: One only needs to take derivatives.

In this paper we shall give the mathematical basis that is necessary for this completely new DM approach. This approach is not mathematically as simple as in (A) and (B) but it is perfectly doable. It consists in using the atomic distribution function (x) as the basic random variable. The method will also be based on a functional integration over the random variable and using a nonstandard fuzzy approach wherein Dirac delta functions (among which a novel delta function representation for angle variables) are replaced by nonstandard fuzzy delta functions. To show the strength of the method, a simple formula was given in Brosius for the distribution of the triplet phase formula of the form Equation

Where A is a function depending (not on N!) on the structure factors of the first neighborhood of the triplet [1].

In this paper a more profound mathematical foundation of our DM approach is given and this will be a major improvement compared to Brosius [1]. Recall that the sampling is done over positive functionsEquation (in the space group P1) and that the R.V.’s that we study are the phases Equation which are defined by the relationEquation Where is a R.V. defined byEquation

Equation

and from now on we shall use the notations

Equation

Equation

Equation

One then needs to define a probability density Equation on the sample space ρ's We build up Equation by fuzzy Dirac delta functions in 4 steps

Through constraints of the form Equation by using fuzzy Dirac delta’s Equationa positive infinitesimal).

Next through maximization: Adding obvious terms toEquation whereEquation that cannot be added by using a constraint, like e.g. the term. Equation

Eventually we add fermionic terms to z, like e.g.

Equation

By imposing the mathematical requirement on the basic R.V. ρ that the different atoms in the unit cell of the crystal repel each other.

The idea is that if one would consider a function Equation for which it is known thatEquation whenever xi equals some xj, this can be done by requiring thatEquation is antisymmetric antisymmetric xi, that is

Equation

Inspired by modern QFT (Quantum Field Theory) we replace Equation by an antisymmetric (fermionic) field Equation with the propertyEquation

giving thus

Equation

The added benefit is then that the different xi will repel each other. Now one has two basic R.V.’s: ρ and ψ and we must integrate over ρ and ψ.

One can also sample over the set of Gaussian (normal) distributions by using the substitution

Equation

where Equation represents the true electronic distribution and Equation is the laplacian of f at the point x.

As in QFT, D (x, y) is called the propagator from the point y to x. Using constraints we shall see that the first candidate for D (x,y) is Q(x–y) where Q is the origin-removed Patterson function defined here by

Equation

This propagator depends on N since Equation

Notations and formulas

Equation

Equation

Equation

The error functionEquation [11,12].

Equation

Equation

Equation

Equation

Equation

A without subscript stands for some infinite positive number.

Equation

Equation

Equation whereEquation is the inverse of the kernel operator Q(x–y)

The phase random variable Equationis defined byEquation whereEquation denotes the atomic distribution and the function ρ is our basic R.V.

Equation

The functional integral

Equation

Equation

Equation

Equation

The Equation constants. We define the constants Equation by the series

Equation

The bn;m constants, defined by

Equation

Our representation of Equation for an angleEquationis

Equation

We then define the fuzzy nonstandard Equation function by

Equation

For real x (not an angle) we define the nonstandard fuzzyEquation by

Equation for positive infinitesimal ε, and for complexEquation

Equation

For some set H of reciprocal vectors we define

Equation

and sometimes we simply write Equation

We use the explicit definition of the functional derivative by

Equation

Where

Equation

Equation

where Equation

Equation; WhereEquation

Some vector calculus: (f, g: vector valued functions, h a scalar function)

Equation

Equation

Equation

Equation

Equation

Equation

Recall that in three dimensionsEquation

Equation

Equation

Preliminary knowledge

For an introduction on nonstandard theory we refer to Diener et al. and for a more advanced text see Nelson [13,14].

Nonstandard theory: Standard numbers are the known numbers: Equation the other numbers are the nonstandard real numbers which make up the field R. It is important to observe that there are an infinity of infinite numbers in R that are greater than any standard real number. Also there are an infinity of infinitesimals ε in R for which the absolute value |ε| is less than any positive standard number in R. From the axioms it follows that for every positive infinitesimalEquation is a positive infinite number and vice versa. Note that an infinite number is different fromEquation In this paper we use A to denote an infinite positive number and ε will always denote (unless explicitly noted otherwise) a positive infinitesimal Equation will denote a function that associates a positive infinitesimal with every position x in the unit cellEquationEquation We will use this function in our fuzzy Dirac delta.Equation We shall use the notationEquation when we deal with angle variables.

Anticommuting variables: In a detailed exposition of anticommuting numbers is given [1]. In this subsection we shall only expose the bare minimum needed to read this paper. For more information, we refer to Weinberg, Siegel, Kuzenko et al. and for a more mathematical treatment to Bruhat et al. and deWitt [15-19].

One starts with a set of anticommuting numbers θλ:

Equation

From this follows that every even product of such anticommuting numbers is commutingEquation Also one adds the axiom:Equation Then the algebraEquation is defined as the set of all finite sums of products.Equation

When M is even, this is a commuting number (also called even) and when it is odd it is an anticommuting number (also called odd). Sums of such products with even M do commute and are called even, and with odd M these sums are anticommuting and are called odd. Every z∈C is also even. It follows that every Equation is a sumEquation with β even and γ odd.

An involution Equation defined such that, Equation andEquationEquation is odd whenEquation is odd and even when otherwise. One callsEquation is odd when α is odd and even when otherwise. One calls ψ or ψx an odd function of x if ψx is odd for every x. It then follows thatEquation is even. Then the derivativeEquation with respect to the anticommuting variable θ is defined byEquation

Equation

Equation

Equation

Equation whereEquation

A function Equation of an odd variable θ has the simple formEquation (Taylor expansion), (here a is odd when f is odd, and even otherwise, but b has the opposite statistics of f). This can be generalized for a functionEquation of N anti-commuting variables: The coefficients of even products in the expansion of the θi have the same statistics as f, whereas the coefficients of uneven products have the opposite statistics. Next one defines the integrationEquation as

Equation

and the multiple integration

Equation

It is also convenient to define θ as an odd element:

Equation

Also the following formulas are important

Equation

Equation

Note that the set of all odd numbers has vanishing volume

Equation and

Equation

Discussion

The four determinants are listed below. The following Theoremes are:

Theorem 1: Let M be an matrix n×n– matrix. Then

Equation

where by definition Equation

Proof develop Equation

Equation

Since,

Equation the theorem follows.

The continuous version is as follows. LetEquation be an anticommuting variable for every X in the unit cell. Then,

Equation

where one has defined Equation

Theorem 2: Suppose now that the inverse M–1 exists andEquation be an anticommuting variable for every X. ThenEquation

Where

Equation

Equation

Proof let

Equation

Then transform

Equation

and substitute this in Equation Then using the relation Equation

Equation

Thus

Equation

Also

EquationEquation

The minus sign arises from the observation thatEquation inEquation

Equation

Indeed, note that

Equation

Equation

Equation

Equation

The probability functionalEquation

We shall show that we can obtain the following probability function Equation (H is some set of reciprocal vectors) given byEquation

where Equation is given, up to a phase unimportant constant, byEquation

EquationEquationEquationEquationEquationEquation

where Equation denotes chemical information or an intermediate iteration of ρ.

Equation

Equation will be the basic operator for all ourEquation First we need the following theorem:

Theorem 3: LetEquation be a functional of Equation such thatEquation where A is a positive infinite number and p an integer ≥1. If we impose the constraintEquation

where F has the property that Equation

Then if we define the action functional Equation (where c>0 is a constant) ThenEquation (where wF>…. (10)

For a sequence of such Equation will become (if we drop the constant Equation

Proof we impose this constraint by Equation

Equation

Equation

Since Equation is independent of the ϕ,h one can drop it in the above exponent. Next changeEquation and chooseEquation SinceEquation one obtainsEquation

Equation

Equation

since Equation (infinitesimal). Also under the changeEquation integral volume Equation So finally ( after replacingEquation

Equation

Equation

Equation (if we drop the constantEquation

Theorem 4:One can write Equation

Where

Equation

Equation

Equation

Equation

where from now on Equation is included Equation convenience, with parameters

Equation

Equation

Proof.

Equation and use the DiracEquation Then

Equation

Equation

Equation

Equation

Equation

where we defined Equation such thatEquation

• The R.V. Equation was defined byEquation

Then the probability distribution of Equation is generated by the expression Equation

But, (when A is infinite and positive)

Equation

Equation

Equation

After the transformation Equation obtain the result

Equation

where Equation For convenience, from now on, we shall includeEquation

• Next use Equation Then

Equation

Equation

Equation

Equation

where Equation

• For every Equation we impose the constraint

Equation

where

Equation

and

Equation

Then, according to theorem 1 above, one has

Equation

Equation

Next note that there is a phase unimportant peak Equation and define Q byEquation

Then if one chooses the positive function Equation

Equation

Equation

Equation

Equation

Equation

One can also add other terms to EquationFor example consider the triplet expression

Equation

Equation

Impose now the constraint Equation SinceEquation andEquation is constant in the phases we can write according to the basic theoremEquation

where Equation One can also do the same for quartets, quintets and so on. Next impose for the triplet, the constraint.

Equation

where

Equation

Equation

Equation

Equation

Equation

Equation

Equation

Note that Important, Equation from now on we shall treat all weightsEquation the same: We shall not distinguish between the different measurementsEquation

The same will be true for Equation The same is true for the Equation But we shall not consider triplet terms of orderEquation in this paper. So now we have arrived atEquation

Equation

This propagator Dx,y does not depend anymore onEquation In the sequel we shall simply say: “does not depend anymore on N”. It is better thanEquation Indeed to see this we can writeEquation as

Equation

Equation

This last expression becomes very low whenever x-y is not an interatomic vector since then Equation and thusEquation and thusEquationEquation

That is Equation demoting such a ρ. We recall that we have alsoEquation

Equation

Recall that Equation In order to see what this new propagator can offer let us look at Qx.

Qx is an N-sum of gaussian functions. Let us consider one of them, sayEquation For sake of convenience we take nowEquation and we consider the one dimensional caseEquation ThenEquationAndEquation Thus at x=0 we see thatEquation times larger thanEquation sinceEquation which is very large since σ is very small. The functionEquation then drops very fast to zero atEquation after which it remains negative, attains a negative minimum and then goes fast toEquation Also there is exactly one large negative minimum in the rangeEquation Exactly as discussedEquationEquation for a ρ for which Equationat one of these minima. For Equation

we get Equation Because of the differentiationEquationthis Equation does not depend on N

Note, Equation ThisEquation can also be used; Then there are no negative minima, but in order to make it N independent, one has to follow the procedure used inEquation That is we must subtract the termEquation in the Fourier expansion of Equation to get a new propagator that is N-independent:Equation

Improvements Equation

Let d be the maximum distance of all Equation whereEquation is the nearest neighbour ofEquationThen we can obviously replace theEquation is the characteristic function of the sphereEquationin the asymmetric unit of the crystal. ThusEquation becomesEquation If we know d we can then improve the phase densi- tiesEquation When ξ is a given chemical information (be it a submodel or an intermediate state ofduring iteration) then we can derive a new propagator, with notation Equation Indeed if we look at the termEquation it is clear that we can consider an (improved) term Equation and replaceEquation with the latter term. For instance if Equation when andEquation and Equation are interatomic vectors; This is a stronger restriction on than merely the conditionEquation Now ifEquation is a submodel of then we can also replaceEquation

and obtain again a term of order Equation by replacingEquationby the stronger condition (on ρ ) Equation But now alsoEquation changes toEquation Indeed, inEquation we can replaceEquation ThenEquation becomes Equation where now Equation (Remark thatEquationis symmetric whenever Equation and we replace b by another parameter f. Hence for a given submodel ξ we can now write a betterEquationEquation

with Equation

Example: We can always place the origin of the asymmetric unit wherever we want, i.e. we can always suppose that one atomic vector, say a, is given. This means that at least we can always use the chemical information. Equation Then we getEquationEquation

Now we can show that with this we can directly calculate the density of the phase invariant.

Equation instead of simplyEquation Indeed consider the functional (where weEquation

Equation

Next we do the functional change of variables: Equation whereEquation Then the Jacobian is the inverse of the determinant of the matrixEquation which is not dependentEquationThenEquationEquationEquation

and Equation Defining the phase invariantEquation

and considering the case that interests us most Equation we can write nowEquation

Where now Equation

Remark:Equation is indeed a phase invariant because under a translation of the originEquation alsoEquation and thusEquation under this translation which shows thatEquation is indeed a phase invariant. For the reciprocal vectorsEquationEquation we can writeEquation whereEquation So we can write the phase invariantEquation

The case for general ξ: LetEquation then Equation and consider EquationEquation Then we apply the same functional change Equation and we then get forEquationEquation

whereEquation

Equation

Equation

Equation

Equation

Equation

Equation

Equation

Note: From now on we shall always write Equation instead ofEquation instead ofEquation resp. Equation

A fermionic action functional and a newEquation

One knows that the different atoms in the unit cell repel each other. So, our random variable ρ should be chosen in such a way that the different peaks of ρ(x) spread over the unit cell and repel each other. This can be treated by considering ρ as an antisymmetric (fermionic) field written now as ψ. Then, following the treatment of QFT (Quantum Field Theory) [15], we replace.

Equation

Equation

Remark thatEquation will be replaced by Equation which must be even and hermitian. SoEquation

Equation

Next EquationEquationEquation

where I is the identity operator and we now replace Equation

We then get (whereEquation is the inverse of the operatorEquation

Equation

Equation

since det Equation does not depend on the and since Equation for a matrixEquation

We can write

Equation

Equation

Equation

Equation

Equation

Then using Equation

Equation

Equation

Equation

A fermionic action functional and a newEquation

Since Equation does not depend on theEquation

we can dismiss it in equation (38). Next continue with the caseEquation andEquation and we define

Equation

Equation

Equation

Then for Equation

Equation

Equation

To get some idea let’s consider the simpler case Equation but still Equation

Then the inverse of Q, i.e. Q-1reads

Equation

Then

Equation

Equation

Equation

Equation

Where we omitted a term in equation (40) that does not depend on ϕh. In equation (40) we have used the identities

Equation

Equation

Equation

Equation

Finally, for Equation we get (omitting the terms that don’t depend onEquation

Equation

Equation

Equation

The terms Equation are of higher order in f and c. So we see that we obtain in this way a probability of the formEquation and thus Equation

Equation

Equation (41) shows that for this model it is advantageous to choose f=c and then to use c for convergence considerations. For example, Equation (42) is then valid up to Equation

We can extend the above model and study instead the model with action.

Equation

To calculate then the functional integral Equation we use the following trick.Equation

If we then define

Equation

Equation

Equation

Then

Equation

where the choiceEquation is clear and where we chooseEquation and invertible to make calculations easier Equation Let us defineEquation

The it can be shown that Equation contains exactly all the connected diagrams of Equation[1,15,16]. It is beyond the scope of this article to talk more about diagrams, but we shall discuss it together with the solution in a future paper.

Averaging over gaussian distributions ρ

So far we have been averaging over all positive Equation But what if we want to average only over gaussian ρ functions? The solution is the functional change of variablesEquation whereEquation is the true atomic distribution; This substitution is good if we don’t care about N-dependence, if we don’t want N-dependence we should instead considerEquation

That is Equation

where Equation is a positive function, our new random variable.

Since Equation and thus also Equation is about the true density they are completely determined by the phasesEquation In this way we will get a probability distribution of allEquation Then the “volume” element Equation, that is Equation

This can be calculated but we can avoid this added complexity if we remark that we could have started from the very beginning by using instead of ρ the more complex formEquation that is we replaceEquation and so on. Replacing next the symbolEquation by Equation we then getEquationEquation etc. In this way the former is now describing “point” particles. However, the whole use of functional integrals in QFT is to describe interactions among point particles. So we do not know if it is worth doing averages over those Gaussian “point” particles.

We close this remark by giving two representations of the δ function. One is to representEquation by a gaussian with infinitesimal variance. The other very interesting representation isEquation In our case it readsEquation

We can then first integrate over ρ and after that perform the integration over k, which is much easier.

Maximality with constraints

We saw in the foregoing sections that we had to maximizeEquation Let us analyze this further. We shall now start withEquation We will maximize this with the constraintsEquation for allEquation Next observe thatEquation We then use the method of Langrangian multipliers. Put now Equation

The minus signs in equation (51) have been chosen so as to use later on the more general “KKT- multipliers”) and find the solutions Equation for whichEquation is maximal (critical), that is solve the equations

Equation

Equation

Equation

Next

Equation

Equation

Equation

Next observe that

Equation

Equation

Equation

Since Equation has now become redundant, we replaceEquation in equation (51). We can also add inequality constraints for ρ

Equation

Equation

and

Equation

Equation

In this case the multipliers Equation multipliers (KKT stands for Karush- Kuhn-Tucker). And we have a dependenceEquation now onEquation

Equation

It follows from the above equation that we can impose (we suppose in this paper that friedel’s law is valid, that is Equation andEquation

Equation

Equation

We use the notationEquation to denote the transpose of A, and thenEquation We have to solve

Equation

Equation

Equation

Equation

Equation

We find

Equation

This gives (using ρ* instead of ρ)

Equation

and thus

Equation

Next we developEquation [16]

Equation

Equation

Equation

Then,

Equation

Then we can write if we choose a to be great and b small (a>>b)

Equation

Equation

or if we choose a small and Equation

Equation

Equation

Since we prefer to use the easier Equation instead ofEquation we shall in this paper proceed with the development of equation (64). Then for a>>b we find Equation

Equation

Equation

Equation

Equation

Next Equation will giveEquation

Equation

From Equation follows

Equation

From the equations (67,68) we derive the values of αp and βp as functions of f and b. From these results and equation (69) we derive the value of f as a function b. We now see thatEquation are of order Equation . If we would derive the value for b with the condition Equation then we will also see that b is of order Equationwhich gives a problem since we started with the assumption (a>>b)

For this reason, we shall not impose the conditionEquation The bare minimum is the calculation of all the Lagrange multipliersEquation and one or more Lagrange multipliersEquation All the multipliers depend strongly on the phase invariantsEquation The situation becomes even more interesting if one now calculatesEquation and this is good news. We think that this last model is very exciting (perhaps it can even be used to construct the exact ρ from any given ξ). We will study all this in a separate paper.Now Equation can be written in a short way asEquation

Since Equation and moreover one can verify easily thatEquation is a constant in ρ.

Conclusion

To calculate a probability distribution prob Equation for some phase Equation one chooses one of the models discussed in this paper and also some Equation of reciprocal vectors containing h. Then one calculatesEquation according to the chosen model. After that one calculates the marginal distributionEquation Always choose structural information ξ e.g. the fixing of the originEquation All models should lead to the solution of the phase problem.

In a future paper (II) we shall study in detail all different models but especially the fermionic model and the one of maximality with constraints. Especially we shall discuss the most general fermionic modelEquation and we shall talk about the technique of the diagrams to calculateEquation

For the very interesting model of maximality with constraints we shall also add the KKT condition Equation with some KKT multiplierEquation Finally, in a last paper (III or IV) we shall test the theory on simulated crystal structures.

We shall also discuss which strategy to use in case of available space group information. Our paper treated only the space P1 (satisfying Friedel’s law). Our use of functional integration and calculus is much more powerful than the other methods of phase determination, be it probabilistic or direct space methods and is valid for any number N of atoms. We shall also try to discuss models for which the formulas will depend N.

References

Author Info

Jan Brosius* and Walter Brosius
 
Department of Theoretical Chemistry, University of Valencia, Valencia, Spain
 

Citation: Brosius J, Brosius W (2024) Probability Distributions of Phases I. Math Eter. 14:226.

Received: 11-Jun-2024, Manuscript No. ME-24-31944 ; Editor assigned: 14-Jun-2024, Pre QC No. ME-24-31944 (PQ); Reviewed: 01-Jul-2024, QC No. ME-24-31944 ; Revised: 08-Jul-2024, Manuscript No. ME-24-31944 (R); Published: 15-Jul-2024 , DOI: 10.35248/1314-3344.24.14.226

Copyright: © 2024 Brosius J, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Top