Closed-Form Estimators for Blind Separation of Sources – Part II: Complex Mixtures
VICENTE ZARZOSO and ASOKE K. NANDI
Signal Processing and Communications Group, Department of Electrical Engineering and Electronics, The University of Liverpool, Brownlow Hill, Liverpool L69 3GJ, U.K.
E-mail: [email protected]
Abstract. The problem of multiuser detection in wireless communications systems adopts, in flat-fading channels, a blind source separation (BSS) formulation of instantaneous linear mixtures. This contribution addresses the closed-form solutions to BSS in the complex-mixture scenario. The algebraic devices which span a unifying framework for the complex BSS closed-form estimators are developed. With the aid of these tools, results orig- inally encountered in the real-mixture case are extended to the complex case, thus highlighting the remarkable parallelism existing between the real and complex problems in the context of their analytic solutions. Computer simulations illustrate the theoretical results and compare the proposed methods to other BSS procedures.
Keywords:bicomplex numbers, blind source separation, closed-form estimators, complex mixtures, higher-order statistics, independent component analysis, multiuser detection.
1. Introduction
1.1. PROBLEMSTATEMENT
CDMA’s many attractive features (“soft capacity”, “soft handover”, inherently dynamic chan- nel sharing, etc.) have transformed it into the preferred multiple access technique for third generation wireless personal communications [20]. Since the quality of CDMA systems de- grades with the interference caused by users operating in the same cell, active suppression of multiuser interference (MUI) leads to considerable performance gains. Multiuser detection (MUD) [26, 27] techniques take advantage of the particular structure of MUI in order to extract the signal(s) of interest free from interference. This process implicitly involves the recovery of the users’ transmitted signals that are mixed at the receiving antenna. In flat-fading environments [21], the signals observed at a sensor array output y = [y1, . . . , yp]T ∈ Cp can be considered as unknown instantaneous mixtures of the transmitted data signals x = [x1, . . . , xq]T∈Cq:
y=Mx. (1)
The unknown mixing matrixM∈Cp×qis determined by the spreading waveforms, propaga- tion conditions, source-sensor positions, etc. Hence, MUD reduces to the estimation of users’
data x from the observations y, a problem which corresponds the blind source separation (BSS) of instantaneous linear mixtrues [31]. A myriad of other applications, in areas as diverse
Supported through a Post-doctoral Research Fellowship awarded by the Royal Academy of Engineering, U.K.
as biomedical signal processing [15, 16, 22], seismic exploration [14, 24], radar and sonar, speech processing [25], etc., can also be modelled from the perspective of BSS. Due to this wide range of applications, the BSS problem has aroused great research interest over the last decade.
Only two assumptions usually suffice to achieve the separation, namely, that the source signals be statistically independent and the matrixM be full column rank. Even counting on these fundamental hypotheses, the solution to the BSS problem suffers from inherent indeter- minacies related to the order and the scale of the recovered sources. Both are unimportant in most practical situations. As a consequence, it can be assumed without loss of generality that the sources are unit-variance.
The convenience of tackling the problem in two steps has been endorsed in a great number of works [4, 6–10, 12–14, 19, 30, 31]. The first step, known as pre-whitening, involves second- order statistics (SOS) and yields a set of uncorrelated normalized signals, so-called whitened observations. The second step accomplishes the source separation and mixing matrix iden- tification by means of the higher-order statistics (HOS) of the data. The justification for the first SOS-based step is twofold: (a) since SOS are generally estimated with higher accuracy than HOS, it seems sensible to benefit from the former before resorting to the latter, and (b) after pre-whitening, the dimensionality of the problem is reduced. Effectively, the whitened observationszare related to the true sources through a unitary transformationQ∈Cq×q:
z=Qx, (2)
from where it becomes apparent that fewer parameters remain to be estimated relative to the initial model (1).
1.2. CLOSED-FORMSOLUTIONS
Several different approaches exist to cope with the BSS problem [4, 8, 31], but we are pri- marily concerned with the closed-form or analytical solutions which yield direct estimators for the mixing-structure elements. In the two-source two-sensor scenario, direct solutions are found for model (1) without the need for a pre-whitening stage [5, 17]. However, this is at the expense of constraining the mixing matrix structure, resulting in the loss of the uniform performance property [3, Section III-E]. The desirable uniform performance feature means that the separation quality depends only on the source distribution, but not on the particular mixing structure [2, 3].
For this reason, our focus is rather on the direct estimation of matrixQafter pre-whitening, where no such constraints on the mixing structure are made. In [6] the first analytical expres- sion (Comon’s formula, CF) was proposed for the real-mixture case, based on the 4th-order cumulants. The extension of this idea to the complex-signal environment (complex CF, CCF) was carried out in [7], [9] and [13]. Also for the real-mixture scenario, the maximum- likelihood (ML) approach was adopted in [12], and with the help of Gram–Charlier expansion of the source probability density function (pdf), another closed-form estimator (approximate ML, AML) was obtained for symmetric sources with identical distribution and positive kurto- sis. Those restrictions were spared with the estimator developed in [30] (extended ML, EML).
Another of such expressions (sum of output kurtosis, SOK) was found in [10] as the closed- form solution to the maximimization of a contrast function proposed in [19]. The connection among the above estimators for the real case was accomplished in [33]. The relationship between the CF, the AML and the EML was unveiled and the performance of the CF analyzed.
A family of closed-form estimators based on thenth-order cumulants was developed, of which the EML was found to be a particular case (at ordern =4). The family was associated with the respective cost function optimization criteria, which led to the link between the EML and the SOK solutions. Also, a novel 4th-order estimator (alternative EML, AEML) was proposed, in conjunction with an empirically-derived decision rule, in order to overcome a deterioration in performance of the other 4th-order estimators for null source kurtosis sum (sks).
1.3. OBJECTIVES ANDORGANIZATION
In MUD, as well as in other communications problems, digital modulations and channel ef- fects are conveniently described via complex-valued (phasor) representations, which result in a BSS model of complex mixtures. Solving the complex-mixture problem is also of interest in other important applications such as seismic sounding [14, 24]. The purpose of this paper is to extend to the complex case some of the results unfolded in [33] for the real-mixture scenario.
In particular, we want to lay emphasis on the existing analogy between the direct solutions to the real and complex problems. The definition of a special set of numbers in accordance with the particular structure of matrixQplays a crucial role in achieving these objectives.
The contents are divided in the following sections. In the first place, Section 2 deals with the peculiarities of the problem in the complex case, and sets the statistical nomenclature that will be used for complex variables. Section 3 defines a new number class – so calledbicomplex numbers– in compliance with the structure and properties of the unitary transformation to be identified. By using this new tool, a family of closed-form estimators based on the HOS (the higher-order cumulants) of the whitened signals is disclosed in Section 4. This family is totally analogous to the one found for the real case in [33]. At fourth order, the general expression reduces to the complex extension of the EML estimator of [30]. Similarly as occurred for its real counterpart, this 4th-order estimator suffers a performance degradation when the sks is near zero. To surmount this limitation, a hybrid estimation procedure is proposed, featuring an alternative estimator also based on 4th-order cumulants. The simulations reported in Section 6 illustrate these results and compare the methods studied to other BSS techniques. Conclusions are left for Section 7. Table 1 lists all the acronyms employed throughout the paper.
2. Complex Mixtures
2.1. PROBLEMPARATERIZATION ANDEQUIVALENCE CLASS OFVALID SOLUTIONS
In the complex case, the 2×2 unitary transformationQbecomes a unitary matrix, with the generic shape:
Q=
a −b∗ b a∗
, (3)
symbol∗representing complex conjugation. Up to (irrelevant, due to the source scaling inde- terminacy) pre-multiplication by a unit-norm diagonal matrix, the above matrix reduces to a complex Givens rotation, which exhibits the general form:
Q=Q(θ, α) , (4)
with
Q(·,·):R×R→C2×2, Q(θ, α)=
cosθ −e−j αsinθ ej αsinθ cosθ
. (5)
Table 1. List of acronyms.
Acronym Definition
AEML alternative EML
AML approximate maximum likelihood BSS blind source separation
CAEML complex AEML
CCF complex CF
CDMA code division multiple access
CEML complex EML
CF Comon’s formula
combCCF combined CCF
combCEML combined CEML
EML extended maximum likelihood HOS higher-order statistics ISR interference-to-signal ratio
JADE joint approximate diagonalization of eigenmatrices
MC Monte Carlo
ML maximum likelihood
MUD multiuser detection MUI multiuser interference pdf probability density function PRBS pseudorandom binary sequence QAM quadrature amplitude modulation skd source kurtosis difference
sks source kurtosis sum
SNR signal-to-noise ratio SOK sum of output kurtosis SOS second-order statistics SVD singular value decomposition
Therefore, two parameters, angleθ and phaseα, need to be estimated in order to achieve the identification of the relevant matrix, and hence the source extraction. If θˆ and αˆ represent estimates ofθ andα, respectively, then the source separation is attained through
ˆ
x=QHz=QHQx, Q =Q(θ ,ˆ α) .ˆ (6) The actual values of(θ, α)are not the only solutions to provide a valid source extraction. Due to the indeterminacies commented in Section 1.1, any solution of the form
GQHQ=P D, (7)
withP a permutation and D a regular diagonal matrix is considered as acceptable, since it preserves the source waveforms. Accordingly, the following set of solutions are equivalent in the waveform-preserving sense to(θ, α), i.e.,(θ ,ˆ α)ˆ ≡(θ, α):
(θ ,ˆ α)ˆ =
θ+mπ
2 , α+2π n
, and (8a)
(θ ,ˆ α)ˆ =
−θ +mπ
2 , α+(2n+1)π
, n, m∈N. (8b)
Hence, we call [(θ, α)] {(θ,ˆ α)ˆ ∈ R2 | (θ ,ˆ α)ˆ ≡ (θ, α)} the equivalence class of valid solutionsof our problem.
2.2. COMPLEX-VARIATE STATISTICS
The methods studied in this paper rely on the higher-order cumulants of the data. There are different possible natural ways to define the cumulants for complex variables [1]. For our purposes, however, it is convenient to choose:
Cumzi
1i2i3...Cum[z∗i1, zi2, z∗i3, . . .]. (9) The same pairwise conventions as those defined in [33] hold (Kendall’s convention [23]):
κnz−r,r Cumz1...1
n−r
2...2
r
. (10)
For unit-variance zero-mean uncorrelated components, the expressions for the 4th-order cumulants are:
κ40z =E[|z1|4] − |E[z21]|2−2, κ31z =E[z∗1|z1|2z2] κ22z =E[|z1|2|z2|2] −1, κ13z =E[z∗1|z2|2z2] κ04z =E[|z2|4] − |E[z22]|2−2,
(11) where E[·]denotes the mathematical expectation. Note that certain cumulants may take com- plex values (such asκ31z andκ13z above). It is precisely from these complex-valued cumulants that the complex phaseαcan be estimated.
Owing to the linear relationship between the whitened observations and the source signals represented by Equations (2), (4) and (5), and the multilinearity property of cumulants [18, Section 2.4], we have:
κ40z =cos4θ κ40x +sin4θ κ04x
κ31z =cos3θsinθej ακ40x −sin3θcosθej ακ04x κ22z =cos2θsin2θ (κ40x +κ04x)
κ13z =cosθsin3θej ακ40x −sinθcos3θej ακ04x κ04z =sin4θ κ40x +cos4θ κ04x .
(12)
3. Bicomplex Numbers
The use of complex numbers facilitates the closed-form identification of the orthogonal mix- ing matrix in the real case [30, 33]. Since we are now dealing with complex mixtures, it seems
natural to seek certain type of extension of the complex numbers. We carry out such extension as follows.
DEFINITION 1 (bicomplex number). A bicomplex number x¯ ∈ Bis an expression of the form:
¯
x=a+j b, a, b∈C, j2= −1. (13)
Although symbolj has the same numeric value as the usual imaginary unitj, it must be understood as “orthogonal” to or in a “different space” from that ofj, so that they actually repesent distinct algebraic elements.
DEFINITION 2 (breal and bimaginary part). Givenx¯ =a+j b∈ B,Re(x)¯ =a ∈Cis the bicomplex-real (breal)part ofx, and¯ Im(x)¯ =b ∈ Cits bicomplex-imaginary (bimaginary) part.
This terminology prevents confusion with the commonplace real [Re(·)] and imaginary [Im(·)] parts of complex numbers, which are always real valued. Accordingly, j is named bimaginary unit. Due to the nature of the matrix to be identified, our attention is restricted to the subset:
DEFINITION 3 (unitary bicomplex number). Bu = { ¯x ∈ B : |a|2+ |b|2 = 1}is the set of unitarybicomplex numbers.
It is straightforward to establish an isomorphism between the set of bicomplex numbers and a particular set of matrices related to our problem. Let us first define this matrix set.
DEFINITION 4. Uis the set ofunitary 2×2matrices:U= {U ∈C2×2:U UH=UHU =I}. The setUunder ordinary matrix multiplication forms a non-Abelian group. In particular, Uis a subgroup of SL(2,C), the so-called special linear group of 2×2 matrices overC[11].
Any matrixU ∈Ucan be written in general form as:
U =
a −b∗ b a∗
, a, b∈C, |a|2+ |b|2=1. (14) The product of two unitary matrices is given by:
U1U2=
a1a2−b1∗b2 −a1b∗2−b∗1a2∗ b1a2+a1∗b2 −b1b∗2+a∗1a2∗
. (15)
Now, we can define theproductof two bicomplex numbersx¯1andx¯2in compliance with the product ofU-matrices by regarding the first column of the previous expression, so that:
¯
x1x¯2[Re(x¯1)Re(x¯2)−Im∗(x¯1)Im(x¯2)] +j[Im(x¯1)Re(x¯2)+Re∗(x¯1)Im(x¯2)]. (16) With this product operation: j2 = −1, in agreement with Definition 1. Also, if both the breal and bimaginary parts of the bicomplex operands are real, such product reduces (chang- ingj forj) to the usual complex-number multiplication. Hence, Equation (16) is a natural generalization of the complex product. By virtue of these conventions, the mapping
ℵ :U→Bu, (17)
with
ℵ(U)=u11+j u21 =a+j b, U ∈U, (18)
represents an isomorphism betweenUunder normal matrix multiplication andBuunder the bicomplex-number product as defined in (16). Effectively,ℵ(·)is one-to-one, onto and oper- ation preserving, conditions that must be fulfilled by any isomorphism [11]. From the former two conditions, anyx¯ = a+j b ∈ Bucan be uniquely associated to the first column of one and only one matrixU = [u1,u2] = ℵ−1(x)¯ ∈U,u1= [a, b]T. The latter condition is due to (16), which guaranteesℵ(U1U2)= ℵ(U1)ℵ(U2). Therefore, the related sets are isomorphic:
U ∼ Bu. Having established this connection, and owing to the properties of isomorphisms, the following definitions naturally follow,∀ ¯x ∈B:
DEFINITION 5 (conjugation). x¯∗=Re(x)¯ ∗−jIm(x).¯ This is because the first column ofUHis[a∗,−b]T.
DEFINITION 6 (modulus). | ¯x|2= ¯xx¯∗ = ¯x∗x¯ = |Re(x)¯ |2+ |Im(x)¯ |2.
Effectively, the first column ofU UHandUHUis given by[|a|2+|b|2,0]T. As an immediate consequence:
DEFINITION 7 (inverse). x¯−1= ¯x∗/| ¯x|2.
For instance,j−1= −j, just like its complex twinj.
If the unitary transformation is parameterized like in (4)–(5), then the associated unitary bicomplex number becomes an important particular case.
DEFINITION 8 (bicomplex exponential).Expression
ej θα cosθ +jej αsinθ, θ, α∈R, (19)
is calledbicomplex exponential, by analogy with its familiar complex counterpart.
The following properties of the bicomplex exponential are easily proven,∀n∈ N,∀θ, θ1, θ2, α ∈R:
(i) ej θα 1ej θα 2 =ej θα 2ej θα 1 =ej (θα 1+θ2), and hence(ej θα )n=ej nθα (20a)
(ii) (ej θα )−1=(ej θα )∗=e−αj θ (20b)
(iii) ej θα±π =e−αj θ (20c)
(v) ej (θ+
nπ 2)
α =
(−1)n2ej θα , neven (−1)n−21jej αej θα , nodd.
(20d) The bimaginary unit can be written asj =ej
π 2
0 , so that, as an important special case of (20d):
(vi) jn=
(−1)n2, neven (−1)n−12 j, nodd,
(20e)
identities which again highlight the similarity between the bimaginary and the imaginary units. Another immediate consequence of property (20d) is that ej
nπ
α 2 is always a pure breal (n even) or pure bimaginary (nodd) bicomplex quantity, which correspond to unitary matrices of the formP D[Equation (7)]. Hence, global matrixGrepresents a valid separating solution if and only if its associated bicomplex number ℵ(G) is pure breal or pure bimaginary. As G = QHQ, then ℵ(G) = ℵ(Q) ∗ℵ(Q) = (ejαˆθˆ)∗ej θα . Now, straightforward application of properties (20) lead to the equivalent class of valid solutions (8), thus efficiently formulated in the bicomplex domain.
The next definition extends the argument (“arg”) function.
DEFINITION 9 (barg function). Given x¯ = | ¯x|ej θα , let the bicomplex argument (barg) function be defined as
(ψ, ϕ)= x,¯
ψ= (Re(x)¯ +j|Im(x)¯ |)
ϕ= Im(x),¯ (21)
where (·) ∈] −π, π]provides the angle with respect the real axis of its complex argument.
Then:ej ψϕ =ej θα . In particular:(ψ, ϕ)≡(θ, α).
In the real-mixture case, the so-calledscatter diagramallows to simplify the development and provides an insightful geometrical interpretation of the real rotation taking place over the source signals after pre-whitening [12, 30, 31]. If z = Q(θ )x, with Q(θ ) = Q(θ,0) and x,z∈R2, thenz1+j z2=ej θ(x1+j x2). Taking into account bicomplex multiplication rule (16) and exponential expression (19), it is straightforward to prove that a similar relationship holds in the complex-mixture case too.
PROPOSITION 10 (bicomplex scatter-diagram). Ifz=Q(θ, α)xthen
¯
z=ej θα x ,¯ (22)
with x¯ = (x1 + j x2) and z¯ = (z1 +j z2). By analogy to the real case, the bicomplex numbers x¯ and z¯ are called bicomplex scatter-diagram points of the source and whitened signals, respectively.
In the real case, the remaining orthogonal transformation after pre-whitening is easily interpreted as a geometric rotation of the scatter plots. By contrast, in the complex case both the scatter-diagram points and the associated “rotation” defined above, though algebraically analogous to its real-mixture equivalent, become less illuminating as a geometric notion.
Many of the above new concepts will allow us to simplify the notation of the complex- case closed-form estimators which are presented next. The beauty of bicomplex numbers is that by constraining the breal and bimaginary parts to be real quantities, i.e., the associated transformation to have real elements, one is at once dealing with the familiar complex numbers and, in the context of BSS, with the real-mixture scenario.
4. Complex Analytic Estimators
With the aid of the bicomplex-number tools developed in the previous section, [33, Theorem 1]
accepts the following extension to complex mixtures, which is proved in the Appendix.
THEOREM 11. Defineξ¯n(z)as the following bicomplex weighted sum of pairwise nth-order cumulants of the components ofz, withn∈N+:
ξ¯n(z) n
r=0
n r
jrκnz−r,r. (23)
Ifz=Q(θ, α)x, withxmade up of independent components, then:
ξ¯n(z)=ej nθ(−1)nαξ¯n(x) , (24)
where, according to (23),
ξ¯n(x)=κn0x +jnκ0nx . (25)
By analogy with the real case, cumulant combination (23) is termed whitened-observation nth-order bicomplex centroid. Ifξn(x)is known or can be estimated, parameters(θ, α)may be readily obtained in closed-form from (24) via:
(nθ,ˆ α)ˆ =
ξ¯n(z) ξ¯n(x)
, (26)
where “ (·)” represents the barg function defined in (21).
Solution Indeterminacy
The indeterminacy issues discussed in [33, Section 3] for the real-case estimation family also apply to estimators (26). This indeterminacy lies in the fact that exp(j nθ ) = exp(j n(θ + 2π m/n)), for any n, m ∈ N, which may lead to a non-valid separation solution. In addi- tion, the value ofξ¯n(x)(i.e., the source marginalnth-order cumulants) is not known a priori.
Nevertheless, these two difficulties do not arise in all cases.
Fourth Order
At fourth-order, (24) becomes:
ξ¯4(z)=(κ40z −6κ22z +κ04z )+j4(κ31z −κ13z )=ej4θα (κ40x +κ04x)=ej4θα γ , (27) so that the unknown parameters can be obtained via(4θ ,ˆ α)ˆ = (ξ¯4(z)/ξ¯4(x)). Due to the connection with its real counterpart [30, 33], this is called theComplex EML (CEML)estima- tor. We might as well have obtained the same result by developing the 4th-order cumulants (12) directly. Effectively:
κ40z −6κ22z +κ04z =cos(4θ )γ
4(κ31z −κ31z )=ej αsin(4θ )γ , (28)
Just like its real twin, the sksγ = ¯ξ4(x)may be obtained from the available signals as in [30]:
γ =κ40z +2κ22z +κ04z =κ40x +κ04x . (29)
We conclude this section with a final remark. In [6], the following relationship was found for the whitened-observation 4th-order cumulants in the real case:
ρ2−σ+2=0, with
ρ κ31zκ−zκ13z
22
σ κ40zκ+zκ04z
22
. (30)
Note that relating the norms of the terms at both sides of the second equality in (27), and using
|ρ|2instead ofρ2, one arrives at the complex equivalent of the above relationship.
5. Alternative Fourth-Order Estimator
5.1. THECOMPLEXAEMLAND ACOMBINEDESTIMATION STRATEGY
As in the real case, the CEML estimator derived from Equation (27) cannot be used for zero sks, since thenξ¯4(x) = 0 (the performance degradation of this estimator aroundγ = 0 will be demonstrated in the experiments of Section 6). To overcome this deficiency, we resort to the complex version of the estimator put forward in [33, Section 5]. In the complex case, and by virtue of relationships (12), the associated cumulant combination develops into:
ξ¯4(z)=(κ40z −κ04z)+2j (κ31z +κ13z )=ej2θα η , (31) where, η = κ40x −κ04x represents the source kurtosis difference (skd), whose actual sign is unimportant for the estimation of the true parameters [due to equivalence class (8) and property (20d)]. Then,(2θ ,ˆ α)ˆ = ξ¯4(z). For the sake of consistency in the nomenclature, we name this theComplex AEML (CAEML)estimator.
The same problem as in the real case about when to use (27) or (31) naturally arises at this point. In a bid to shed some light on this issue, experiments along the lines of those described in [33, Section 5.2] were carried out with the complex versions. This time, 4-QAM signals were used as sources, with independent real and imaginary parts composed of respective real- valued PRBSs with the same symbol probability. In these conditions, the (normalized) kurtosis of the resultant 4-QAM signal isκ4x =κ4(PRBS)x /2, so that any kurtosis value equal or greater than –1 can be obtained by suitably varying the PRBS symbol probability [28, 29]. At each separation, the interference-to-signal ratio (ISR) [31, 33] was used as a performance index:
ISR= E
i=j
|(QTQ)ij|2
|(QTQ)ii|2
. (32)
Signals were composed of T = 5000 samples, and the mean square value of ISR was computed over 100 Monte Carlo (MC) runs, given in dB as ISR2(dB) = 5 log10(E[ISR2]).
The results obtained, plotted in Figure 1, are very similar to those of [33, Section 5.2]. The alternation in best performance occurs when the skd is equal to the sks (around –1), with the CEML offering lower ISR for|γ|>|η|, and vice versa. Since|¯ξ4(z)| = |γ|and|¯ξ4(z)| = |η| (just like for their real counterparts [33]), this leads to the conclusion that decision rule
|¯ξ4(z)|
CEML
><
CAEML
|¯ξ4(z)| (33)
Figure 1. Performance of CEML, CAEML and combCEML vs. sks. 4-QAM sources,κ40x = −1,θ = 30◦, α=45◦.
may as well be applied in the complex case. Although θ = 30◦and α = 45◦were used in Figure 1, this conclusion is essentially identical for all angle values(θ, α). Figure 1 also shows the results obtained under the same conditions by using the combined estimation scheme derived from the above rule, which we thus call combined CEML (combCEML)estimator.
The performance degradation exhibited by the CEML and the CAEML aroundγ = 0 and η=0, respectively, is now corrected with the combCEML.
5.2. CONNECTIONS
In [7] another two-closed form estimators are proposed to solve the complex-mixture problem from the pre-whitened signals. The first one (which is also featured in [9] and [13], and we refer to as complex CF, CCF) is simply the complex extension of the real estimator developed in [6] and analyzed in [33, Section 2]. Interestingly, the second one happens to coincide with the complex alternative estimator suggested in the previous section.1To choose between the two, the decision rule
|κ22z|
1st.
><
2nd.
|κ31z +κ13z | (34)
is utilized, the rationale behind it being the conditioning of the respective parametersρ [like that in Equation (30)] of the estimators. For consistency in the nomenclature, we refer to the combined estimator derived from this rule as combined CCF (combCCF).
Herein we endow estimator (31) with a novel reformulation in terms of 4th-order cumu- lant bicomplex centroids, coherent with the unified notation used for the other closed-form methods considered in this paper and in [33]. This allows us: (1) to easily interrelate estimator (31) to the estimators studied in Section 4 and in [33], and (2) to obtain a practical decision rule for the combined estimation strategy developed in Section 5.1 (combCEML), which, we
1 Note that [7, expression (13)] should actually read “arg{θk} = −arg{ρ} +kπ” when the second formula of ρ(Equation (15) therein) is employed.
Figure 2. Mean square ISR vs. parameterθ. 16-QAM and complex Gaussian sources,α=65◦.
anticipate, will prove more effective than the combCCF rule (34). This last claim is put to test in the next section.
6. Simulation Results
This section reports some numerical simulations that illustrate and give support to the preced- ing theoretical results. We also assess and compare the relevant methods in both noiseless and noisy environments. As in Section 5.1 in the experiments that follow, unless stated otherwise, all signals are composed of 5000 samples and MC iterations are run over 100 independent mixture realizations. At each MC iteration, identical signal realizations are fed into all meth- ods considered. Performance measure ISR2(dB)(see Section 5.1) is obtained by averaging the results over all iterations, and is the value represented at each point in the plots.
Variation with Angular Parameters
We first evaluate estimators CEML (27), CAEML (31), and CCF [7]. We are primarily con- cerned with their behaviour as a function of angular parametersθ and α. Figure 2 plots the results for varyingθ, with 16-QAM and complex Gaussian sources. It is seen how both CCF and CEML performance depends onθ, whereas CAEML performance is independent ofθ. Nevertheless, the performance of these methods is independent of parameterα, as illustrated by Figure 3.
Comparison in Noiseless Environments
In the second place, the combined estimation strategies based on decision rules (33) and (34) are assessed and compared to other methods. The other methods considered are SOK, SOK and JADE. Recall that the SOK is the closed-form solution given in [10] to the maximization of a 4th-order contrast function of [19]. It was shown in [33] that such solution corresponds to the EML when instead of the equal sign of source kurtosis the sign of source kurtosis sum is used. In this manner, the applicability of the contrast function was expanded, since the need for a prior knowledge of the signs of source kurtosis and for these signs to be equal is spared. Accordingly, SOKcorresponds to solution SOK, but using in the associated algorithm
Figure 3. Mean square ISR vs. parameterα. 16-QAM and complex Gaussian sources,θ=5◦.
the sign of source kurtosis sum as calculated from the whitened data by (29). On the other hand, the well-known JADE (joint approximate diagonalization of eigenmatrices) method is based on the optimization of another 4th-order contrast function, which can be efficiently carried out through the diagonalization of certain set of cumulant-tensor matrix slices, so- calledeigenmatrices[4].2
In this comparison experiment, different sks are simulated by means of 4-QAM signals of appropriate symbol probability (as commented in Section 5.1). In a bid to smoothen the result curves, now 200 MC iterations are run at each sks point. For SOK, positive source kurtosis is assumed. Figure 4 summarizes the results. Note, first, the close performance trends between the JADE and the combCEML. On the other hand, decision rule (34) does not seem to be effective, since the combCCF performance worsens as the kurtosis of the sources tend to be similar. Results for CEML and SOK are exactly identical besides the region near γ = 0, around which the former improves the latter. This suggests that the conclusion drawn in [33, Section 4.3] regarding the relationship between the CEML and the SOK might perhaps be extended to the complex case as well, even though the complex SOK solution makes use of a whitened-observation 4th-order cross-cumulant which is not regarded by the CEML.
However, further experiments (Figure 5) confirm that, as was to be expected from the CEML performance variation withθevidenced by Figure 2, this similarity decreases for higher values ofθ,3although the performances of both methods remain extremely near. Figure 5 also shows that the performance of the combCCF gets closer to that of the combCEML for higherθ.
2 The MATLAB function implementing this method was downloaded from http://www-sig.enst.fr/ ˜car- doso/guidesepsou.html. The pre-whitening part was skipped, since only unitary mixtures were considered in these simulations.
3 For “highθ” is understood values ofθ around(2m+1)π/4,m ∈ N, since the equivalent class of valid solutions (8) areπ2-periodic inθ.
Figure 4. Mean square ISR vs. sks. 4-QAM sources,κ40x = −1,θ=5◦,α=65◦.
Figure 5. Mean square ISR vs. sks. 4-QAM sources,κ40x = −1,θ=30◦,α=65◦.
Comparison in Noisy Environments
Finally, we assess the CEML and JADE in noisy environments. Now, three sources and sensors are employed, together with the arbitrary non-unitary mixture:
M=
1 −1 1
2 3 4
−2 1 3
, (35)
whose choice is motivated by having also been used in the real-case experiments of [32, Section 4]. Pre-whitening is carried out, for both methods, via the SVD [8, 31]. In order the extend the CEML to more than two signals, the pairwise notion originally proposed in [8] utilized. After pre-whitening, signals are processed in a two-by-two fashion, applying the suitable estimators to each signal pair in turn. The sweep over the q(q −1)/2 signal pairs is repeated around(1+ √q)times, value which coincides with that suggested for the method of [8]. The signal-to-noise ratio (SNR) is defined sensor-wise, as the power due to the
Figure 6. Mean square ISR vs. SNR, for 3 uniform-phase constant-modulus sources and 3 sensors corrupted by additive complex Gaussian noise.
Figure 7. Mean square ISR vs. SNR, for 3 sources (2 uniform-phase constant-modulus and 1 complex-Gaussian) and 3 sensors corrupted by additive complex Gaussian noise.
sources over the power due to the noise, and chosen to be the same at all sensors. Again, we process 5000-sample signal blocks over 100 MC runs at each SNR value, ranging from –30 dB to 30 dB in 1-dB steps. Figure 6 shows the performance results for independent constant- modulus (with uniformly distributed phase in[−π, π]) sources in additive complex Gaussian noise. As a first conclusion, the CEML extension to more than two signals works successfully.
On the other hand, remark the anomalous trend exhibited by JADE in this situation, whereas the CEML improves as the noise power decreases (high SNR), just as it is reasonable to expect. To obtain the results in Figure 7, one of the sources is changed to a complex Gaussian distribution. Now the performance of JADE improves over CEML in the positive SNR range.
The reason for JADE’s pathological behaviour in the previous experiment is unclear, but it seems to be observed when all sources have the aforementioned particular distribution. Similar unexpected performance was reported in [32] for the method of [8] when Toeplitz-circulant mixtures were processed under certain source and noise distribution combinations.
7. Conclusions and Outlook
In flat-fading propagation conditions, joint MUI cancellation in CDMA communication sys- tems develops into a problem of BSS in complex-valued instantaneous linear mixtures. In the present work we have defined the bicomplex numbers, forming an isomorphic set to the group of transformations relevant to the BSS problem after pre-whitening. This algebraic formalism has permitted an elegant extension to the complex-mixture case of a number of closed-form estimation ideas originally derived in real scenarios, including the concepts of scatter diagram and centroid. The extended centroids appear as specific bicomplex linear combinations of the whitened-vector cumulants which are able to retain the information contained in the unitary mixing transformation, giving rise to compact closed-form expressions for the estimation of the pertinent parameters. Consequently, a unified formulation of the real and the complex problems within the framework of their analytical solutions has been devised.
Paths for the continuation of this investigation include the asymptotic performance analysis of the complex estimators studied in this paper, which could possibly be accomplished along the lines of the asymptotic analysis [30, 33] of their real counterparts. This analysis would explain, for example, the dependency onθof the CEML method, which does not occur for its real equivalent [33]. Also, the theoretical study of the noise impact on the estimators’ perfor- mance remains an interesting open subject. Finally, only a few properties of the new class of numbers presented in this contribution have been utilized herein; however, it is envisaged that the full potential of the bicomplex formalism may be employed in further research to generate new results on BSS, or even on problems in other fields, hence opening exciting new research directions.
References
1. P.-O. Amblard, M. Gaeta and J.-L. Lacoume, “Statistics for Complex Variables and Signals – Part I:
Variables”,Signal Processing, Vol. 53, pp. 1–13, 1996.
2. J.F. Cardoso, “On the Performance of Orthogonal Source Separation Algorithms”, inProceedings EUSIPCO, Edingburgh, U.K., 1994, pp. 776–779.
3. J.F. Cardoso, “The Invariant Approach to Source Separation”, inProceedings NOLTA, 1995, pp. 55–60.
4. J.-F. Cardoso and A. Souloumiac, “Blind Beamforming for Non-Gaussian Signals”,IEE Proceedings-F, Vol. 140, No. 6, pp. 362–370, 1993.
5. R.M. Clemente and J.I. Acha, “Blind Separation of Sources Using a New Polynomial Equation”, IEE Electronics Letters, Vol. 33, No. 3, pp. 176–177, 1997.
6. P. Comon, “Separation of Stochastic Processes”, in Proceedings Workshop on Higher-Order Spectral Analysis, Vail, CO, 1989, pp. 174–179.
7. P. Comon, “Higher-Order Separation, Application to Detection and Localization”, inProceedings EUSIPCO, Barcelona, Spain, 1990, pp. 277–280.
8. P. Comon, “Independent Component Analysis, A New Concept?”,Signal Processing, Vol. 36, No. 3, pp.
287–314.
9. P. Comon and P. Chevalier, “Blind Source Separation: Models, Concepts, Algorithms and Performance”, in S.S. Haykin (ed.),Unsupervised Adaptive Filtering, Vol. I: Blind Source Separation, Wiley Series in Adaptive and Learning Systems for Communications, Signal Processing and Control, John Wiley & Sons:
New York, 2000.
10. P. Comon and E. Moreau, “Improved Contrast Dedicated to Blind Separation in Communications”, in Proceedings ICASSP, Munich, Germany, 1997, pp. 3453–3456.
11. J.A. Gallian,Contemporary Abstract Algebra, Houghton Mifflin Company: Boston, MA, fourth edition, 1998.
12. F. Harroy and J.-L. Lacoume, “Maximum Likelihood Estimators and Cramer–Rao Bounds in Source Separation”,Signal Processing, Vol. 55, pp. 167–177, 1996.
13. J.-L. Lacoume, P.-O. Amblard and P. Comon,Statistiques d’Ordre Supérieur pour le Traitement du Signal, Collection Sciences de l’Ingénieur, Masson: Paris, 1997.
14. J.-L. Lacoume, F. Glangeaud and J. Mars, “Blind Separation of Polarized Waves”, inProceedings EUSIPCO, Rhodes, Greece, 1998, pp. 1629–1632.
15. L.D. Lathauwer, D. Callaerts, B.D. Moor and J. Vandewalle, “Fetal Electrocardiogram Extraction by Source Subspace Separation”, inProceedings IEEE/ATHOS Signal Processing Conference on Higher-Order Statistics, Girona, Spain, 1995, pp. 134–138.
16. S. Makeig, A.J.Bell, T.-P. Jung and T.J. Sejnowski, “Independent Component Analysis of Electroencephalo- graphic Data”,Advances in Neural Information Processing Systems, Vol. 8, pp. 145–151, 1996.
17. A. Mansour and C. Jutten, “A Direct Solution for Blind Separation of Sources”,IEEE Transactions on Signal Processing, Vol. 44, No. 3, pp. 746–748, 1996.
18. P. McCullagh,Tensor Methods in Statistics, Monographs on Statistics and Applied Probability, Chapman and Hall: London.
19. E. Moreau, “Criteria for Complex Source Separation”, inProceedings EUSIPCO, Trieste, Italy, 1996, pp.
931–934, 1996.
20. T. Ojanperä and R. Prasad,Wideband CDMA for Third Generation Mobile Communications, Artech House:
Boston, MA, 1998.
21. J.G. Proakis,Digital Communications, McGraw-Hill: New York, third edition, 1995.
22. J.J. Rieta, V. Zarzoso, J. Millet-Roig, R. García-Civera and R. Ruiz-Granell, “Atrial Activity Extraction Based on Blind Source Separation as an Alternative to QRST Cancellation for Atrial Fibrillation Analysis”, inComputers in Cardiology, Boston, MA, pp. 67–72, 2000.
23. A. Stuart and J.K. Ord,Kendall’s Advanced Theory of Statistics, Vol. I, Edward Arnold: London, sixth edition, 1994.
24. N. Thirion, J. Mars and J.-L. Boelle, “Separation of Seismic Signals: A New Concept Based on a Blind Algorithm”, inProceedings EUSIPCO, Trieste, Italy, 1996, pp. 85–88.
25. K. Torkkola, “Blind Separation for Audio Signals – Are We There Yet?”, inProceedings 1st International Workshop on Independent Component Analysis and Signal Separation, Aussois, France, 1999, pp. 239–244.
26. M.K. Tsatsanis and D. Slock, “Multi-User Communications/Multi-User Detection”, in G. Giannakis (ed.), IEEE Signal Processing Magazine, “Highlights of Signal Processing for Communications”, pp. 38–42, 1999.
27. S. Verdú,Multiuser Detection, Cambridge University Press: Cambridge, U.K., 1998.
28. V. Zarzoso, “Closed-Form Higher-Order Estimators for Blind Separation of Independent Source Signals in Instantaneous Linear Mixtures”, Ph.D. Thesis, The University of Liverpool, U.K., 1999.
29. V. Zarzoso and A.K. Nandi, “Modelling Signals of Arbitrary Kurtosis for Testing BSS Methods”, IEE Electronics Letters, Vol. 34, No. 1, pp. 29–30, 1998. (Errata: Vol. 34, No. 7, Apr. 2, 1998, p. 703.)
30. V. Zarzoso and A.K. Nandi, “Blind Separation of Independent Sources for Virtually Any Source Probability Density Function”,IEEE Transactions on Signal Processing, Vol. 47, No. 9, pp. 2419–2432, 1999a.
31. V. Zarzoso and A.K. Nandi, “Blind Source Separation”, in A.K. Nandi (ed.),Blind Estimation Using Higher- Order Statistics, Kluwer Academic Publishers: Boston, MA, pp. 167–252, 1999b.
32. V. Zarzoso and A.K. Nandi, “Blind Source Separation Without Optimization Criteria"”, in Proceedings ICASSP, Vol. III, Phoenix, AZ, 1999c, pp. 1453–1456.
33. V. Zarzoso and A.K. Nandi, “Closed-Form Estimators for Blind Separation of Sources – Part I: Real Mixtures”,Wireless Personal Communications, Vol. 21, No. 2, pp. 5–28, 2002.
Appendix. Proof of Theorem 11
The steps of the proof are just like those in [33, Theorem 1], but now we have to take into account the complex conjugations in the definition of the cumulants [Equation (9)]. Two cases must be distinguished. Ifnis even:
κnz−r,r =
i1...in
q1i∗
1q1i2q1i∗
3. . .
n−r
. . . q2in−2q2i∗n−1q2in
r
Cumxi
1...in, (36)
whereqij =(Q)ij. Taking into account the source independence and thatq11=q22 =cosθ, q21 = −q12∗ =ej αsinθ, the above equation develops into:
κnz−r,r =(cosθ )n−rsinrθej α(rmod 2)κn0x +(−sinθ )n−rcosrθej α((r−n)mod 2)κ0nx . (37) Sincejr =jnj−(n−r)=jn(−j )n−r, expression (23) becomes:
ξ¯n(z) = n
r=0
n r
jr[(cosθ )n−rsinrθej α(rmod 2)κn0x +(−sinθ )n−rcosrθej α((n−r)mod 2)κ0nx]
= n
r=0
n r
(cosθ )n−r(jsinθ )rej α(rmod 2)κn0x +jn
n r=0
n r
(jsinθ )n−rcosrθej α((n−r)mod 2)κ0nx.
(38)
Now, the breal part of the first summatory is n
0
cosnθ− n
2
(cosθ )n−2sin2θ + · · · =cos(nθ ) , (39) whereas the bimaginary part reads
n 1
(cosθ )n−1sinθ− n
3
(cosθ )n−3sin3θ + · · ·
ej α=ej αsin(nθ ) . (40) The second sum produces exactly the same results. Since jn is real for neven and thus it commutes with bicomplex quantities, then:
ξ¯n(z)=(cos(nθ )+jej αsin(nθ ))(κn0x +jnκ0nx)=ej nθα (κn0x +jnκ0nx) . (41) The first half of the theorem is then proven, sinceξ¯n(x)=κn0x +jnκ0nx. Fornodd, however:
κnz−r,r =
i1...in
q1i∗1q1i2q1i∗3. . .
n−r
. . . q2i∗n−2q2in−1q2i∗n
r
Cumxi1...in, (42)
which turns into
κnz−r,r =(cosθ )n−rsinrθe−j α(rmod 2)κn0x +(−sinθ )n−rcosrθej α((n−r)mod 2)κ0nx . (43) So now the bimaginary part of the first sum becomes e−j αsin(nθ ), whereas the rest of the terms remain just as before. Hence
ξ¯n(z) = (cos(nθ )+jej αsin(nθ ))κn0x +jn(cos(nθ )+jej αsin(nθ ))κ0nx
= ej nθ−α κn0x +jnej nθα κ0nx .
Butjej nθα =ej nθ−α j, and thenξ¯n(z)= ej nθ−α (κn0x +jnκ0nx)fornodd. Therefore ∀n: ¯ξn(z) =
ej nθ(−1)nαξ (x).¯