Modulo Compressed Sensing

Compressed Sensing

Recover a high dimensional vector x with a very few non-zero values. Plethora of algorithms

Sensors and Measurements

  • Sensors have Finite Dynamic Range

  • Loss of information due to clipping. One potential approach to fix this is to wrap around the signal.

    image-20210610112718042

This is where Modulo comes in.

Why can’t we scale the signal?

There is an inherent quantization step involved during measurements. Therefore, scaling down and scaling back up will have the same resolution.

  • Quantization error is proportional to the maximum value of the input signal.

Modulo Compressed Sensing

Typically, the real world sensors have a finite dynamic range. They have a clipping effect where the measurements become saturated once the values cross the range of the sensor. High dynamic range systems are affected by high quantization noise. To counter this problem, a new approach for measurements called self-reset analog to digital converters (SR-ADCs) have been proposed.

These sensors fold the amplitudes back into the dynamic range of the ADCs using the modulo arithmetic. However, these systems encounter loss due to the modulo operation. The transfer function of the SR_ADC with parameter λ is given by

Mλ=2λ([[t2λ+12]]12)

Here, [[t]]t=t. Now, we need to consider how we shall sample the values. A sampling theory called unlimited sampling framework was developed which provides sufficient conditions on the sampling rate for guaranteeing the recovery of band-limited signals from the folded samples. SR-ADC is applied individually to multiple linear measurements of the images, and is termed as modulo compressed sensing.

Note. The measured values may lie outside the dynamic range due to sensor noise.

Let us concretely define the problem. Let xRN denote an s-sparse vector (||x||0s) with s<N2. We obtain m projections of x as follows:

zi=[[ai,x]],i=1,2,,m

Usually, mN in the compressed sensing paradigm. Stacking these projections, we get

z=[[y]]=[[Ax]]

The non-linearity introduced by the modulo operation along with the undetermined compressive measurements could lead to an indeterminate system.

P0: Any real valued vector yRm can be uniquely decomposed as y=z+v where z[0,1)m and vZm denote the fractional and integral part of y respectively. The following optimization problem is defined as P0

argminx,vx0 subject to Ax=z+v;vZm
  • Notice that v is the integral part and not z, which may be confusing.
  • The optimization equation does not oblige z to belong to [0,1)m. In fact, this condition is implicitly taken care when we impose minimality over z.
  • The w0 condition comes due to the sparsity of w.
  • Any s sparse solution to P0 such that ss is also s sparse.

Identifiability

Lemma. Any vector x satisfying x0s<N2 is a unique solution to the optimization problem P0 iff any 2s columns of matrix A are linearly independent of all vZm


Proof.

() Suppose a set of 2s columns of A, say S, are not linearly independent. That is,

Ay=wZm where yRn has support S

Consider a vector x defined from y whose support is the first s columns of S. Similarly, define x1 whose support is the remaining s columns of S. Let Ax=z+v where z(0,1)m and vZm. We have

Ax1=AyAx=w(z+v)

Therefore, x1 and x are both solutions of P0. This is a contradiction. This proves sufficiency.

() Let z(0,1)m and v,v1Zm such that Ax=z+v and Ax1=z+v1 where x0,x10s. Now, the size of the support of xx1 is at most 2s. Consequently,

A(xx1)=vv1Zm

This is a contradiction as any 2s columns of A are linearly independent of all vZm. This proves necessity.


Corollary. Any vector x satisfying x0N2 is a unique solutions to P0 iff columns of A are linearly independent. Consequently, the minimum number of measurements required for unique recovery is m = N+1 (not N, see below)

Conditions for recovery

Signals in spacial domain

To compare the modulo-CS problem to the standard CS problem, we state the two necessary conditions for modulo-CS recovery.

The following two conditions are necessary for recovering any vector x satisfying x0s as a unique solution to P0:

  • m2s+1, and
  • Any 2s columns of A are linearly independent.

m=2s is necessary and sufficient for unique sparse recovery in the standard CS setup. We now show m=2s+1 measurements is sufficient for unique recovery.

Theorem. For any N2s+1, there exists a matrix ARm×N with m=2s+1 rows such that every s-sparse xRN can be uniquely recovered from its modulo measurements z=[[Ax]]


Proof. We define the following

  1. V={v|vZm} denotes the countably infinite set of all integer vectors.
  2. G={T|T[N],|T|=2s} denotes the set of all index sets on [N] whose cardinality is 2s.

Let A be a matrix for which at least one s- sparse vector x cannot be recovered from z=[[Ax]] via (P0). Our aim is to show the set of all such matrices is of Lebesgue measure zero.

For a given uV and SG, construct B(u,S)=[uAS]. If det(B(u,S)) equals 0, then A is not linearly independent of Zm. This function is a non-zero polynomial of the entries of As, and therefore the set of matrices which satisfy this condition have Lebesgue measure zero.

Now, consider SGuV{A|det(B(u,S))=0}, This set is of Lebesgue measure zero (why?). Therefore, any matrix A chosen outside this set will ensure that any s-sparse vector x can be recovered from y=[[Ax]].


  • If the entries of A are drawn independently from any continuous distribution, A lies outside the set of Lebesgue measure 0 described in the above theorem.
  • Integer matrices cannot be used as candidate measurement matrices for modulo CS (why?)

Signals in Temporal domain

Unlimited Sampling Theorem - A band-limited signal can be recovered from modulo samples provided that a certain sampling density criterion is satisfied. This criterion must be independent of the ADC threshold.

Theorem. Let f(t) be a function with no frequencies higher than Ω(rad/s), then a sufficient condition for recovery of f(t) from its modulo samples yk=Mλ(f(t)),t=kT,kZ is,

T12Ωe

The above guarantees the recovery of the signal. Now, we discuss the conditions for unique recovery. There is a one-to-one mapping between a band-limited function and its modulo samples provided that the sampling rate is higher than the Nyquist Rate, T<π/Ω.

Theorem. Let f(t) be a finite-energy function with no frequencies higher than Ω (rad/s). Then the function f(t) is uniquely determined by its modulo samples yk=Mλ(f(tk)) taken on grid t=kTϵ,kZ where

0<Tϵ<πΩ+ϵ,ϵ>0

Convex Relaxation

Replacing the l0-norm in P0 with the l1-norm, we obtain the optimization problem:

P1:argminx,vx1 subject to Ax=z+v;vZm

Integer Range Space Property

A matrix A is said to satisfy the IRSP of order s if, for all sets S[N] with |S|s,

uS1<uSc1

holds for every uRN with Au=vZm.

Theorem. Every s-sparse x is the unique solution of (P1) iff the matrix A satisfies the IRSP of order s.


Proof.

() Consider a fixed index set S with |S|s, and suppose that every x supported on S is a unique minimizer of (P1). Then, for any u such that Au=vZm, the vector uS is the unique minimizer of (P1). But, A(uS+uSc)Zm. This means that uSc is also a solution of (P1) for [[AuS]]=y. Consequently, (we are talking about l1-norm not l0-norm)

uS1<uSc1

() Suppose IRSP holds with respect to the set S. Consider x supported on S and another vector x1 that result in the same modulo measurements with respect to A. Let u=xx1, the vector AuZm. Due to the IRSP,

uS1<uSc1

Hence,

(1)x1xxS11+xS11(2)=uS1+xS11(3)<uSc1+xS11(4)=xSc11+xS11=x11

Thus proving uniqueness.


Mixed Linear Integer Program (MILP)

The l1-norm can be re-written as a linear function using two positive vectors x+ and x, where x=x+x. This leads to the MILP formulation

minx+,x,v1T(x++x) subject to [AAI][x+xv]=z;vZm;x+,x0

This is efficiently solved using branch-and-bound algorithm.

Appendix - Lebesgue Measure

Lebesgue measure is an extension of the classical notions of length, area and volume to more complicated sets. Given an open set Sk(ak,bk) containing disjoint intervals, the Lebesgue measure is defined by

μL(S)=k(bkak)

Refer to this document for more details regarding this topic.

Let AR. A is said to be of measure zero if for any ϵ>0, there exists a sequence of open intervals (In)nN such that,

An=1In and n=1l(In)<ϵ

where, l((a,b))=ba. For example, Q is of measure zero.




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Key Works in ML Systems
  • Data Systems for Machine Learning
  • AI Agents
  • Reinforcement Learning Theory
  • Brains and AI
  • Talk to my AI