Complexidade do circuito OR de um operador linear denso

Considere o seguinte modelo de circuito simples monótono: cada porta é apenas um OR binário. Qual é a complexidade de uma função $f(x)=Ax$ que $A$ é uma matriz booleana $n \times n$ com $O(n)$ 0's? Pode ser calculado por circuitos OR de tamanho linear?

Mais formalmente, $f$ é uma função de $n$ para $n$ bits. A $i$ -ésima saída de $f$ é $\bigvee_{j=1}^{n}(A_{ij} \land x_j)$ (isto é, um OR do subconjunto de bits de entrada dadas pela $i$ linha de -ésimo $A$ ).

Observe que $O(n)$ 0's dividem as linhas de $A$ em intervalos de $O(n)$ (subconjuntos que consistem em elementos consecutivos de $[n]$ ). Isso torna possível empregar estruturas de dados de consulta de intervalo conhecidas. Por exemplo, uma estrutura de dados de tabela esparsa pode ser transformada em um circuito OR do tamanho $O(n\log n)$ . O algoritmo de Yao para consultas de operadores de semigrupos de alcance pode ser transformado em um circuito quase linear (do tamanho $O(\alpha(n) \cdot n)$ onde $\alpha(n)$ é inverso de Ackermann)

Em particular, nem sei como construir um circuito de tamanho linear para um caso especial em que cada linha de $A$ contém exatamente dois zeros. Embora o caso de exatamente um zero em cada linha seja fácil. (Cada função de saída pode ser calculada por um OR de um prefixo $[1..k-1]$ e um sufixo $[k+1..n]$ , que pode ser pré-computado por $2n$ OR-gates.)

ds.algorithms circuit-complexity upper-bounds Alexander S. Kulikov
fonte

Um limite superior é conhecido: é no máximo rk (A) vezes n dividido pelo log n, em que rk (A) é o ranking OR de uma matriz booleana A (= número mínimo de sub-trechos 1 cujo OR coincide com A ) Veja o Lema 2.5 neste livro . Então, qual o tamanho (no máximo) da classificação booleana de uma matriz nxn com O (n) zeros pode ser?

Stasys

@Stasys Obrigado, Stasys! Já para a matriz com diagonal zero, o OR-rank é linear, certo?

Alexander S. Kulikov

A classificação OR da sua matriz (diagonal zero e 1s em outro lugar) é no máximo 2 \ log n: rotule linhas / colunas por cadeias binárias de comprimento \ log n e considere retângulos {(r, c): r (i) = a, c (i) = 1-a} para a = 0,1. Observe também que o lema 2.5 é um limite superior . Um limite inferior em termos de classificação OR é dado em Thm. 3.20 Além disso, o log da classificação OR é exatamente a complexidade da comunicação não determinística das matrizes.

Stasys

@Stasys oh, sim, certo!

Alexander S. Kulikov

Respostas:

Esta é uma resposta parcial (afirmativa) no caso em que temos um limite superior no número de zeros em todas as linhas ou colunas.

Um retângulo é uma matriz booleana que consiste em uma submatriz all-1 e possui zeros em outro lugar. Um OR-rank de uma matriz booleana é o menor número de retângulos, de modo que possa ser escrito como um OR (componente a componente) desses retângulos. Ou seja, cada entrada de é uma entrada em pelo menos um dos retângulos e cada entrada de é entrada de 0 em todos os retângulos. Observe que é exatamente a complexidade de comunicação não determinística da matriz $rk(A)$ $r$ $A$ $A$ $A$ $\log rk(A)$ $A$ (onde Alice obtém linhas e colunas Bob). Como OP escreveu, toda matriz $m\times n$ define um mapeamento , onde para . Ou seja, tomamos um produto de vetor de matriz sobre o semiamento booleano. $A=(a_{i,j})$ $y=Ax$ $y_i=\bigvee_{j=1}^na_{i,j}x_j$ $i=1,\ldots,m$

O seguinte lema é devido a Pudlák e Rödl; veja a Proposição 10.1 neste artigo ou o Lema 2.5 deste livro para uma construção direta.

Lema 1: Para cada matriz booleana , o mapeamento pode ser calculado por um circuito OR de profundidade ilimitada de ventilador 3, usando no máximo fios . $n\times n$ $A$ $y=Ax$ $O(rk(A)\cdot n/\log n)$

Também temos o seguinte limite superior no ranking OR de matrizes densas. O argumento é uma variação simples da usada por Alon neste artigo .

Lema 2: Se todas as colunas ou linhas de uma matriz booleana contiver no máximo zeros, então , em que é o número de s em $A$ $d$ $rk(A)=O(d\ln|A|)$ $|A|$ $1$ . $A$

Prova: Construa uma submatriz aleatória all- escolhendo cada linha independentemente com a mesma probabilidade . Seja o subconjunto aleatório obtido de linhas. Então deixe- , onde é o conjunto de todas as colunas de que não têm zeros nas linhas em . $1$ $R$ $p=1/(d+1)$ $I$ $R=I\times J$ $J$ $A$ $I$

A $1$ -entry $(i,j)$ of $A$ is covered by $R$ if $i$ was chosen in $I$ and none of (at most $d$ ) rows with a $0$ in the $j$ -th column was chosen in $I$ . Hence, the entry $(i,j)$ is covered with probability at least $p(1-p)^{d}\geq pe^{-pd-p^2d}\geq p/e$ . If we apply this procedure $r$ times to get $r$ rectangles, then the probability that $(i,j)$ is covered by none of these rectangles does not exceed $(1-p/e)^r\leq e^{-rp/e}$ . By the union bound, the probability that some $1$ -entry of $A$ remains uncovered is at most $|A|\cdot e^{-rp/e}$ , which is smaller than $1$ for $r=O(d\ln|A|)$ . $\Box$

Corollary: If every column or every row of a boolean matrix $A$ contains at most $d$ zeros, then the mapping $y=Ax$ can be computed by an unbounded fanin OR-circuit of depth-3 using $O(dn)$ wires.

I guess that a similar upper bound as in Lemma 2 should also hold when $d$ is the average number of $1$ s in a column (or in a row). It would be interesting to show this.

Remark: (added 04.01.2018) An analogue $rk(A)=O(d^2\log n)$ of Lemma 2 also holds when $d$ is the maximum average number of zeros in a submatrix of $A$ , where the average number of zeros in an $r\times s$ matrix is the total number of zeros divided by $s+r$ . This follows from Theorem 2 in N. Eaton and V. Rödl;, Graphs of small dimension, Combinatorica 16(1) (1996) 59-85. A slightly worse upper bound $rk(A)=O(d^2\ln^2 n)$ can be derived directly from Lemma 2 as follows.

Lemma 3: Let $d\geq 1$ . If every spanning subgraph of a bipartite graph $G$ has average degree $\leq d$ , then $G$ can be written as a union $G=G_1\cup G_2$ , where the maximum left degree of $G_1$ and the maximum right degree of $G_2$ are $\leq d$ .

Proof: Induction on the number $n$ of vertices. The base cases $n=1$ and $n=2$ are obvious. For the induction step, we will color the edges in blue and red so that the maximum degree in both blue and red subgraphs are $\leq d$ . Take a vertex $u$ of degree $\leq d$ ; such a vertex must exists because also the average degree of the entire graph must be $\leq d$ . If $u$ belongs to the left part, then color all edges incident to $u$ in blue, else color all these edges in red. If we remove the vertex $u$ then the average degree of the resulting graph $G$ is also at most $d$ , and we can color the edges of this graph by the induction hypothesis. $\Box$

Lemma 4: Let $d\geq 1$ . If the maximum average number of zeros in a boolean $n\times n$ matrix $A=(a_{i,j})$ is at most $d$ , then $rk(A)=O(d^2\ln^2 n)$ .

Proof: Consider the bipartite $n\times n$ graph $G$ with $(i,j)$ being an edge iff $a_{i,j}=0$ . Then the maximum average degree of $G$ is at most $d$ . By Lemma 3, we can write $G=G_1\cup G_2$ , where the maximum degree of the vertices on the left part of $G_1$ , and the maximum degree of the vertices on the right part of $G_2$ is $\leq d$ . Let $A_1$ and $A_2$ be the complements of the adjacency matrices of $G_1$ and $G_2$ . Hence, $A= A_1\land A_2$ is a componentwise AND of these matrices. The maximum number of zeros in every row of $A_1$ and in every column of $A_2$ is at most $d$ . Since $rk(A)\leq rk(A_1)\cdot rk(A_2)$ , Lemma 2 yields $rk(A)=O(d^2\ln^2 n)$ . $\Box$

N.B. The following simple example (pointed by Igor Sergeev) shows that my "guess" at the end of the answer was totally wrong: if we take $d=d(A)$ to be the average number of zeros in the entire matrix $A$ (not the maximum of averages over all submatrices), then Lemma 2 can badly fail. Let $m=\sqrt{n}$ , and put an identity $m\times m$ matrix in, say left upper corner of $A$ , and fill the remaining entries by ones. Then $d(A)\leq m^2/2n < 1$ but $rk(A)\geq m$ , which is exponentially larger than $\ln|A|$ . Note, however, that the OR complexity of this matrix is very small, is $O(n)$ . So, direct arguments (not via rank) can yield much better upper bounds on the OR complexity of dense matrices.

Stasys
fonte

Thanks a lot, Stasys! This is nice! In the meantime, Ivan Mihajlin came with another proof. I've posted it below.

Alexander S. Kulikov

(I tried to post this as a comment to Stasys' answer above, but this text is too long for a comment, so posting it as an answer.) Ivan Mihajlin (@ivmihajlin) came up with the following construction. Similarly to Stasys' proof, it works for the case when the maximum (rather than average) number of 0’s in each row is bounded.

First, consider the case when every row contains exactly two zeros. Consider the following undirected graph: the set of vertices is $[n]$ ; two nodes $i$ and $j$ are joined by an edge, if there is a row having zeros in columns $i$ and $j$ . The graph has $n$ edges and hence it contains a cut $(L,R)$ of size at least $n/2$ . This cut splits the columns of the matrix into two parts ( $L$ and $R$ ). Let now also split the rows into two parts: the top part $T$ contains all columns that have exactly one zero in both $L$ and $R$ ; the bottom part $B$ contains all the remaining rows. What is nice about the top part of the matrix ( $T \times (L \cup R)$ ) is that it can be computed by $O(n)$ gates. For the bottom part, let’s cut all-1 columns out of it and make a recursive call. The corresponding recurrence relation is $C(n) \le an + C(n/2)$ implying $C(n)=O(n)$ .

Now, generalize it to the case of at most $d$ zeros in every row. Let $C_d(n)$ be the complexity of an $n \times (\le dn)$ matrix with at most $d$ zeros per row (if there are more than $dn$ columns, then some of them are all-1). Partition the columns into two parts $L$ and $R$ such that at least $n(1-2^{-d})$ rows (call them $T$ ) satisfy the following property: if there are exactly $d$ zeroes in a row, then not all of them belong to the same part (denote the remaining rows by $B$ ). Then make three recursive calls: $T \times L$ , $T \times R$ , and $B \times (L \cup R)$ . This gives a recurrence relation $C_d(n) \le an + 2\cdot C_{d-1}(n(1-2^{-d}))+C_d(2^{-d}n)$ . This, in turn, implies that $C_d(n) \le f(d)\cdot n$ . The function $f(d)$ is exponential, but still.

Alexander S. Kulikov
fonte

A nice argument. But it seems to be tailor made for the case of d=2 zeros per row. What about d>2 zeros?

Stasys

@Stasys, it is doable if I'm not mistaken. I've updated the answer.

Alexander S. Kulikov