Bergman's diamond lemma

In mathematics, specifically the field of abstract algebra, Bergman's Diamond Lemma (after George Bergman) is a method for confirming whether a given set of monomials of an algebra forms a k {\displaystyle k} -basis. It is an extension of Gröbner bases to non-commutative rings. The proof of the lemma gives rise to an algorithm for obtaining a non-commutative Gröbner basis of the algebra from its defining relations. However, in contrast to Buchberger's algorithm, in the non-commutative case, this algorithm may not terminate.[1]

Preliminaries

Let k {\displaystyle k} be a commutative associative ring with identity element 1, usually a field. Take an arbitrary set X {\displaystyle X} of variables. In the finite case one usually has X = { x 1 , x 2 , x 3 , , x n } {\displaystyle X=\{x_{1},x_{2},x_{3},\dots ,x_{n}\}} . Then X {\displaystyle \langle X\rangle } is the free semigroup with identity 1 on X {\displaystyle X} . Finally, k X {\displaystyle k\langle X\rangle } is the free associative k {\displaystyle k} -algebra over X {\displaystyle X} .[2][3] Elements of X {\displaystyle \langle X\rangle } will be called words, since elements of X {\displaystyle X} can be seen as letters.

Monomial Ordering

The reductions below require a choice of ordering < {\displaystyle <} on the words i.e. monomials of X {\displaystyle \langle X\rangle } . This has to be a total order and satisfy the following:

  1. For all words u , u , v {\displaystyle u,u',v} and w {\displaystyle w} , we have that if w < v {\displaystyle w<v} then u w u < u v u {\displaystyle uwu'<uvu'} .[2]
  2. For each word w {\displaystyle w} , the collection { v X : v < w } {\displaystyle \{v\in \langle X\rangle :v<w\}} is finite.[1]

We call such an order admissible.[4] An important example is the degree lexicographic order, where w < v {\displaystyle w<v} if w {\displaystyle w} has smaller degree than v {\displaystyle v} ; or in the case where they have the same degree, we say w < v {\displaystyle w<v} if w {\displaystyle w} comes earlier in the lexicographic order than v {\displaystyle v} . For example the degree lexicographic order on monomials of k x , y {\displaystyle k\langle x,y\rangle } is given by first assuming x < y {\displaystyle x<y} . Then the above rule implies that the monomials are ordered in the following way:

1 < x < y < x 2 < x y < y x < y 2 < x 3 < x 2 y < {\displaystyle 1<x<y<x^{2}<xy<yx<y^{2}<x^{3}<x^{2}y<\dots }

Every element h k X {\displaystyle h\in k\langle X\rangle } has a leading word which is the largest word under the ordering < {\displaystyle <} which appears in h {\displaystyle h} with non-zero coefficient.[1] In k x , y {\displaystyle k\langle x,y\rangle } if h = x 2 + 2 x 2 y y 2 x {\displaystyle h=x^{2}+2x^{2}y-y^{2}x} , then the leading word of h {\displaystyle h} under degree lexicographic order is y 2 x {\displaystyle y^{2}x} .

Reduction

Assume we have a set { g σ } σ S k X {\displaystyle \{g_{\sigma }\}_{\sigma \in S}\subseteq k\langle X\rangle } which generates a 2-sided ideal I {\displaystyle I} of k X {\displaystyle k\langle X\rangle } . Then we may scale each g σ {\displaystyle g_{\sigma }} such that its leading word w σ {\displaystyle w_{\sigma }} has coefficient 1. Thus we can write g σ = w σ f σ {\displaystyle g_{\sigma }=w_{\sigma }-f_{\sigma }} , where f σ {\displaystyle f_{\sigma }} is a linear combination of words v {\displaystyle v} such that v < w σ {\displaystyle v<w_{\sigma }} .[1] A word w {\displaystyle w} is called reduced with respect to the relations { g σ } σ S {\displaystyle \{g_{\sigma }\}_{\sigma \in S}} if it does not contain any of the leading words w σ {\displaystyle w_{\sigma }} . Otherwise, w = u w σ v {\displaystyle w=uw_{\sigma }v} for some u , v X {\displaystyle u,v\in \langle X\rangle } and some σ S {\displaystyle \sigma \in S} . Then there is a reduction r u σ v : k X k X {\displaystyle r_{u\sigma v}:k\langle X\rangle \to k\langle X\rangle } , which is an endomorphism of k X {\displaystyle k\langle X\rangle } that fixes all elements of X {\displaystyle \langle X\rangle } apart from w = u w σ v {\displaystyle w=uw_{\sigma }v} and sends this to u f σ v {\displaystyle uf_{\sigma }v} .[2] By the choice of ordering there are only finitely many words less than any given word, hence a finite composition of reductions will send any h k X {\displaystyle h\in k\langle X\rangle } to a linear combination of reduced words.

Any element shares an equivalence class modulo I {\displaystyle I} with its reduced form. Thus the canonical images of the reduced words in k X / I {\displaystyle k\langle X\rangle /I} form a k {\displaystyle k} -spanning set.[1] The idea of non-commutative Gröbner bases is to find a set of generators g σ {\displaystyle g_{\sigma }} of the ideal I {\displaystyle I} such that the images of the corresponding reduced words in k X / I {\displaystyle k\langle X\rangle /I} are a k {\displaystyle k} -basis. Bergman's Diamond Lemma lets us verify if a set of generators g σ {\displaystyle g_{\sigma }} has this property. Moreover, in the case where it does not have this property, the proof of Bergman's Diamond Lemma leads to an algorithm for extending the set of generators to one that does.

An element h k X {\displaystyle h\in k\langle X\rangle } is called reduction-unique if given two finite compositions of reductions s 1 {\displaystyle s_{1}} and s 2 {\displaystyle s_{2}} such that the images s 1 ( h ) {\displaystyle s_{1}(h)} and s 2 ( h ) {\displaystyle s_{2}(h)} are linear combinations of reduced words, then s 1 ( h ) = s 2 ( h ) {\displaystyle s_{1}(h)=s_{2}(h)} . In other words, if we apply reductions to transform an element into a linear combination of reduced words in two different ways, we obtain the same result.[5]

The series of reductions lead to a common expression. The diamond shape gives rise to the name.

Ambiguities

When performing reductions there might not always be an obvious choice for which reduction to do. This is called an ambiguity and there are two types which may arise. Firstly, suppose we have a word w = t v u {\displaystyle w=tvu} for some non-empty words t , v , u {\displaystyle t,v,u} and assume that w σ = t v {\displaystyle w_{\sigma }=tv} and w τ = v u {\displaystyle w_{\tau }=vu} are leading words for some σ , τ S {\displaystyle \sigma ,\tau \in S} . This is called an overlap ambiguity, because there are two possible reductions, namely r 1 σ u {\displaystyle r_{1\sigma u}} and r t τ 1 {\displaystyle r_{t\tau 1}} . This ambiguity is resolvable if t r 1 σ u {\displaystyle tr_{1\sigma u}} and r t τ 1 u {\displaystyle r_{t\tau 1}u} can be reduced to a common expression using compositions of reductions.

Secondly, one leading word may be contained in another i.e. w σ = t ω τ u {\displaystyle w_{\sigma }=t\omega _{\tau }u} for some words t , u {\displaystyle t,u} and some indices σ , τ S {\displaystyle \sigma ,\tau \in S} . Then we have an inclusion ambiguity. Again, this ambiguity is resolvable if s 1 r 1 σ 1 ( w ) = s 2 r t τ u ( w ) {\displaystyle s_{1}\circ r_{1\sigma 1}(w)=s_{2}\circ r_{t\tau u}(w)} , for some compositions of reductions s 1 {\displaystyle s_{1}} and s 2 {\displaystyle s_{2}} .[1]

Statement of the Lemma

The statement of the lemma is simple but involves the terminology defined above. This lemma is applicable as long as the underlying ring is associative.[6]

Let { g σ } σ S k X {\displaystyle \{g_{\sigma }\}_{\sigma \in S}\subseteq k\langle X\rangle } generate an ideal I {\displaystyle I} of k X {\displaystyle k\langle X\rangle } , where g σ = w σ f σ {\displaystyle g_{\sigma }=w_{\sigma }-f_{\sigma }} with w σ {\displaystyle w_{\sigma }} the leading words under some fixed admissible ordering of X {\displaystyle \langle X\rangle } . Then the following are equivalent:

  1. All overlap and inclusion ambiguities among the g σ {\displaystyle g_{\sigma }} are resolvable.
  2. All elements of k X {\displaystyle k\langle X\rangle } are reduction-unique.
  3. The images of the reduced words in k X / I {\displaystyle k\langle X\rangle /I} form a k {\displaystyle k} -basis.

Here the reductions are done with respect to the fixed set of generators { g σ } σ S {\displaystyle \{g_{\sigma }\}_{\sigma \in S}} of I {\displaystyle I} . When any of the above hold we say that { g σ } σ S {\displaystyle \{g_{\sigma }\}_{\sigma \in S}} is a Gröbner basis for I {\displaystyle I} .[1] Given a set of generators, one usually checks the first or second condition to confirm that the set is a k {\displaystyle k} -basis.

Examples

Resolving ambiguities

Take A = k x , y , z / ( y x p x y , z x q x z , z y r y z ) {\displaystyle A=k\langle x,y,z\rangle /(yx-pxy,zx-qxz,zy-ryz)} , which is the quantum polynomial ring in 3 variables, and assume x < y < z {\displaystyle x<y<z} . Take < {\displaystyle <} to be degree lexicographic order, then the leading words of the defining relations are y x {\displaystyle yx} , z x {\displaystyle zx} and z y {\displaystyle zy} . There is exactly one overlap ambiguity which is z y x {\displaystyle zyx} and no inclusion ambiguities. One may resolve via y x = p x y {\displaystyle yx=pxy} or via z y = r y z {\displaystyle zy=ryz} first. The first option gives us the following chain of reductions,

z y x = p z x y = p q x z y = p q r x y z , {\displaystyle zyx=pzxy=pqxzy=pqrxyz,}

whereas the second possibility gives,

z y x = r y z x = r q y x z = r q p x y z . {\displaystyle zyx=ryzx=rqyxz=rqpxyz.}

Since p , q , r {\displaystyle p,q,r} are commutative the above are equal. Thus the ambiguity resolves and the Lemma implies that { y x p x y , z x q x z , z y r y z } {\displaystyle \{yx-pxy,zx-qxz,zy-ryz\}} is a Gröbner basis of I {\displaystyle I} .

Non-resolving ambiguities

Let A = k x , y , z / ( z 2 x y y x , z x x z , z y y z ) {\displaystyle A=k\langle x,y,z\rangle /(z^{2}-xy-yx,zx-xz,zy-yz)} . Under the same ordering as in the previous example, the leading words of the generators of the ideal are z 2 {\displaystyle z^{2}} , z x {\displaystyle zx} and z y {\displaystyle zy} . There are two overlap ambiguities, namely z 2 x {\displaystyle z^{2}x} and z 2 y {\displaystyle z^{2}y} . Let us consider z 2 x {\displaystyle z^{2}x} . If we resolve z 2 {\displaystyle z^{2}} first we get,

z 2 x = ( x y + y x ) x = x y x + y x 2 , {\displaystyle z^{2}x=(xy+yx)x=xyx+yx^{2},}

which contains no leading words and is therefore reduced. Resolving z x {\displaystyle zx} first we obtain,

z 2 x = z x z = x z 2 = x ( x y + y x ) = x 2 y + x y x . {\displaystyle z^{2}x=zxz=xz^{2}=x(xy+yx)=x^{2}y+xyx.}

Since both of the above are reduced but not equal we see that the ambiguity does not resolve. Hence { z 2 x y y x , z x x z , z y y z } {\displaystyle \{z^{2}-xy-yx,zx-xz,zy-yz\}} is not a Gröbner basis for the ideal it generates.

Algorithm

The following short algorithm follows from the proof of Bergman's Diamond Lemma. It is based on adding new relations which resolve previously unresolvable ambiguities. Suppose that w = w σ u = t w τ {\displaystyle w=w_{\sigma }u=tw_{\tau }} is an overlap ambiguity which does not resolve. Then, for some compositions of reductions s 1 {\displaystyle s_{1}} and s 2 {\displaystyle s_{2}} , we have that h 1 = s 1 r 1 σ u ( w ) {\displaystyle h_{1}=s_{1}\circ r_{1\sigma u}(w)} and h 2 = s 2 r t τ 1 ( w ) {\displaystyle h_{2}=s_{2}\circ r_{t\tau 1}(w)} are distinct linear combinations of reduced words. Therefore, we obtain a new non-zero relation h 1 h 2 I {\displaystyle h_{1}-h_{2}\in I} . The leading word of this relation is necessarily different from the leading words of existing relations. Now scale this relation by a non-zero constant such that its leading word has coefficient 1 and add it to the generating set of I {\displaystyle I} . The process is analogous for inclusion ambiguities.[1]

Now, the previously unresolvable overlap ambiguity resolves by construction of the new relation. However, new ambiguities may arise. This process may terminate after a finite number of iterations producing a Gröbner basis for the ideal or never terminate. The infinite set of relations produced in the case where the algorithm never terminates is still a Gröbner basis, but it may not be useful unless a pattern in the new relations can be found.[7]

Example

Let us continue with the example from above where A = k x , y , z / ( z 2 x y y x , z x x z , z y y z ) {\displaystyle A=k\langle x,y,z\rangle /(z^{2}-xy-yx,zx-xz,zy-yz)} . We found that the overlap ambiguity z 2 x {\displaystyle z^{2}x} does not resolve. This gives us h 1 = x y x + y x 2 {\displaystyle h_{1}=xyx+yx^{2}} and h 2 = x 2 y + x y x {\displaystyle h_{2}=x^{2}y+xyx} . The new relation is therefore h 1 h 2 = y x 2 x 2 y I {\displaystyle h_{1}-h_{2}=yx^{2}-x^{2}y\in I} whose leading word is y x 2 {\displaystyle yx^{2}} with coefficient 1. Hence we do not need to scale it and can add it to our set of relations which is now { z 2 x y y x , z x x z , z y y z , y x 2 x 2 y } {\displaystyle \{z^{2}-xy-yx,zx-xz,zy-yz,yx^{2}-x^{2}y\}} . The previous ambiguity now resolves to either h 1 {\displaystyle h_{1}} or h 2 {\displaystyle h_{2}} . Adding the new relation did not add any ambiguities so we are left with the overlap ambiguity z 2 y {\displaystyle z^{2}y} we identified above. Let us try and resolve it with the relations we currently have. Again, resolving z 2 {\displaystyle z^{2}} first we obtain,

z 2 y = ( x y + y x ) y = x y 2 + y x y . {\displaystyle z^{2}y=(xy+yx)y=xy^{2}+yxy.}

On the other hand resolving z y {\displaystyle zy} twice first and then z 2 {\displaystyle z^{2}} we find,

z 2 y = z y z = y z 2 = y ( x y + y x ) = y x y + y 2 x . {\displaystyle z^{2}y=zyz=yz^{2}=y(xy+yx)=yxy+y^{2}x.}

Thus we have h 3 = x y 2 + y x y {\displaystyle h_{3}=xy^{2}+yxy} and h 4 = y x y + y 2 x {\displaystyle h_{4}=yxy+y^{2}x} and the new relation is h 3 h 4 = x y 2 y 2 x {\displaystyle h_{3}-h_{4}=xy^{2}-y^{2}x} with leading word y 2 x {\displaystyle y^{2}x} . Since the coefficient of the leading word is -1 we scale the relation and then add y 2 x x y 2 {\displaystyle y^{2}x-xy^{2}} to the set of defining relations. Now all ambiguities resolve and Bergman's Diamond Lemma implies that

{ z 2 x y y x , z x x z , z y y z , y x 2 x 2 y , y 2 x x y 2 } {\displaystyle \{z^{2}-xy-yx,zx-xz,zy-yz,yx^{2}-x^{2}y,y^{2}x-xy^{2}\}} is a Gröbner basis for the ideal it defines.

Further generalisations

The importance of the diamond lemma can be seen by how many other mathematical structures it has been adapted for:

The lemma has been used to prove the Poincaré–Birkhoff–Witt theorem.[2]

References

  1. ^ a b c d e f g h Rogalski, D. (2014-03-12). "An introduction to Noncommutative Projective Geometry". arXiv:1403.3065 [math.RA].
  2. ^ a b c d Bergman, George (1978-02-01). "The diamond lemma for ring theory". Advances in Mathematics. 29 (2): 178–218. doi:10.1016/0001-8708(78)90010-5. ISSN 0001-8708.
  3. ^ Dotsenko, Vladimir; Tamaroff, Pedro (2020-10-28). "Tangent complexes and the Diamond Lemma". arXiv:2010.14792 [math.RA].
  4. ^ a b Lopatkin, Viktor (2021-10-12). "Garside Theory: a Composition--Diamond Lemma Point of View". arXiv:2109.07595 [math.RA].
  5. ^ Reyes, A., Suárez, H. (2016-12-01) "Bases for Quantum Algebras and skew Poincare-Birkhoff-Witt Extensions". MOMENTO No 54. ISSN 0121-4470
  6. ^ a b Hellström, L (2002-10-22) "The Diamond Lemma for Power Series Algebras". Print & Media, Umeå universitet, Umeå. ISBN 91-7305-327-9
  7. ^ Li, Huishi (2009-06-23). "Algebras Defined by Monic Gr\"obner Bases over Rings". arXiv:0906.4396 [math.RA].
  8. ^ Elias, Ben (2019-07-24). "A diamond lemma for Hecke-type algebras". arXiv:1907.10571 [math.RT].
  9. ^ Bokut, L. A.; Chen, Yuqun; Li, Yu (2011-01-07). "Gr\"obner-Shirshov bases for categories". arXiv:1101.1563 [math.RA].
  10. ^ Dotsenko, Vladimir; Khoroshkin, Anton (2010-06-01). "Gröbner bases for operads". Duke Mathematical Journal. 153 (2): 363–396. arXiv:0812.4069. doi:10.1215/00127094-2010-026. ISSN 0012-7094. S2CID 12243016.