Testwiki:Reference desk/Archives/Mathematics/2017 January 9

Revision as of 11:39, 13 March 2023 by imported>Legobot (Bot: Fixing lint errors, replacing obsolete HTML tags: <center> (1x))


Welcome to the Wikipedia Mathematics Reference Desk Archives
The page you are currently viewing is a transcluded archive page. While you can leave answers for any questions shown below, please ask new questions on one of the current reference desk pages.


January 9

Computing the minimal polynomial of an algebraic number

Given an algebraic number of the form $n=\sum_{i=0}^{k}\sqrt{a_i}$, with the $a_i$ positive integers, is there an algorithm to compute the coefficients of its minimal polynomial? DTLHS (talk) 18:36, 9 January 2017 (UTC)

By analogy with the Swinnerton-Dyer polynomials, a not-necessarily-minimal polynomial with integer coefficients is
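$$\prod_{m=0}^{2^{k+1}-1}\left(x-\sum_{j=0}^{k}(-1)^{\operatorname{bit}_j(m)}\sqrt{a_j}\right)$$
(that is, the product over all possible combinations of signs for the square roots). For example, for $k=2$ it is
$$\begin{aligned}
&(x-\sqrt{a_0}-\sqrt{a_1}-\sqrt{a_2})(x+\sqrt{a_0}-\sqrt{a_1}-\sqrt{a_2})(x-\sqrt{a_0}+\sqrt{a_1}-\sqrt{a_2})(x+\sqrt{a_0}+\sqrt{a_1}-\sqrt{a_2})\\
&\cdot(x-\sqrt{a_0}-\sqrt{a_1}+\sqrt{a_2})(x+\sqrt{a_0}-\sqrt{a_1}+\sqrt{a_2})(x-\sqrt{a_0}+\sqrt{a_1}+\sqrt{a_2})(x+\sqrt{a_0}+\sqrt{a_1}+\sqrt{a_2})
\end{aligned}$$
which multiplied out is
$$\begin{aligned}
&x^8-4x^6a_0-4x^6a_1-4x^6a_2+6x^4a_0^2+4x^4a_0a_1+4x^4a_0a_2+6x^4a_1^2+4x^4a_1a_2+6x^4a_2^2\\
&-4x^2a_0^3+4x^2a_0^2a_1+4x^2a_0^2a_2-40x^2a_0a_2a_1+4x^2a_0a_2^2+4x^2a_1^2a_0-4x^2a_1^3+4x^2a_1^2a_2+4x^2a_1a_2^2-4x^2a_2^3\\
&+a_0^4-4a_0^3a_1-4a_0^3a_2+6a_0^2a_1^2+4a_0^2a_1a_2+6a_0^2a_2^2-4a_0a_1^3+4a_0a_1^2a_2+4a_0a_1a_2^2-4a_0a_2^3\\
&+a_1^4-4a_1^3a_2+6a_1^2a_2^2-4a_1a_2^3+a_2^4
\end{aligned}$$
Then, for a more specific example, choosing $a_0=3$, $a_1=4$ and $a_2=11$, the polynomial is
$$x^8-72x^6+1232x^4-6144x^2+1024$$
This can be factored by the Berlekamp–Zassenhaus algorithm to give candidate minimal polynomials
$$(x^4-8x^3-4x^2+80x-32)(x^4+8x^3-4x^2-80x-32)$$
Testing each factor in turn by substituting $x=\sqrt{3}+2+\sqrt{11}$, the first one, $x^4-8x^3-4x^2+80x-32$, is found to be the minimal polynomial sought. You may have hoped for an algorithm that went straight to the minimal polynomial without the factoring step, but the question did not stipulate that as a requirement. --catslash (talk) 00:20, 10 January 2017 (UTC)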
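The sign-combination product described above can be sketched numerically in Python. This is only an illustration under an assumption not made in the thread: it multiplies out the factors in floating point and rounds the result, which is adequate for small inputs but is not exact arithmetic. The function names `sign_product_polynomial` and `evaluate` are made up for this sketch.

```python
from itertools import product
from math import sqrt


def sign_product_polynomial(a):
    """Integer coefficients, highest degree first, of the product of
    (x - (s_0*sqrt(a_0) + ... + s_k*sqrt(a_k))) over all sign choices s_j = +-1."""
    coeffs = [1.0]
    for signs in product((1, -1), repeat=len(a)):
        root = sum(s * sqrt(v) for s, v in zip(signs, a))
        # Multiply the current polynomial by (x - root).
        shifted = coeffs + [0.0]
        for i, c in enumerate(coeffs):
            shifted[i + 1] -= root * c
        coeffs = shifted
    # Assumption: the floating-point error is far below 0.5,
    # so rounding recovers the integer coefficients.
    return [round(c) for c in coeffs]


def evaluate(coeffs, x):
    """Horner evaluation of a polynomial given highest degree first."""
    value = 0.0
    for c in coeffs:
        value = value * x + c
    return value


poly = sign_product_polynomial((3, 4, 11))
x0 = sqrt(3) + 2 + sqrt(11)
first_factor = [1, -8, -4, 80, -32]
second_factor = [1, 8, -4, -80, -32]
```

Here `poly` holds the coefficients of the degree-8 polynomial for $a_0=3$, $a_1=4$, $a_2=11$, and `evaluate(first_factor, x0)` is zero up to rounding error while `evaluate(second_factor, x0)` is not, which is the substitution test described in the reply.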
The Swinnerton-Dyer polynomials mentioned above cannot be factored over the integers, but they do make the Berlekamp–Zassenhaus factoring algorithm grind very slowly in the attempt. Consequently, the approach described may be inefficient. However, simply not alternating the signs of the radicals of those $a_j$ which happen to be squares ($a_1$ in the above example) gives a polynomial with integer coefficients which is unlikely to have proper factors. Revisiting the example, the initial polynomial becomes
$$(x-\sqrt{a_0}-\sqrt{a_1}-\sqrt{a_2})(x+\sqrt{a_0}-\sqrt{a_1}-\sqrt{a_2})(x-\sqrt{a_0}-\sqrt{a_1}+\sqrt{a_2})(x+\sqrt{a_0}-\sqrt{a_1}+\sqrt{a_2})$$
which multiplied out is
$$x^4-4x^3\sqrt{a_1}-2x^2a_0+6x^2a_1-2x^2a_2+4x\sqrt{a_1}\,a_0-4xa_1^{3/2}+4xa_2\sqrt{a_1}+a_0^2-2a_0a_1-2a_0a_2+a_1^2-2a_1a_2+a_2^2$$
and putting $a_0=3$, $a_1=4$ and $a_2=11$ gives
$$x^4-8x^3-4x^2+80x-32$$
immediately (the wanted minimal polynomial with no factoring). --catslash (talk) 01:32, 10 January 2017 (UTC)
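A minimal Python sketch of this shortcut, again using floating-point multiplication and rounding (an assumption of the sketch, not something stated in the thread): square roots of perfect squares keep a fixed sign, and only the remaining radicals have their signs alternated. The name `reduced_sign_polynomial` is hypothetical.

```python
from itertools import product
from math import isqrt, sqrt


def reduced_sign_polynomial(a):
    """Coefficients, highest degree first, of the product over sign choices
    for the non-square a_j only; perfect squares contribute a fixed rational term."""
    fixed = 0.0    # sum of the (integer) square roots of the perfect squares
    radicals = []  # the a_j that are not perfect squares
    for v in a:
        r = isqrt(v)
        if r * r == v:
            fixed += r
        else:
            radicals.append(v)
    coeffs = [1.0]
    for signs in product((1, -1), repeat=len(radicals)):
        root = fixed + sum(s * sqrt(v) for s, v in zip(signs, radicals))
        # Multiply the current polynomial by (x - root).
        shifted = coeffs + [0.0]
        for i, c in enumerate(coeffs):
            shifted[i + 1] -= root * c
        coeffs = shifted
    # Assumption: rounding error stays well below 0.5 for small inputs.
    return [round(c) for c in coeffs]


quartic = reduced_sign_polynomial((3, 4, 11))
```

For the example values, `quartic` is the coefficient list of $x^4-8x^3-4x^2+80x-32$. The sketch does not check whether the result is irreducible; as the reply says, it is only unlikely to have proper factors.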
Alternatively, you could use the Lenstra–Lenstra–Lovász lattice basis reduction algorithm (see the second paragraph of the Applications section). --catslash (talk) 00:30, 10 January 2017 (UTC)
Thanks, that's very helpful. What if I just wanted the degree of the minimal polynomial and didn't care about the actual coefficients? DTLHS (talk) 01:10, 10 January 2017 (UTC)
I don't think there is a simple answer. $\mathbb{Q}[n]$ is a subfield of $\mathbb{Q}[\sqrt{a_1},\ldots,\sqrt{a_k}]$, so the degree is a power of 2 that is at most $2^{k+1}$. At first I thought the exponent would be the number of distinct primes in the factorizations of the square-free parts of the $a_i$'s. So the degrees of $\sqrt2+\sqrt3$ and $\sqrt2+\sqrt3+\sqrt6$ are both 4. But the degree of $\sqrt6+\sqrt{10}+\sqrt{15}$ is also 4, so the idea doesn't work all the time. It looks like you need to find the rank $r$ (over $\mathbb{Z}_2$) of the matrix formed by the exponents when you factor the $a_i$'s. Then I think the answer would be $2^r$. This looks messy to prove, assuming it's true; the case where the $a_i$'s are distinct primes is much easier, though. --RDBury (talk) 20:41, 10 January 2017 (UTC)
Hm, that seems to work in almost every case. I found some exceptions bruteforcing random values: (6,10,15) gives a matrix of <(1,1,0), (1,0,1), (0,1,1)>, which has rank 3 when it should be 2. Similarly (6,15,40) gives a matrix of <(1,1,0), (0,1,1), (1,0,1)> which also has rank 3 when the degree of the minimal polynomial is 4. Any idea why it works in most cases but not all? DTLHS (talk) 06:02, 11 January 2017 (UTC)
The matrix <(1,1,0), (1,0,1), (0,1,1)> is rank 2 over $\mathbb{Z}_2$, which is what I meant. I'd be surprised if someone hasn't already determined the Galois group of $\mathbb{Q}[\sqrt{a_1},\ldots,\sqrt{a_k}]$ over $\mathbb{Q}$ given the $a_i$ are relatively prime, and it seems to me that would be very useful here. --RDBury (talk) 09:59, 14 January 2017 (UTC)
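The rank computation discussed in this thread can be sketched in Python as follows. The sketch implements the rule as conjectured above (degree $=2^r$, with $r$ the rank over $\mathbb{Z}_2$ of the prime-exponent matrix); the thread does not prove it, so the result should be read as the conjectured degree. The function names are made up for illustration, and factoring is done by plain trial division.

```python
def odd_exponent_primes(n):
    """Set of primes that divide n to an odd power (trial division)."""
    odd = set()
    p = 2
    while p * p <= n:
        e = 0
        while n % p == 0:
            n //= p
            e += 1
        if e % 2:
            odd.add(p)
        p += 1
    if n > 1:
        odd.add(n)  # leftover prime factor with exponent 1
    return odd


def gf2_rank(vectors):
    """Rank over Z_2 of vectors encoded as integer bitmasks."""
    pivots = {}  # leading bit position -> basis vector with that leading bit
    for v in vectors:
        while v:
            h = v.bit_length() - 1
            if h not in pivots:
                pivots[h] = v
                break
            v ^= pivots[h]  # eliminate the leading bit and keep reducing
    return len(pivots)


def conjectured_degree(a):
    """2**r, where r is the Z_2-rank of the exponent matrix of the a_i."""
    supports = [odd_exponent_primes(n) for n in a]
    primes = sorted(set().union(*supports))
    index = {p: i for i, p in enumerate(primes)}
    vectors = [sum(1 << index[p] for p in s) for s in supports]
    return 2 ** gf2_rank(vectors)
```

With this encoding, `conjectured_degree((6, 10, 15))` and `conjectured_degree((6, 15, 40))` both give 4, because the three rows sum to zero over $\mathbb{Z}_2$ and the rank is 2 rather than 3.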

Formations question (group theory)

From the article: "In mathematical group theory, a formation is a class of groups closed under taking images and such that if G/M and G/N are in the formation then so is G/M∩N...", and somewhat later in the article: "A Melnikov formation is closed under taking quotients, normal subgroups and group extensions...". My question is: Isn't being closed under taking quotients the same thing as being closed under taking images, since an image is isomorphic to a quotient by the first isomorphism theorem? Thanks. 144.35.45.77 (talk) 18:43, 9 January 2017 (UTC)

Seems like it to me; just different ways of phrasing the same property. --RDBury (talk) 20:48, 10 January 2017 (UTC)

I'm missing something about the proof:


We shall prove the first case, f(a)<y<f(b). The second case is similar.

Let $A=\{x\in[a,b]:f(x)<y\}$. Then $A$ is non-empty since $a\in A$, and $A$ is bounded above by $b$. Hence, by completeness, the supremum $c=\sup A$ exists. We claim that $f(c)=y$.

Fix some $\varepsilon>0$. Since $f$ is continuous, there is a $\delta>0$ such that $|f(x)-f(c)|<\varepsilon$ whenever $|x-c|<\delta$. This means that

$$f(x)-\varepsilon<f(c)<f(x)+\varepsilon$$

for all $x\in(c-\delta,c+\delta)$. By the properties of the supremum, there exists $a^*\in(c-\delta,c]$ that is contained in $A$, so that for that $a^*$

$$f(c)<f(a^*)+\varepsilon<y+\varepsilon$$ (Why is this true?)

Choose $a^{**}\in[c,c+\delta)$ that will obviously not be contained in $A$, so we have

$$f(c)>f(a^{**})-\varepsilon\ge y-\varepsilon$$ (Why is this true?)

Both inequalities

$$y-\varepsilon<f(c)<y+\varepsilon$$

are valid for all $\varepsilon>0$, from which we deduce $f(c)=y$ as the only possible value, as stated.

I need help from the first question and on. יהודה שמחה ולדמן (talk) 21:43, 9 January 2017 (UTC)

The supremum is defined as the smallest upper bound.
$c$ is the supremum of $A$, that is, the smallest upper bound of $A$. This means there is no smaller upper bound for $A$. $c-\delta<c$, so $c-\delta$ cannot be an upper bound of $A$. Since $c-\delta$ is not an upper bound of $A$, there must be some element $a^*$ of $A$ such that $a^*>c-\delta$. But $c$ is an upper bound of $A$, so from $a^*\in A$ it follows that $a^*\le c$. Therefore $a^*\in(c-\delta,c]$, and this also means that $a^*\in(c-\delta,c+\delta)$.
In the previous step we've shown that if $x\in(c-\delta,c+\delta)$ then $f(x)-\varepsilon<f(c)<f(x)+\varepsilon$. It follows that $f(a^*)-\varepsilon<f(c)<f(a^*)+\varepsilon$, and in particular $f(c)<f(a^*)+\varepsilon$. Also $a^*\in A$, so by the definition of $A$ we have $f(a^*)<y$ and therefore $f(a^*)+\varepsilon<y+\varepsilon$. This means that $f(c)<f(a^*)+\varepsilon<y+\varepsilon$, which concludes the proof of this step.
The second part is proven similarly.
-- Meni Rosenfeld (talk) 22:53, 9 January 2017 (UTC)
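As a numerical illustration of the construction used in the proof (not a substitute for the argument), the set $A$ and its supremum can be approximated on a grid. The sketch below is an assumption-laden example: the function `approx_sup_below`, the sample function $f(x)=x^2$ on $[0,2]$ and the target $y=2$ are all chosen here for illustration and do not appear in the thread.

```python
from math import sqrt


def approx_sup_below(f, a, b, y, steps=100_000):
    """Approximate c = sup{x in [a, b] : f(x) < y} by scanning a uniform grid.
    Assumes f(a) < y, so that the set is non-empty, as in the proof."""
    c = a
    for i in range(steps + 1):
        x = a + (b - a) * i / steps
        if f(x) < y:
            c = x  # largest grid point found so far that lies in A
    return c


c = approx_sup_below(lambda x: x * x, 0.0, 2.0, 2.0)
```

In this example `c` is close to $\sqrt2$ and `c * c` is close to 2, in line with the claim $f(c)=y$; the grid only approximates the supremum to within the step size.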
So you're saying this?

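$$a^*\in(c-\delta,c]\subseteq A\ \Rightarrow\ f(a^*)-\varepsilon<f(c)<f(a^*)+\varepsilon<y+\varepsilon$$

$$a^{**}\in[c,c+\delta)\nsubseteq A\ \Rightarrow\ y-\varepsilon<f(a^{**})-\varepsilon<f(c)<f(a^{**})+\varepsilon$$

$$a^*<a^{**}\ \Rightarrow\ f(a^*)<f(a^{**})$$

$$f(a^*)-\varepsilon<y-\varepsilon<f(a^{**})-\varepsilon<f(c)<f(a^*)+\varepsilon<y+\varepsilon<f(a^{**})+\varepsilon$$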

יהודה שמחה ולדמן (talk) 00:45, 10 January 2017 (UTC)
Well, what you've written isn't correct. It's not guaranteed that $(c-\delta,c]\subseteq A$. What we do have is that $a^*\in(c-\delta,c]$ and also $a^*\in A$.
Likewise, it's not sufficient that $[c,c+\delta)\nsubseteq A$; you need that $a^{**}\notin A$. You do have $(c,c+\delta)\cap A=\emptyset$ (which is quite different from $(c,c+\delta)\nsubseteq A$), from which it follows that $a^{**}\notin A$.
Also, from $a^*<a^{**}$ it doesn't follow directly that $f(a^*)<f(a^{**})$ as you have implied, since we're not given that $f$ is strictly increasing. It is true anyway (since $a^*\in A$ and $a^{**}\notin A$, so $f(a^*)<y$ and $f(a^{**})\ge y$, so $f(a^*)<f(a^{**})$). But you don't have to use this fact; you can simply follow the proof you've given in the question. In part one you've proven that $f(c)<y+\varepsilon$, and in the second part you've proven that $f(c)>y-\varepsilon$. Putting this together, you have $y-\varepsilon<f(c)<y+\varepsilon$ for every $\varepsilon>0$, which means that $f(c)=y$. -- Meni Rosenfeld (talk) 10:49, 10 January 2017 (UTC)
By the way, the property that the supremum exists for every bounded subset is actually the least upper bound property, which is not the same thing as the reals forming a complete metric space. I strongly suggest reading up on the former concept, which is absolutely fundamental to real analysis.
By the way, as Jenny Harrison once advised me, writing your whole comment in just symbols makes it hard to follow - write it out in words.--Jasper Deng (talk) 05:48, 10 January 2017 (UTC)
Note that according to the page you've linked, "completeness" is one of the names of the least upper bound property. It appears (I don't remember all the nuances) that this is a special case of completeness in order theory, which is distinct from (though probably related to) completeness of metric spaces. -- Meni Rosenfeld (talk) 10:53, 10 January 2017 (UTC)