Quartic equation

Polynomial equation

In mathematics, a quartic equation is one which can be expressed as a quartic function equaling zero. The general form of a quartic equation is

Graph of a polynomial function of degree 4, with its 4 roots and 3 critical points.
a x 4 + b x 3 + c x 2 + d x + e = 0 {\displaystyle ax^{4}+bx^{3}+cx^{2}+dx+e=0\,}

where a ≠ 0.

The quartic is the highest order polynomial equation that can be solved by radicals in the general case (i.e., one in which the coefficients can take any value).

History

Lodovico Ferrari is attributed with the discovery of the solution to the quartic in 1540, but since this solution, like all algebraic solutions of the quartic, requires the solution of a cubic to be found, it could not be published immediately.[1] The solution of the quartic was published together with that of the cubic by Ferrari's mentor Gerolamo Cardano in the book Ars Magna (1545).

The proof that this was the highest order general polynomial for which such solutions could be found was first given in the Abel–Ruffini theorem in 1824, proving that all attempts at solving the higher order polynomials would be futile. The notes left by Évariste Galois before his death in a duel in 1832 later led to an elegant complete theory of the roots of polynomials, of which this theorem was one result.[2]

Solving a quartic equation, special cases

Consider a quartic equation expressed in the form a 0 x 4 + a 1 x 3 + a 2 x 2 + a 3 x + a 4 = 0 {\displaystyle a_{0}x^{4}+a_{1}x^{3}+a_{2}x^{2}+a_{3}x+a_{4}=0} :

There exists a general formula for finding the roots to quartic equations, provided the coefficient of the leading term is non-zero. However, since the general method is quite complex and susceptible to errors in execution, it is better to apply one of the special cases listed below if possible.

Degenerate case

If the constant term a4 = 0, then one of the roots is x = 0, and the other roots can be found by dividing by x, and solving the resulting cubic equation,

a 0 x 3 + a 1 x 2 + a 2 x + a 3 = 0. {\displaystyle a_{0}x^{3}+a_{1}x^{2}+a_{2}x+a_{3}=0.\,}

Evident roots: 1 and −1 and −k

Call our quartic polynomial Q(x). Since 1 raised to any power is 1,

Q ( 1 ) = a 0 + a 1 + a 2 + a 3 + a 4   . {\displaystyle Q(1)=a_{0}+a_{1}+a_{2}+a_{3}+a_{4}\ .}

Thus if   a 0 + a 1 + a 2 + a 3 + a 4 = 0   , {\displaystyle \ a_{0}+a_{1}+a_{2}+a_{3}+a_{4}=0\ ,} Q(1) = 0 and so x = 1 is a root of Q(x). It can similarly be shown that if   a 0 + a 2 + a 4 = a 1 + a 3   , {\displaystyle \ a_{0}+a_{2}+a_{4}=a_{1}+a_{3}\ ,} x = −1 is a root.

In either case the full quartic can then be divided by the factor (x − 1) or (x + 1) respectively yielding a new cubic polynomial, which can be solved to find the quartic's other roots.

If   a 1 = a 0 k   , {\displaystyle \ a_{1}=a_{0}k\ ,}   a 2 = 0   {\displaystyle \ a_{2}=0\ } and   a 4 = a 3 k   , {\displaystyle \ a_{4}=a_{3}k\ ,} then   x = k   {\displaystyle \ x=-k\ } is a root of the equation. The full quartic can then be factorized this way:

  a 0 x 4 + a 0 k x 3 + a 3 x + a 3 k = a 0 x 3 ( x + k ) + a 3 ( x + k ) = ( a 0 x 3 + a 3 ) ( x + k )   . {\displaystyle \ a_{0}x^{4}+a_{0}kx^{3}+a_{3}x+a_{3}k=a_{0}x^{3}(x+k)+a_{3}(x+k)=(a_{0}x^{3}+a_{3})(x+k)\ .}

Alternatively, if   a 1 = a 0 k   , {\displaystyle \ a_{1}=a_{0}k\ ,}   a 3 = a 2 k   , {\displaystyle \ a_{3}=a_{2}k\ ,} and   a 4 = 0   , {\displaystyle \ a_{4}=0\ ,} then x = 0 and x = −k become two known roots. Q(x) divided by x(x + k) is a quadratic polynomial.

Biquadratic equations

A quartic equation where a3 and a1 are equal to 0 takes the form

a 0 x 4 + a 2 x 2 + a 4 = 0 {\displaystyle a_{0}x^{4}+a_{2}x^{2}+a_{4}=0\,\!}

and thus is a biquadratic equation, which is easy to solve: let z = x 2 {\displaystyle z=x^{2}} , so our equation turns to

a 0 z 2 + a 2 z + a 4 = 0 {\displaystyle a_{0}z^{2}+a_{2}z+a_{4}=0\,\!}

which is a simple quadratic equation, whose solutions are easily found using the quadratic formula:

z = a 2 ± a 2 2 4 a 0 a 4 2 a 0 {\displaystyle z={\frac {-a_{2}\pm {\sqrt {a_{2}^{2}-4a_{0}a_{4}}}}{2a_{0}}}\,\!}

When we've solved it (i.e. found these two z values), we can extract x from them

x 1 = + z + {\displaystyle x_{1}=+{\sqrt {z_{+}}}\,\!}
x 2 = z + {\displaystyle x_{2}=-{\sqrt {z_{+}}}\,\!}
x 3 = + z {\displaystyle x_{3}=+{\sqrt {z_{-}}}\,\!}
x 4 = z {\displaystyle x_{4}=-{\sqrt {z_{-}}}\,\!}

If either of the z solutions were negative or complex numbers, then some of the x solutions are complex numbers.

Quasi-symmetric equations

a 0 x 4 + a 1 x 3 + a 2 x 2 + a 1 m x + a 0 m 2 = 0 {\displaystyle a_{0}x^{4}+a_{1}x^{3}+a_{2}x^{2}+a_{1}mx+a_{0}m^{2}=0\,}

Steps:

  1. Divide by x 2.
  2. Use variable change z = x + m/x.
  3. So, z 2 = x 2 + (m/x) 2 + 2m.

This leads to:

a 0 ( x 2 + m 2 / x 2 ) + a 1 ( x + m / x ) + a 2 = 0 {\displaystyle a_{0}(x^{2}+m^{2}/x^{2})+a_{1}(x+m/x)+a_{2}=0} ,
a 0 ( z 2 2 m ) + a 1 ( z ) + a 2 = 0 {\displaystyle a_{0}(z^{2}-2m)+a_{1}(z)+a_{2}=0} ,
z 2 + ( a 1 / a 0 ) z + ( a 2 / a 0 2 m ) = 0 {\displaystyle z^{2}+(a_{1}/a_{0})z+(a_{2}/a_{0}-2m)=0} (a quadratic in z = x + m/x)

Multiple roots

If the quartic has a double root, it can be found by taking the polynomial greatest common divisor with its derivative. Then they can be divided out and the resulting quadratic equation solved.

In general, there exist only four possible cases of quartic equations with multiple roots, which are listed below:[3]

  1. Multiplicity-4 (M4): when the general quartic equation can be expressed as a ( x l ) 4 = 0 {\displaystyle a(x-l)^{4}=0} , for some real number l {\displaystyle l} . This case can always be reduced to a biquadratic equation.
  2. Multiplicity-3 (M3): when the general quartic equation can be expressed as a ( x l ) 3 ( x m ) = 0 {\displaystyle a(x-l)^{3}(x-m)=0} , where l {\displaystyle l}  and m {\displaystyle m}  are a couple of two different real numbers. This is the only case that can never be reduced to a biquadratic equation.
  3. Double Multiplicity-2 (DM2): when the general quartic equation can be expressed as a ( x l ) 2 ( x m ) 2 = 0 {\displaystyle a(x-l)^{2}(x-m)^{2}=0} , where l {\displaystyle l}  and m {\displaystyle m}  are a couple of two different real numbers or a couple of non-real complex conjugate numbers. This case can also always be reduced to a biquadratic equation.
  4. Single Multiplicity-2 (SM2): when the general quartic equation can be expressed as a ( x l ) 2 ( x m ) ( x n ) = 0 {\displaystyle a(x-l)^{2}(x-m)(x-n)=0} , where l {\displaystyle l} m {\displaystyle m} , and n {\displaystyle n}  are three different real numbers or l {\displaystyle l} is a real number and m {\displaystyle m}  and n {\displaystyle n} are a couple of non-real complex conjugate numbers. This case is divided into two subcases, those that can be reduced to a biquadratic equation and those in which this is impossible.

So, if the three non-monic coefficients of the depressed quartic equation, x 4 + p x 2 + q x + r = 0 {\displaystyle x^{4}+px^{2}+qx+r=0} , in terms of the five coefficients of the general quartic equation are given as follows: p = 8 a c 3 b 2 8 a 2 {\displaystyle p={\frac {8ac-3b^{2}}{8a^{2}}}} , q = b 3 4 a b c + 8 a 2 d 8 a 3 {\displaystyle q={\frac {b^{3}-4abc+8a^{2}d}{8a^{3}}}}  and r = 16 a b 2 c 64 a 2 b d 3 b 4 + 256 a 3 e 256 a 4 {\displaystyle r={\frac {16ab^{2}c-64a^{2}bd-3b^{4}+256a^{3}e}{256a^{4}}}} , then the criteria to identify a priori each case of quartic equations with multiple roots and their respective solutions are exposed below.

  • M4. The general quartic equation corresponds to this case whenever p = q = r = 0 {\displaystyle p=q=r=0} , so the four roots of this equation are given as follows: x 1 = x 2 = x 3 = x 4 = b 4 a {\displaystyle x_{1}=x_{2}=x_{3}=x_{4}=-{\frac {b}{4a}}} .
  • M3. The general quartic equation corresponds to this case whenever p 2 = 12 r > 0 {\displaystyle p^{2}=-12r>0} and 27 q 2 = 8 p 3 > 0 {\displaystyle 27q^{2}=-8p^{3}>0} , so the four roots of this equation are given as follows: x 1 = x 2 = x 3 = p 6 b 4 a {\displaystyle x_{1}=x_{2}=x_{3}={\sqrt {-{\frac {p}{6}}}}-{\frac {b}{4a}}}   and  x 4 = 3 p 2 b 4 a {\displaystyle x_{4}=-{\sqrt {-{\frac {3p}{2}}}}-{\frac {b}{4a}}} , whether q > 0 {\displaystyle q>0} ; otherwise, x 1 = x 2 = x 3 = p 6 b 4 a {\displaystyle x_{1}=x_{2}=x_{3}=-{\sqrt {-{\frac {p}{6}}}}-{\frac {b}{4a}}}   and  x 4 = 3 p 2 b 4 a {\displaystyle x_{4}={\sqrt {-{\frac {3p}{2}}}}-{\frac {b}{4a}}} .
  • DM2. The general quartic equation corresponds to this case whenever p 2 = 4 r > 0 = q {\displaystyle p^{2}=4r>0=q} , so the four roots of this equation are given as follows: x 1 = x 3 = p 2 b 4 a {\displaystyle x_{1}=x_{3}={\sqrt {-{\frac {p}{2}}}}-{\frac {b}{4a}}}  and x 2 = x 4 = p 2 b 4 a {\displaystyle x_{2}=x_{4}=-{\sqrt {-{\frac {p}{2}}}}-{\frac {b}{4a}}} .
  • Biquadratic SM2. The general quartic equation corresponds to this subcase of the SM2 equations whenever p q = r = 0 {\displaystyle p\neq q=r=0} , so the four roots of this equation are given as follows: x 1 = x 2 = b 4 a {\displaystyle x_{1}=x_{2}=-{\frac {b}{4a}}} , x 3 = p b 4 a {\displaystyle x_{3}={\sqrt {-p}}-{\frac {b}{4a}}}  and x 4 = p b 4 a {\displaystyle x_{4}=-{\sqrt {-p}}-{\frac {b}{4a}}} .
  • Non-Biquadratic SM2. The general quartic equation corresponds to this subcase of the SM2 equations whenever ( p 2 + 12 r ) 3 = [ p ( p 2 36 r ) + 27 2 q 2 ] 2 > 0 q {\displaystyle (p^{2}+12r)^{3}=[p(p^{2}-36r)+{\frac {27}{2}}q^{2}]^{2}>0\neq {q}} , so the four roots of this equation are given by the following formula:[4] x = 1 2 [ ξ s 1 ± 2 ( s 2 ξ q s 1 ) ] b 4 a {\displaystyle x={\frac {1}{2}}\left[\xi {\sqrt {s_{1}}}\pm {\sqrt {2{\biggl (}s_{2}-{\frac {\xi q}{\sqrt {s_{1}}}}{\biggr )}}}\right]-{\frac {b}{4a}}} , where s 1 = 9 q 2 32 p r p 2 + 12 r > 0 {\displaystyle s_{1}={\frac {9q^{2}-32pr}{p^{2}+12r}}>0} , s 2 = 2 p ( p 2 4 r ) + 9 q 2 2 ( p 2 + 12 r ) 0 {\displaystyle s_{2}=-{\frac {2p(p^{2}-4r)+9q^{2}}{2(p^{2}+12r)}}\neq 0} and ξ = ± 1 {\displaystyle \xi =\pm 1} .

The general case

The quartic formula.

To begin, the quartic must first be converted to a depressed quartic.

Converting to a depressed quartic

Let

  A x 4 + B x 3 + C x 2 + D x + E = 0   {\displaystyle \ Ax^{4}+Bx^{3}+Cx^{2}+Dx+E=0\ }

 

 

 

 

(1')

be the general quartic equation which it is desired to solve. Divide both sides by A,

  x 4 + B A x 3 + C A x 2 + D A x + E A = 0   . {\displaystyle \ x^{4}+{B \over A}x^{3}+{C \over A}x^{2}+{D \over A}x+{E \over A}=0\ .}

The first step, if B is not already zero, should be to eliminate the x3 term. To do this, change variables from x to u, such that

  x = u B 4 A   . {\displaystyle \ x=u-{B \over 4A}\ .}

Then

  ( u B 4 A ) 4 + B A ( u B 4 A ) 3 + C A ( u B 4 A ) 2 + D A ( u B 4 A ) + E A = 0   . {\displaystyle \ \left(u-{B \over 4A}\right)^{4}+{B \over A}\left(u-{B \over 4A}\right)^{3}+{C \over A}\left(u-{B \over 4A}\right)^{2}+{D \over A}\left(u-{B \over 4A}\right)+{E \over A}=0\ .}

Expanding the powers of the binomials produces

  ( u 4 B A u 3 + 6 u 2 B 2 16 A 2 4 u B 3 64 A 3 + B 4 256 A 4 ) + B A ( u 3 3 u 2 B 4 A + 3 u B 2 16 A 2 B 3 64 A 3 ) + C A ( u 2 u B 2 A + B 2 16 A 2 ) + D A ( u B 4 A ) + E A = 0   . {\displaystyle \ \left(u^{4}-{B \over A}u^{3}+{6u^{2}B^{2} \over 16A^{2}}-{4uB^{3} \over 64A^{3}}+{B^{4} \over 256A^{4}}\right)+{B \over A}\left(u^{3}-{3u^{2}B \over 4A}+{3uB^{2} \over 16A^{2}}-{B^{3} \over 64A^{3}}\right)+{C \over A}\left(u^{2}-{uB \over 2A}+{B^{2} \over 16A^{2}}\right)+{D \over A}\left(u-{B \over 4A}\right)+{E \over A}=0\ .}

Collecting the same powers of u yields

  u 4 + ( 3 B 2 8 A 2 + C A ) u 2 + ( B 3 8 A 3 B C 2 A 2 + D A ) u + ( 3 B 4 256 A 4 + C B 2 16 A 3 B D 4 A 2 + E A ) = 0   . {\displaystyle \ u^{4}+\left({-3B^{2} \over 8A^{2}}+{C \over A}\right)u^{2}+\left({B^{3} \over 8A^{3}}-{BC \over 2A^{2}}+{D \over A}\right)u+\left({-3B^{4} \over 256A^{4}}+{CB^{2} \over 16A^{3}}-{BD \over 4A^{2}}+{E \over A}\right)=0\ .}

Now rename the coefficients of u. Let

a = 3 B 2 8 A 2 + C A   , b = B 3 8 A 3 B C 2 A 2 + D A   , c = 3 B 4 256 A 4 + C B 2 16 A 3 B D 4 A 2 + E A   . {\displaystyle {\begin{aligned}a&={-3B^{2} \over 8A^{2}}+{C \over A}\ ,\\b&={B^{3} \over 8A^{3}}-{BC \over 2A^{2}}+{D \over A}\ ,\\c&={-3B^{4} \over 256A^{4}}+{CB^{2} \over 16A^{3}}-{BD \over 4A^{2}}+{E \over A}\ .\end{aligned}}}

The resulting equation is

  u 4 + a u 2 + b u + c = 0   {\displaystyle \ u^{4}+au^{2}+bu+c=0\ }

 

 

 

 

(1)

which is a depressed quartic equation.

If   b = 0   {\displaystyle \ b=0\ } then we have the special case of a biquadratic equation, which is easily solved, as explained above. Note that the general solution, given below, will not work for the special case   b = 0   . {\displaystyle \ b=0\ .} The equation must be solved as a biquadratic.

In either case, once the depressed quartic is solved for u, substituting those values into

  x = u B 4 A   {\displaystyle \ x=u-{B \over 4A}\ }

produces the values for x that solve the original quartic.

Solving a depressed quartic when b ≠ 0

After converting to a depressed quartic equation

u 4 + a u 2 + b u + c = 0 {\displaystyle u^{4}+au^{2}+bu+c=0}

and excluding the special case b = 0, which is solved as a biquadratic, we assume from here on that b ≠ 0 .

We will separate the terms left and right as

u 4 = a u 2 b u c {\displaystyle u^{4}=-au^{2}-bu-c}

and add in terms to both sides which make them both into perfect squares.

Let y be any solution of this cubic equation:

2 y 3 a y 2 2 c y + ( a c 1 4 b 2 ) = ( 2 y a ) ( y 2 c ) 1 4 b 2 = 0   . {\displaystyle 2y^{3}-ay^{2}-2cy+(ac-{\tfrac {1}{4}}b^{2})=(2y-a)(y^{2}-c)-{\tfrac {1}{4}}b^{2}=0\ .}

Then (since b ≠ 0)

2 y a 0 {\displaystyle 2y-a\neq 0}

so we may divide by it, giving

y 2 c = b 2 4 ( 2 y a )   . {\displaystyle y^{2}-c={\frac {b^{2}}{4(2y-a)}}\ .}

Then

( u 2 + y ) 2 = u 4 + 2 y u 2 + y 2 = ( 2 y a ) u 2 b u + ( y 2 c ) = ( 2 y a ) u 2 b u + b 2   4 ( 2 y a )   = ( 2 y a   u b 2 2 y a   ) 2   . {\displaystyle (u^{2}+y)^{2}=u^{4}+2yu^{2}+y^{2}=(2y-a)u^{2}-bu+(y^{2}-c)=(2y-a)u^{2}-bu+{\frac {b^{2}}{\ 4(2y-a)\ }}=\left({\sqrt {2y-a\ }}\,u-{\frac {b}{2{\sqrt {2y-a\ }}}}\right)^{2}\ .}

Subtracting, we get the difference of two squares which is the product of the sum and difference of their roots

( u 2 + y ) 2 ( 2 y a   u b 2 2 y a   ) 2 = ( u 2 + y + 2 y a   u b 2 2 y a   ) ( u 2 + y 2 y a   u + b 2 2 y a   ) = 0 {\displaystyle (u^{2}+y)^{2}-\left({\sqrt {2y-a\ }}\,u-{\frac {b}{2{\sqrt {2y-a\ }}}}\right)^{2}=\left(u^{2}+y+{\sqrt {2y-a\ }}\,u-{\frac {b}{2{\sqrt {2y-a\ }}}}\right)\left(u^{2}+y-{\sqrt {2y-a\ }}\,u+{\frac {b}{2{\sqrt {2y-a\ }}}}\right)=0}

which can be solved by applying the quadratic formula to each of the two factors. So the possible values of u are:

u = 1 2 ( 2 y a   + 2 y a + 2 b 2 y a     )   , {\displaystyle u={\tfrac {1}{2}}\left(-{\sqrt {2y-a\ }}+{\sqrt {-2y-a+{\frac {2b}{\sqrt {2y-a\ }}}\ }}\right)\ ,}
u = 1 2 ( 2 y a   2 y a + 2 b 2 y a     )   , {\displaystyle u={\tfrac {1}{2}}\left(-{\sqrt {2y-a\ }}-{\sqrt {-2y-a+{\frac {2b}{\sqrt {2y-a\ }}}\ }}\right)\ ,}
u = 1 2 ( 2 y a   + 2 y a 2 b 2 y a     )   , {\displaystyle u={\tfrac {1}{2}}\left({\sqrt {2y-a\ }}+{\sqrt {-2y-a-{\frac {2b}{\sqrt {2y-a\ }}}\ }}\right)\ ,} or
u = 1 2 ( 2 y a   2 y a 2 b 2 y a     )   . {\displaystyle u={\tfrac {1}{2}}\left({\sqrt {2y-a\ }}-{\sqrt {-2y-a-{\frac {2b}{\sqrt {2y-a\ }}}\ }}\right)\ .}

Using another y from among the three roots of the cubic simply causes these same four values of u to appear in a different order. The solutions of the cubic are:

  y = a 6 + w p 3 w   {\displaystyle \ y={\frac {a}{6}}+w-{\frac {p}{3w}}\ }
  w = q 2 + q 2 4 + p 3 27     3 {\displaystyle \ w={\sqrt[{3}]{-{\frac {q}{2}}+{\sqrt {{\frac {q^{2}}{4}}+{\frac {p^{3}}{27}}\ }}\ }}}

using any one of the three possible cube roots. A wise strategy is to choose the sign of the square-root that makes the absolute value of w as large as possible.

  p = a 2 12 c   , {\displaystyle \ p=-{\frac {a^{2}}{12}}-c\ ,}
  q = a 3 108 + a c 3 b 2 8   . {\displaystyle \ q=-{\frac {a^{3}}{108}}+{\frac {ac}{3}}-{\frac {b^{2}}{8}}\ .}

Ferrari's solution

Otherwise, the depressed quartic can be solved by means of a method discovered by Lodovico Ferrari. Once the depressed quartic has been obtained, the next step is to add the valid identity

( u 2 + a ) 2 u 4 2 a u 2 = a 2 {\displaystyle \left(u^{2}+a\right)^{2}-u^{4}-2au^{2}=a^{2}}

to equation (1), yielding

( u 2 + a ) 2 + b u + c = a u 2 + a 2 . {\displaystyle \left(u^{2}+a\right)^{2}+bu+c=au^{2}+a^{2}.}

 

 

 

 

(2)

The effect has been to fold up the u4 term into a perfect square: (u2 + a)2. The second term, au2 did not disappear, but its sign has changed and it has been moved to the right side.

The next step is to insert a variable y into the perfect square on the left side of equation (2), and a corresponding 2y into the coefficient of u2 in the right side. To accomplish these insertions, the following valid formulas will be added to equation (2),

( u 2 + a + y ) 2 ( u 2 + a ) 2 = 2 y ( u 2 + a ) + y 2     = 2 y u 2 + 2 y a + y 2 , {\displaystyle {\begin{aligned}(u^{2}+a+y)^{2}-(u^{2}+a)^{2}&=2y(u^{2}+a)+y^{2}\ \ \\&=2yu^{2}+2ya+y^{2},\end{aligned}}}

and

0 = ( a + 2 y ) u 2 2 y u 2 a u 2 {\displaystyle 0=(a+2y)u^{2}-2yu^{2}-au^{2}\,}

These two formulas, added together, produce

( u 2 + a + y ) 2 ( u 2 + a ) 2 = ( a + 2 y ) u 2 a u 2 + 2 y a + y 2 ( y -insertion ) {\displaystyle \left(u^{2}+a+y\right)^{2}-\left(u^{2}+a\right)^{2}=\left(a+2y\right)u^{2}-au^{2}+2ya+y^{2}\qquad \qquad (y{\hbox{-insertion}})\,}

which added to equation (2) produces

( u 2 + a + y ) 2 + b u + c = ( a + 2 y ) u 2 + ( 2 y a + y 2 + a 2 ) . {\displaystyle \left(u^{2}+a+y\right)^{2}+bu+c=\left(a+2y\right)u^{2}+\left(2ya+y^{2}+a^{2}\right).\,}

This is equivalent to

( u 2 + a + y ) 2 = ( a + 2 y ) u 2 b u + ( y 2 + 2 y a + a 2 c ) . {\displaystyle (u^{2}+a+y)^{2}=(a+2y)u^{2}-bu+(y^{2}+2ya+a^{2}-c).}

 

 

 

 

(3)

The objective now is to choose a value for y such that the right side of equation (3) becomes a perfect square. This can be done by letting the discriminant of the quadratic function become zero. To explain this, first expand a perfect square so that it equals a quadratic function:

( s u + t ) 2 = ( s 2 ) u 2 + ( 2 s t ) u + ( t 2 ) . {\displaystyle \left(su+t\right)^{2}=\left(s^{2}\right)u^{2}+\left(2st\right)u+\left(t^{2}\right).\,}

The quadratic function on the right side has three coefficients. It can be verified that squaring the second coefficient and then subtracting four times the product of the first and third coefficients yields zero:

( 2 s t ) 2 4 ( s 2 ) ( t 2 ) = 0. {\displaystyle \left(2st\right)^{2}-4\left(s^{2}\right)\left(t^{2}\right)=0.\,}

Therefore to make the right side of equation (3) into a perfect square, the following equation must be solved:

( b ) 2 4 ( 2 y + a ) ( y 2 + 2 y a + a 2 c ) = 0. {\displaystyle (-b)^{2}-4\left(2y+a\right)\left(y^{2}+2ya+a^{2}-c\right)=0.\,}

Multiply the binomial with the polynomial,

b 2 4 ( 2 y 3 + 5 a y 2 + ( 4 a 2 2 c ) y + ( a 3 a c ) ) = 0 {\displaystyle b^{2}-4\left(2y^{3}+5ay^{2}+\left(4a^{2}-2c\right)y+\left(a^{3}-ac\right)\right)=0\,}

Divide both sides by −4, and move the −b2/4 to the right,

2 y 3 + 5 a y 2 + ( 4 a 2 2 c ) y + ( a 3 a c b 2 4 ) = 0 {\displaystyle 2y^{3}+5ay^{2}+\left(4a^{2}-2c\right)y+\left(a^{3}-ac-{\frac {b^{2}}{4}}\right)=0}

Divide both sides by 2,

y 3 + 5 2 a y 2 + ( 2 a 2 c ) y + ( a 3 2 a c 2 b 2 8 ) = 0. {\displaystyle y^{3}+{\frac {5}{2}}ay^{2}+\left(2a^{2}-c\right)y+\left({a^{3} \over 2}-{ac \over 2}-{b^{2} \over 8}\right)=0.}

 

 

 

 

(4)

This is a cubic equation in y. Solve for y using any method for solving such equations (e.g. conversion to a reduced cubic and application of Cardano's formula). Any of the three possible roots will do.

Folding the second perfect square

With the value for y so selected, it is now known that the right side of equation (3) is a perfect square of the form

( s 2 ) u 2 + ( 2 s t ) u + ( t 2 ) = ( ( s 2 ) u + ( 2 s t ) 2 s 2 ) 2 {\displaystyle \left(s^{2}\right)u^{2}+(2st)u+\left(t^{2}\right)=\left(\left({\sqrt {s^{2}}}\right)u+{(2st) \over 2{\sqrt {s^{2}}}}\right)^{2}}
(This is correct for both signs of square root, as long as the same sign is taken for both square roots. A ± is redundant, as it would be absorbed by another ± a few equations further down this page.)

so that it can be folded:

( a + 2 y ) u 2 + ( b ) u + ( y 2 + 2 y a + a 2 c ) = ( ( a + 2 y ) u + ( b ) 2 a + 2 y ) 2 . {\displaystyle (a+2y)u^{2}+(-b)u+\left(y^{2}+2ya+a^{2}-c\right)=\left(\left({\sqrt {a+2y}}\right)u+{(-b) \over 2{\sqrt {a+2y}}}\right)^{2}.}
Note: If b ≠ 0 then a + 2y ≠ 0. If b = 0 then this would be a biquadratic equation, which we solved earlier.

Therefore equation (3) becomes

( u 2 + a + y ) 2 = ( ( a + 2 y ) u b 2 a + 2 y ) 2 . {\displaystyle \left(u^{2}+a+y\right)^{2}=\left(\left({\sqrt {a+2y}}\right)u-{b \over 2{\sqrt {a+2y}}}\right)^{2}.}

 

 

 

 

(5)

Equation (5) has a pair of folded perfect squares, one on each side of the equation. The two perfect squares balance each other.

If two squares are equal, then the sides of the two squares are also equal, as shown by:

( u 2 + a + y ) = ± ( ( a + 2 y ) u b 2 a + 2 y ) . {\displaystyle \left(u^{2}+a+y\right)=\pm \left(\left({\sqrt {a+2y}}\right)u-{b \over 2{\sqrt {a+2y}}}\right).}

 

 

 

 

(5')

Collecting like powers of u produces

u 2 + ( s a + 2 y ) u + ( a + y ± s b 2 a + 2 y ) = 0. {\displaystyle u^{2}+\left(\mp _{s}{\sqrt {a+2y}}\right)u+\left(a+y\pm _{s}{b \over 2{\sqrt {a+2y}}}\right)=0.}

 

 

 

 

(6)

Note: The subscript s of ± s {\displaystyle \pm _{s}} and s {\displaystyle \mp _{s}} is to note that they are dependent.

Equation (6) is a quadratic equation for u. Its solution is

u = ± s a + 2 y ± t ( a + 2 y ) 4 ( a + y ± s b 2 a + 2 y ) 2 . {\displaystyle u={\frac {\pm _{s}{\sqrt {a+2y}}\pm _{t}{\sqrt {(a+2y)-4\left(a+y\pm _{s}{b \over 2{\sqrt {a+2y}}}\right)}}}{2}}.}

Simplifying, one gets

u = ± s a + 2 y ± t ( 3 a + 2 y ± s 2 b a + 2 y ) 2 . {\displaystyle u={\pm _{s}{\sqrt {a+2y}}\pm _{t}{\sqrt {-\left(3a+2y\pm _{s}{2b \over {\sqrt {a+2y}}}\right)}} \over 2}.}

This is the solution of the depressed quartic, therefore the solutions of the original quartic equation are

x = B 4 A + ± s a + 2 y ± t ( 3 a + 2 y ± s 2 b a + 2 y ) 2 . {\displaystyle x=-{B \over 4A}+{\pm _{s}{\sqrt {a+2y}}\pm _{t}{\sqrt {-\left(3a+2y\pm _{s}{2b \over {\sqrt {a+2y}}}\right)}} \over 2}.}

 

 

 

 

(6')

Remember: The two ± s {\displaystyle \pm _{s}} come from the same place in equation (5'), and should both have the same sign, while the sign of ± t {\displaystyle \pm _{t}} is independent.

Summary of Ferrari's method

Given the quartic equation

A x 4 + B x 3 + C x 2 + D x + E = 0 , {\displaystyle Ax^{4}+Bx^{3}+Cx^{2}+Dx+E=0,\,}

its solution can be found by means of the following calculations:

a = 3 B 2 8 A 2 + C A , {\displaystyle a=-{3B^{2} \over 8A^{2}}+{C \over A},}
b = B 3 8 A 3 B C 2 A 2 + D A , {\displaystyle b={B^{3} \over 8A^{3}}-{BC \over 2A^{2}}+{D \over A},}
c = 3 B 4 256 A 4 + C B 2 16 A 3 B D 4 A 2 + E A . {\displaystyle c=-{3B^{4} \over 256A^{4}}+{CB^{2} \over 16A^{3}}-{BD \over 4A^{2}}+{E \over A}.}

If b = 0 , {\displaystyle \,b=0,} then

x = B 4 A ± s a ± t a 2 4 c 2 (for  b = 0  only) . {\displaystyle x=-{B \over 4A}\pm _{s}{\sqrt {-a\pm _{t}{\sqrt {a^{2}-4c}} \over 2}}\qquad {\mbox{(for }}b=0{\mbox{ only)}}.}

Otherwise, continue with

P = a 2 12 c , {\displaystyle P=-{a^{2} \over 12}-c,}
Q = a 3 108 + a c 3 b 2 8 , {\displaystyle Q=-{a^{3} \over 108}+{ac \over 3}-{b^{2} \over 8},}
R = Q 2 ± Q 2 4 + P 3 27 , {\displaystyle R=-{Q \over 2}\pm {\sqrt {{Q^{2} \over 4}+{P^{3} \over 27}}},}

(either sign of the square root will do)

U = R 3 , {\displaystyle U={\sqrt[{3}]{R}},}

(there are 3 complex roots, any one of them will do)

y = 5 6 a + { U = 0 Q 3 U 0 , U P 3 U , {\displaystyle y=-{5 \over 6}a+{\begin{cases}U=0&\to -{\sqrt[{3}]{Q}}\\U\neq 0,&\to U-{P \over 3U},\end{cases}}\quad \quad \quad }
W = a + 2 y {\displaystyle W={\sqrt {a+2y}}}
x = B 4 A + ± s W ± t ( 3 a + 2 y ± s 2 b W ) 2 . {\displaystyle x=-{B \over 4A}+{\pm _{s}W\pm _{t}{\sqrt {-\left(3a+2y\pm _{s}{2b \over W}\right)}} \over 2}.}
The two ±s must have the same sign, the ±t is independent. To get all roots, compute x for ±st = +,+ and for +,−; and for −,+ and for −,−. This formula handles repeated roots without problem.

Ferrari was the first to discover one of these labyrinthine solutions[citation needed]. The equation which he solved was

x 4 + 6 x 2 60 x + 36 = 0 {\displaystyle x^{4}+6x^{2}-60x+36=0}

which was already in depressed form. It has a pair of solutions which can be found with the set of formulas shown above.

Ferrari's solution in the special case of real coefficients

If the coefficients of the quartic equation are real then the nested depressed cubic equation (5) also has real coefficients, thus it has at least one real root.

Furthermore the cubic function

C ( v ) = v 3 + P v + Q , {\displaystyle C(v)=v^{3}+Pv+Q,}

where P and Q are given by (5) has the properties that

C ( a 3 ) = b 2 8 < 0 {\displaystyle C\left({a \over 3}\right)={-b^{2} \over 8}<0} and

lim v C ( v ) = , {\displaystyle \lim _{v\to \infty }C(v)=\infty ,} where a and b are given by (1).

This means that (5) has a real root greater than a 3 {\displaystyle a \over 3} , and therefore that (4) has a real root greater than a 2 {\displaystyle -a \over 2} .

Using this root the term a + 2 y {\displaystyle {\sqrt {a+2y}}} in (8) is always real, which ensures that the two quadratic equations (8) have real coefficients.[5]

Obtaining alternative solutions the hard way

It could happen that one only obtained one solution through the formulae above, because not all four sign patterns are tried for four solutions, and the solution obtained is complex. It may also be the case that one is only looking for a real solution. Let x1 denote the complex solution. If all the original coefficients A, B, C, D and E are real—which should be the case when one desires only real solutions – then there is another complex solution x2 which is the complex conjugate of x1. If the other two roots are denoted as x3 and x4 then the quartic equation can be expressed as

( x x 1 ) ( x x 2 ) ( x x 3 ) ( x x 4 ) = 0 , {\displaystyle (x-x_{1})(x-x_{2})(x-x_{3})(x-x_{4})=0,\,}

but this quartic equation is equivalent to the product of two quadratic equations:

( x x 1 ) ( x x 2 ) = 0 {\displaystyle (x-x_{1})(x-x_{2})=0}

 

 

 

 

(9)

and

( x x 3 ) ( x x 4 ) = 0. {\displaystyle (x-x_{3})(x-x_{4})=0.}

 

 

 

 

(10)

Since

x 2 = x 1 {\displaystyle x_{2}=x_{1}^{\star }}

then

( x x 1 ) ( x x 2 ) = x 2 ( x 1 + x 1 ) x + x 1 x 1 = x 2 2 Re ( x 1 ) x + [ Re ( x 1 ) ] 2 + [ Im ( x 1 ) ] 2 . {\displaystyle {\begin{aligned}(x-x_{1})(x-x_{2})&=x^{2}-(x_{1}+x_{1}^{\star })x+x_{1}x_{1}^{\star }\\&=x^{2}-2\operatorname {Re} (x_{1})x+[\operatorname {Re} (x_{1})]^{2}+[\operatorname {Im} (x_{1})]^{2}.\end{aligned}}}

Let

a = 2 Re ( x 1 ) , {\displaystyle a=-2\operatorname {Re} (x_{1}),}
b = [ Re ( x 1 ) ] 2 + [ Im ( x 1 ) ] 2 {\displaystyle b=\left[\operatorname {Re} (x_{1})\right]^{2}+\left[\operatorname {Im} (x_{1})\right]^{2}}

so that equation (9) becomes

x 2 + a x + b = 0. {\displaystyle x^{2}+ax+b=0.}

 

 

 

 

(11)

Also let there be (unknown) variables w and v such that equation (10) becomes

x 2 + w x + v = 0. {\displaystyle x^{2}+wx+v=0.}

 

 

 

 

(12)

Multiplying equations (11) and (12) produces

x 4 + ( a + w ) x 3 + ( b + w a + v ) x 2 + ( w b + v a ) x + v b = 0. {\displaystyle x^{4}+(a+w)x^{3}+(b+wa+v)x^{2}+(wb+va)x+vb=0.}

 

 

 

 

(13)

Comparing equation (13) to the original quartic equation, it can be seen that

a + w = B A , {\displaystyle a+w={B \over A},}
b + w a + v = C A , {\displaystyle b+wa+v={C \over A},}
w b + v a = D A , {\displaystyle wb+va={D \over A},}

and

v b = E A . {\displaystyle vb={E \over A}.}

Therefore

w = B A a = B A + 2 Re ( x 1 ) , {\displaystyle w={B \over A}-a={B \over A}+2\operatorname {Re} (x_{1}),}
v = E A b = E A ( [ Re ( x 1 ) ] 2 + [ Im ( x 1 ) ] 2 ) . {\displaystyle v={E \over Ab}={\frac {E}{A\left(\left[\operatorname {Re} (x_{1})\right]^{2}+\left[\operatorname {Im} (x_{1})\right]^{2}\right)}}.}

Equation (12) can be solved for x yielding

x 3 = w + w 2 4 v 2 , {\displaystyle x_{3}={-w+{\sqrt {w^{2}-4v}} \over 2},}
x 4 = w w 2 4 v 2 . {\displaystyle x_{4}={-w-{\sqrt {w^{2}-4v}} \over 2}.}

One of these two solutions should be the desired real solution.

Alternative methods

Quick and memorable solution from first principles

Most textbook solutions of the quartic equation require a substitution that is hard to memorize. Here is an approach that makes it easy to understand. The job is done if we can factor the quartic equation into a product of two quadratics. Let

0 = x 4 + b x 3 + c x 2 + d x + e = ( x 2 + p x + q ) ( x 2 + r x + s ) = x 4 + ( p + r ) x 3 + ( q + s + p r ) x 2 + ( p s + q r ) x + q s {\displaystyle {\begin{aligned}0&=x^{4}+bx^{3}+cx^{2}+dx+e\\&=\left(x^{2}+px+q\right)\left(x^{2}+rx+s\right)\\&=x^{4}+(p+r)x^{3}+(q+s+pr)x^{2}+(ps+qr)x+qs\end{aligned}}}

By equating coefficients, this results in the following set of simultaneous equations:

b = p + r c = q + s + p r d = p s + q r e = q s {\displaystyle {\begin{aligned}b&=p+r\\c&=q+s+pr\\d&=ps+qr\\e&=qs\end{aligned}}}

This is harder to solve than it looks, but if we start again with a depressed quartic where b = 0 {\displaystyle b=0} , which can be obtained by substituting ( x b / 4 ) {\displaystyle (x-b/4)} for x {\displaystyle x} , then r = p {\displaystyle r=-p} , and:

c + p 2 = s + q d / p = s q e = s q {\displaystyle {\begin{aligned}c+p^{2}&=s+q\\d/p&=s-q\\e&=sq\end{aligned}}}

It's now easy to eliminate both s {\displaystyle s} and q {\displaystyle q} by doing the following:

( c + p 2 ) 2 ( d / p ) 2 = ( s + q ) 2 ( s q ) 2 = 4 s q = 4 e {\displaystyle {\begin{aligned}\left(c+p^{2}\right)^{2}-(d/p)^{2}&=(s+q)^{2}-(s-q)^{2}\\&=4sq\\&=4e\end{aligned}}}

If we set P = p 2 {\displaystyle P=p^{2}} , then this equation turns into the cubic equation:

P 3 + 2 c P 2 + ( c 2 4 e ) P d 2 = 0 {\displaystyle P^{3}+2cP^{2}+\left(c^{2}-4e\right)P-d^{2}=0}

which is solved elsewhere. Once you have p {\displaystyle p} , then:

r = p 2 s = c + p 2 + d / p 2 q = c + p 2 d / p {\displaystyle {\begin{aligned}r&=-p\\2s&=c+p^{2}+d/p\\2q&=c+p^{2}-d/p\end{aligned}}}

The symmetries in this solution are easy to see. There are three roots of the cubic, corresponding to the three ways that a quartic can be factored into two quadratics, and choosing positive or negative values of p {\displaystyle p} for the square root of P {\displaystyle P} merely exchanges the two quadratics with one another.

Galois theory and factorization

The symmetric group S4 on four elements has the Klein four-group as a normal subgroup. This suggests using a resolvent whose roots may be variously described as a discrete Fourier transform or a Hadamard matrix transform of the roots. Suppose ri for i from 0 to 3 are roots of

x 4 + b x 3 + c x 2 + d x + e = 0 ( 1 ) {\displaystyle x^{4}+bx^{3}+cx^{2}+dx+e=0\qquad (1)}

If we now set

s 0 = 1 2 ( r 0 + r 1 + r 2 + r 3 ) , s 1 = 1 2 ( r 0 r 1 + r 2 r 3 ) , s 2 = 1 2 ( r 0 + r 1 r 2 r 3 ) , s 3 = 1 2 ( r 0 r 1 r 2 + r 3 ) , {\displaystyle {\begin{aligned}s_{0}&={\tfrac {1}{2}}(r_{0}+r_{1}+r_{2}+r_{3}),\\s_{1}&={\tfrac {1}{2}}(r_{0}-r_{1}+r_{2}-r_{3}),\\s_{2}&={\tfrac {1}{2}}(r_{0}+r_{1}-r_{2}-r_{3}),\\s_{3}&={\tfrac {1}{2}}(r_{0}-r_{1}-r_{2}+r_{3}),\end{aligned}}}

then since the transformation is an involution, we may express the roots in terms of the four si in exactly the same way. Since we know the value s0 = −b/2, we really only need the values for s1, s2 and s3. These we may find by expanding the polynomial

( z 2 s 1 2 ) ( z 2 s 2 2 ) ( z 2 s 3 2 ) ( 2 ) {\displaystyle \left(z^{2}-s_{1}^{2}\right)\left(z^{2}-s_{2}^{2}\right)\left(z^{2}-s_{3}^{2}\right)\qquad (2)}

which if we make the simplifying assumption that b = 0, is equal to

z 6 + 2 c z 4 + ( c 2 4 e ) z 2 d 2 ( 3 ) {\displaystyle z^{6}+2cz^{4}+\left(c^{2}-4e\right)z^{2}-d^{2}\qquad (3)}

This polynomial is of degree six, but only of degree three in z2, and so the corresponding equation is solvable. By trial we can determine which three roots are the correct ones, and hence find the solutions of the quartic.

We can remove any requirement for trial by using a root of the same resolvent polynomial for factoring; if w is any root of (3), and if

F 1 = x 2 + w x + 1 2 w 2 + 1 2 c 1 2 c 2 w d 1 2 w 5 d c w 3 d + 2 e w d {\displaystyle F_{1}=x^{2}+wx+{\frac {1}{2}}w^{2}+{\frac {1}{2}}c-{\frac {1}{2}}\cdot {\frac {c^{2}w}{d}}-{\frac {1}{2}}\cdot {\frac {w^{5}}{d}}-{\frac {cw^{3}}{d}}+2{\frac {ew}{d}}}
F 2 = x 2 w x + 1 2 w 2 + 1 2 c + 1 2 w 5 d + c w 3 d 2 e w d + 1 2 c 2 w d {\displaystyle F_{2}=x^{2}-wx+{\frac {1}{2}}w^{2}+{\frac {1}{2}}c+{\frac {1}{2}}\cdot {\frac {w^{5}}{d}}+{\frac {cw^{3}}{d}}-2{\frac {ew}{d}}+{\frac {1}{2}}\cdot {\frac {c^{2}w}{d}}}

then

F 1 F 2 = x 4 + c x 2 + d x + e ( 4 ) {\displaystyle F_{1}F_{2}=x^{4}+cx^{2}+dx+e\qquad \qquad (4)}

We therefore can solve the quartic by solving for w and then solving for the roots of the two factors using the quadratic formula.

Approximate methods

The methods described above are, in principle, exact root-finding methods. It is also possible to use successive approximation methods which iteratively converge towards the roots, such as the Durand–Kerner method. Iterative methods are the only ones available for quintic and higher-order equations, beyond trivial or special cases.

See also

References

  • Ferrari's achievement
  • Quartic formula as four single equations at PlanetMath.

Notes

  1. ^ "Lodovico Ferrari".
  2. ^ Stewart, Ian, Galois Theory, Third Edition (Chapman & Hall/CRC Mathematics, 2004)
  3. ^ Chávez-Pichardo, Mauricio; Martínez-Cruz, Miguel A.; Trejo-Martínez, Alfredo; Martínez-Carbajal, Daniel; Arenas-Resendiz, Tanya (July 2022). "A Complete Review of the General Quartic Equation with Real Coefficients and Multiple Roots". Mathematics. 10 (14): 2377. doi:10.3390/math10142377. ISSN 2227-7390.
  4. ^ Chávez-Pichardo, Mauricio; Martínez-Cruz, Miguel A.; Trejo-Martínez, Alfredo; Vega-Cruz, Ana Beatriz; Arenas-Resendiz, Tanya (March 2023). "On the Practicality of the Analytical Solutions for all Third- and Fourth-Degree Algebraic Equations with Real Coefficients". Mathematics. 11 (6): 1447. doi:10.3390/math11061447. ISSN 2227-7390.
  5. ^ Carstensen, Jens, Komplekse tal, First Edition, (Systime 1981), ISBN 87-87454-71-8. (in Danish)

External links

  • Calculator for solving Quartics