![MathJax Logo](/templates/jsp/_style2/_tandf/pb2/images/math-jax.gif)
Abstract
Zero divided by zero is arguably the single most important concept underlying calculus. For functions of more than one variable, methods of proof for indeterminate limits are not as familiar as for functions of a single variable. We present a l’Hôpital’s rule that provides a way to simplify and resolve a wide variety of zero-over-zero limits in terms of quotients of their derivatives.
1 Introduction
It is interesting to reflect on how many times in mathematics the word “impossible” only means “you’re not ready to understand that yet.” At different stages of school we may be instructed that it is impossible to subtract 3 minus 5, or to take the square root of 6, or to take the square root of –1, only to find later that these were only impossible within the given framework that we knew. And even in mathematical research, a generally agreed-upon impossibility may melt away in the presence of a context-expanding new insight.
Now what about zero divided by zero? We are sternly informed in school that dividing by zero is impossible and must be shunned. Does that prohibition ever get softened?
In fact, a more accurate statement, (once we are ready to understand it) is that it is impossible to divide zero by zero out of context. All other arithmetic operations are independent of context, a rather remarkable fact. So six divided by three equals two, no matter whether you are sharing expenses with a couple of friends or making a fraction of a recipe of bread. Not so with zero divided by zero: if you take a photograph of two racecars and observe that in the picture both cars travel zero feet in zero seconds, that is no help in comparing the cars’ velocities.
But if you can find some contextual information to a zero-over-zero problem, like a sequence or a quotient of functions, then you are back in business.
In 1696 the Marquis de l’Hôpital published the first calculus text, in which was revealed the elegant and enduring rule that bears his name. Single-variable indeterminate limits were thus supplied with a go-to method of resolution.
However, methods for resolving indeterminate limits in several variables are not as universally established. A calculus textbook will usually tell how to prove certain multivariable limits do not exist, by restricting the domain to individual lines through the singular point and obtaining different limits. This is followed by a warning not to try a proof using the same method to claim that a limit does exist. A standard counterexample is as
, whose limit along each line through the origin is zero, but along the parabola
its limit is 1/2.
The student may then naturally ask what is the right way to prove existence of a multivariable limit, but may find this question addressed only partially. What about a l’Hôpital’s rule for multivariable functions? At one point the present author knew why such a rule was impossible; we will comment further on this below, where we will also acknowledge the vital contribution of an anonymous referee.
Papers by Dobrescu and Siclovan [Citation3] and Young [9, p. 71], both present a very specific version of l’Hôpital’s rule for a two-variable indeterminate limit resolvable by taking the mixed second derivative of the numerator and denominator functions.
A paper of Fine and Kass [Citation4] has a version using first-order derivatives, taking directional derivatives always in the direction toward the singular point. While this version provides an interesting perspective, it seems difficult to find examples where the rule simplifies the functions and resolves the limit.
Carter [Citation2] discusses when l’Hôpital’s rule does and does not work for complex-valued functions.
Kishka et al. [Citation5] prove that l’Hôpital’s rule works for matrix functions under certain circumstances; an example they give is that the limit of , as the n-by-n matrix X approaches the zero matrix, is the identity matrix.
There are some papers with a good treatment of the indeterminate limit of a quotient of a vector-valued function over a real-valued function, but these papers concern functions of a single variable. See the Rosenholtz paper [Citation7], as well as papers by Albrycht [Citation1], Popa [Citation6], and Waewski [Citation8] for l’Hôpital-style theorems of this type.
2 Features and difficulties
The expanded territory in the multivariable setting is both a feature and a challenge. On the one hand, it provides for a novel type of singularity not possible with single-variable functions. On the other hand, the potential difficulties will require side hypotheses as well as the extra step of working out which partial (and iterated partial) derivatives to include in the comparison.
Unlike the single-variable setting, a zero-over-zero singularity can be nonisolated, as with the function(1)
(1)
In such a case, when we take the limit we implicitly exclude the points where g = 0 from the domain.
This function has a limit of 1 as we approach the origin, as Theorem 4 will prove. The limit is suggested by the quotients
But we need to take more care than just examining the derivatives, as shown by the function(2)
(2) whose graph is shown in . Near zero, the numerators of (1) and (2) are very nearly the same, as are their denominators. The first partial derivatives might suggest that the limit of (2) is also 1, but this is not the case. In the second example, the set of points where
does not contain the set where
, so we can find points arbitrarily near the origin at which g is much closer to 0 than is f; thus, the quotient f/g is unbounded as we approach the origin.
This type of counterexample originally caused the author to believe that a l’Hôpital’s rule could not exist for multivariable functions. Not until later did the resolution of this problem present itself; we simply make the side hypothesis that the zero set of f contain that of g within a neighborhood of the singularity.
The author was also perplexed by examples with isolated singularities such as(3)
(3) see for the graph when k = 3. When k > 2, the limit at the origin is zero, and when
, the limit does not exist, as shown by approaching the origin along the curves
for different values of m. But which quotient(s) of partial derivatives would establish this result?
The author’s disbelief in the existence of a l’Hôpital’s rule for isolated singularities persisted for years after finding a result for the nonisolated case. Indeed, the first submission of the present article did not include a rule for isolated singularities. The author is most grateful for an anonymous referee’s reassuring disbelief in the nonexistence of that vital part of l’Hôpital’s rule, and his or her suggestion to look at different pathways toward the singular point. (Not lost on the author is the historical precedent for anonymity in contributing to l’Hôpital’s rules!)
The key turns out to be understanding which groups of iterated partial derivatives to divide and compare, and then finding the right curves along which to approach the singular point; see Theorem 5.
As with the nonisolated singularities, we must check one side hypothesis, as illustrated by the examplesee . This limit does not exist, as shown by examining the function restricted (in turn) to each of the lines y = x and
. The derivative quotients prescribed in our l’Hôpital’s rule below do not detect the subtlety inherent in this example. We resolve this by requiring as a side hypothesis that a Taylor polynomial of the denominator have an isolated zero at (0, 0).
3 l’Hôpital’s rule
Our l’Hôpital’s rule will have three parts, grouped into the two theorems of the present section, together covering a considerable range of zero-over-zero, indeterminate limits. The strategy of proof in all cases will be to apply the generalized mean value theorem to restrictions of the multivariate functions along certain paths, namely straight lines, paths of steepest ascent or descent of the denominator function , and paths parameterized using certain integer powers of t.
Note that the standard single-variable l’Hôpital’s rule includes cases that allow infinity to play various roles in the limits. In the present article, we omit infinite limits, leaving them for future research.
We begin by proving a result giving some basic control over the behavior of paths of steepest change, ensuring that they cannot wander too far.
Lemma 1.
Let h be a function on an open set
with
for some
and
in
except possibly at p. Then for any
there exists
such that for every
at which
is positive (respectively, negative), a path of steepest descent (respectively, ascent) of h from
cannot leave
before reaching the value
.
Proof.
We may restrict attention to sufficiently small so that the closure of the ball
is contained in
. Given such an
, let k > 0 be the minimum value of
in the annulus
. Then given any path of steepest ascent or descent of h beginning at a point closer to p than distance
, if the path could extend further from p than distance
, then
must change by at least
as the path passes through A. Thus, we need only choose
sufficiently small so that within
, and the result will follow.■
Definition 2.
Let for each i. Let
be the
-simplex in
with vertices on the axes at
. For any nonnegative integer lattice point
, let
be the differential operator that differentiates a function qi
times by each variable xi
.
For any with nonnegative integer entries we say α lies below
if α is not in S but lies on the same side of S as the origin. Let the simplicial Taylor polynomial
, centered at p, be the polynomial satisfying
for all
and
for all
.
We say a function g is dominant at p with respect to
if
the pure partial derivatives
are all nonzero,
the pure and mixed partial derivatives
are zero for all lattice points α below S, and
the simplicial Taylor polynomial
has an isolated zero at p, which will require in particular that each ki is an even integer.
We will use the notation in connection with Theorem 5 and its supporting Lemma 3. Elsewhere a simple subscript notation for partial derivatives will be more convenient.
Lemma 3.
Let be an open neighborhood of
and let
be a
function. Let
with
for each
, and let
for each i, where each mi is a free variable. Define
Then for any , the derivative satisfies
which equals
times the sum of terms of the Taylor polynomial of g corresponding to the vectors
that appear in the above sum, with each xi replaced by mi.
Proof.
We may assume that g is an analytic function, since the result only depends on the rules of differentiation, which will give the same results as long as g is sufficiently many times differentiable. For simpler notation assume p is the origin; otherwise we would subtract pi from each xi in what follows.
Let be a term of the power series for g. This term will only contribute to the jth derivative of
at t = 0 if
. Plugging in the values
, the term becomes
whose jth derivative at t = 0 is
(4)
(4)
On the other hand, if we differentiate by each xi qi
times, we get
(5)
(5)
Dividing (4) by (5) we obtain the desired expression.■
We are now ready to disprove the nonexistence of a l’Hôpital’s rule for multivariable functions.
Theorem 4
(l’Hôpital’s rule for multivariable functions, nonisolated singularities). Let f and g be functions defined in a neighborhood
of
. Suppose that within
, whenever
then
as well. Then
If any first partial derivative
is nonzero, then
If
only at
, then
if the right-hand side exists (and is a finite real number, not infinity) and is equal for all i; this last limit is taken over x such that
Proof.
First consider claim 1. Since the partial derivative is nonzero, we can restrict to a possibly smaller neighborhood of p in which the derivative is everywhere nonzero.
Let L be the line through p parallel to the xi
-axis, and let C be the level set g = 0. Again since is nonzero, L is transverse to C at p.
We need to know that lines parallel to L and sufficiently near L also cross C near p. Choose an angle θ strictly closer to than the angle between L and the normal
to C. Form a double cone K as the union of lines through p that make an angle of θ with
; see . Now the directional derivatives of g at p in the cone directions are all a positive constant for half the cone and its negative for the other half. Since g is
, in a small neighborhood B2 of p these directional derivatives have constant sign. There is a smaller neighborhood B3 of p such that all lines parallel to L that pass through B3 must pass through both halves of K within B2. These will therefore pass through points where g > 0 and points where g < 0; by the intermediate value theorem all such lines also pass, near p, through the zero set of g.
Now consider, as in , any sequence of points in B3 converging to p, with
for all j. Let
be the points guaranteed above, at which
and
is a scalar multiple of the coordinate vector ei
for each j. The points
also converge to p.
Now for each j, as in the proof of the single-variable l’Hôpital’s rule, apply the generalized mean value theorem to the quotient f/g restricted to the line L. We find that there is a point , lying in between
and
, such that
But now since the points also converge to p, the desired result follows.
Now consider claim 2 in the theorem. We will apply the generalized mean value theorem to f/g restricted to paths of quickest ascent or descent of g.
Let be the common value of the limits of
.
Given , choose
such that for all
and such that for all i satisfying
,
so that(6)
(6)
By Lemma 1 choose so that paths of steepest change of g beginning in
remain within
up until g = 0.
Take a point with
. Without loss of generality,
.
Let , be a parameterization of a path of fastest decrease of g from
. A priori it might happen that
has infinite length without
ever reaching zero. But we do know at least that g nears zero in finite time. For if
were to stay above some
, then
would have to stay outside some ball about the origin. But then by compactness and the fact that
would be bounded below by a positive quantity. This would force
to decrease to zero, a contradiction.
So goes as near to zero as we wish in finite time. Now if
did not also converge to zero, there would exist
and a sequence
with
but
for all i. Then by compactness, a subsequence of
would converge to some
, forcing
but
, a contradiction.
Now is a scalar multiple of
for each
. By the chain rule and the generalized mean value theorem, there exists
such that
But since and
can be made arbitrarily small, we can ensure that the quantity
is less than
. Then
Since the path x must stay within , by (6), the above expression is less than
, as required.■
Theorem 5
(l’Hôpital’s rule for multivariable functions, isolated singularities).
Let f and g be functions in a neighborhood
, with f and g equaling zero only at p. For each i let
be the smallest natural number such that the pure iterated partial
is nonzero. Let S be the simplex whose vertices are the points
. If g is dominant (Definition 2) then the limit of f/g exists and equals
, if
for all α lying in or below S.
Conversely, if for all α below S but there does not exist such a λ, then the limit of f/g does not exist.
Proof. A
s before, for simplicity’s sake assume that the singularity p is at the origin. Work within a closed ball inside
within which
is zero only at 0.
Assume first that such a λ does exist. We note that
Thus it will suffice to prove the theorem with a simplicial polynomial as the denominator; then λ will equal 1 when we apply the theorem to , and we can divide the limits of
and
to obtain the desired result. (The advantage is to make it easier to verify the hypothesis that the denominator have nonzero derivative when applying the Cauchy mean value theorem.)
Let L be the least common multiple of , and set
for each i. The key idea is to approach the origin along curves
(7)
(7) for
and
.
Given any starting point in
, by continuity we can choose
such that
Let ; then
, and the path (7) lies in
and passes through z when
.
Define
Apply the Cauchy mean value theorem to on the interval
. We obtain
for some . Repeat the application L – 1 more times, obtaining in the end that
for some
; the last equation follows because
is a simplicial polynomial and the denominator is a function only of x and is constant in t.
By Lemma 3, for fixed x the limit as of
equals λ; we need only be concerned about the uniformity of that limit.
Let W be the (positive) minimum value of the coefficient of tL
in , for x satisfying
. Then
Now by the (regular) mean value theorem, the numerator equals cL
times the st derivative of F evaluated at some
. So there is a constant X depending on finitely many derivatives of f, which in turn are bounded on
, such that
Since as
, the limit of this discrepancy is zero.■
4 Examples
We now give five examples whose analysis is straightforward given l’Hôpital’s rule, leaving the reader to resolve them. The sixth example is a bit fancier.
for each
.
Hint: The hypotheses of l’Hôpital’s rule are not satisfied in the last example because the points (6, 0), (2, 2), and (0, 6) are not collinear, but try imbedding the problem into three dimensions by setting z = xy and eliminating some of the occurrences of y.
Acknowledgments
The author is most grateful to the second referee for suggesting the existence of the rule for isolated singularities, and sketching an idea of how it might be done.
Additional information
Notes on contributors
Gary R. Lawlor
GARY LAWLOR is an associate professor at his alma mater, Brigham Young University, with a Ph.D. from Stanford. Previously he was an instructor and assistant professor at Princeton from 1988 to 1991. One of the principal moving forces in his research has been the insatiable hunger to know: Is there a simpler way?
He is a husband, dad, grandpa and BYU sports fan, and an avid genealogist and student of ancient and modern scripture, but has no idea how to apply l’Hôpital’s rule to any of these pursuits.
References
- Albrycht, J. (1951). L’Hôpital’s rule for vector-valued functions. Colloq. Math. 2: 176–177. DOI: 10.4064/cm-2-3-4-176-177.
- Carter, D. S. (1958). L’Hospital’s rule for complex-valued functions. Amer. Math. Monthly. 65(4): 264–266. DOI: 10.2307/2310244.
- Dobrescu, E., Siclovan, I. (1965). Considerations on functions of two variables. Analele Universitatii Timisoara Seria Stiinte Matematica-Fizica. 3: 109–121.
- Fine, A. I., Kass, S. (1966). Indeterminate forms for multi-place functions. Ann. Polon. Math. 18: 59–64. DOI: 10.4064/ap-18-1-59-64.
- Kishka, Z. M., Abul-Ez, M., Saleem, M., Abd-Elmageed, H. (2013). L’Hospital rule for matrix functions. J. Egyptian Math. Soc. 21(2): 115–118. DOI: 10.1016/j.joems.2013.01.007.
- Popa, D. (1999). On the vector form of the Lagrange formula, the Darboux property and l’Hôpital’s rule (English summary). Real Anal. Exchange. 25(2): 787–793. DOI: 10.2307/44154034.
- Rosenholtz, I. (1991). A topological mean value theorem for the plane. Amer. Math. Monthly. 98(2): 149–154. DOI: 10.2307/2323948.
- Wazewski, T. (1951). Une généralisation des théorèmes sur les accroissements finis au cas des espaces de Banach et application a la généralisation du théorème de l’Hôpital. Ann. Soc. Polon. Math. 24(2): 132–147.
- Young, W. H. (1910). On indeterminate forms. Proc. Lond. Math. Soc. 2(8): 40–76. DOI: 10.1112/plms/s2-8.1.40.