What does it mean to differentiate a complex function?
If we have a complex function, f:C→C, then we can try to define a derivative by analogy with the real version: we say that f has derivative f′(z) at z if limh→0f(z+h)−f(z)h=f′(z).
The catch is that now h is a complex number. For this to make sense, it must give the same answer no matter how h→0. The usual thing is then to insist that you get the same answer if h is pure real or pure imaginary, and find some constraints that have to be satisfied: these are the Cauchy-Riemann equations.
If you're of a suspicious turn of mind, you might ask why it's enough for it to work for h pure real or pure imaginary: that makes h approach 0 along one of two rather special lines. What if h doesn't do that? Also, this argument shows that if f is complex differentiable, it must satisfy the Cauchy-Riemann equations: but it doesn't tell us that if the Cauchy-Riemann equations are satisfied, then f must actuallly be complex differentiable.
But there's another approach, which I think has a few advantages.
- It's based on a general definition of derivative, rather than guessing something that 'looks like' the real (variable, that is) derivative.
- It doesn't relay on h approaching 0 in any particular way.
- It shows that the Cauchy-Riemann equations are necessary and sufficient for complex differentiability.
- I just think it's prettier.
So, let's recall what the derivative really is: if you have a function ff:Rn→Rm, soff is a vector with components f1…fm, then ff is differentiable at xx with derivative Dff(x) (an m×n matrix) if ff(xx+hh)=ff(xx)+Dff(xx)hh+o(‖hh‖).
Then the i,j element of Dff(x) is ∂fi/∂xj.
In the case where ff:R2→R2, we can be much more explicit than that. We write ff(xx)=(u(x,y)v(x,y)) where xx=(xy) and then Dff=(∂u∂x∂u∂y∂v∂x∂v∂y) has the property that (u(x+hx,y+hy)v(x+hx,y+hy))=(u(x,y)v(x,y))+(∂u∂x∂u∂y∂v∂x∂v∂y)(hxhy)+o(√h2x+h2y)=(u(x,y)v(x,y))+(∂u∂xhx+∂u∂yhy∂v∂xhx+∂v∂yhy)+o(√h2x+h2y).
But now we can define the complex number z=x+iy, and the complex function f:C→C by f(x+iy)=u(x,y)+iv(x,y). So f will be differentiable at z with derivative f′(z) if f(z+h)=f(z)+f′(z)h+o(|h|)=f(z)+f′(z)h+o(√h2x+h2y) where h=hx+ihy.
So what is this f′(z)?
Let's call it dx+idy, so that dx and dy are the real and imaginary parts of f′(z) respectively. Then we have f′(z)h=(dx+idy)(hx+ihy)=dxhx−dyhy+i(dyhx+dxhy).
So now we have two different ways of describing the same function: once as a function R2→R2, and once as a function C→C. But it's the same function, both times, so the derivative must really be the same. In other words, since the expressions must match for all possible hx and hy, ff can be thought of as a complex differentiable function if and only if ∂u∂xhx+∂u∂yhy=dxhx−dyhy∂v∂xhx+∂v∂yhy=dyhx+dxhy.
Matching these up, then, we have the conditions dx=∂u∂x=∂v∂ydy=∂v∂x=−∂u∂y.
And putting it all together, we see that the real function given by (x,y)→(u(x,y),v(x,y)) can be thought of as a complex differentiable function x+iy→u(x,y)+iv(x,y) if and only if u and v satisfy the Cauchy-Riemann equations ∂u∂x=∂v∂y∂v∂x=−∂u∂y.
I, at least, find this more satisfying.