User:IssaRice/Linear algebra/Determinant as signed volume of transformation

From Machinelearning
Revision as of 04:35, 28 December 2019 by IssaRice (talk | contribs) (→‎References)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Let f:RnRn be some function (not necessarily linear) and let ΩRn be some region in space. we will assume we can assign some "volume" to Ω, e.g. by cutting it up into little cubes and adding up the volumes of the cubes (the volume of a cube is just the product of its edge lengths).

since f takes this space to itself, the image of Ω under f, denoted f(Ω), is another region in space. let's assume f is nice enough that we can assign a volume to f(Ω). we can now ask, what is the volume of f(Ω)? is it related to the volume of Ω somehow? does the volume change if we translate Ω, stretch it, rotate it, etc.?

in general, i don't think we can say anything too interesting here. (consider a quadratic function, where distance to the origin changes how much the volume changes.) but we can restrict attention to the following functions: for all xRn and all ΩRn, volf(x+Ω)=volf(Ω). in other words, f alters volume "globally" in the sense that no matter where you place Ω in space, the deformed volume is the same. now write Ω as a disjoint union of tiny cubes + some even smaller almost-cubes. by the property of f, these tiny cubes and almost-cubes also get deformed in such a way that their deformed volumes are the same wherever they are. but... the different cubes are simply translations of each other! so we really just need to know how f acts on one of the cubes. then we move it around inside Ω -- each of these are just translations of the original tiny cube. by the property of f, the images of these translated cubes will have the same volume as the image of the original cube. so we can summarize f by assigning it a number that tells us by what factor it deforms a cube. let's call this the volume of f, volf. to be concrete, we can take the unit cube, {(x1,,xn):0xj1 for each j}, and say that volf is the volume of the image of it under f. (there is no risk of confusion, since we've previously only assigned volumes to regions of space; so we are just overloading the notion for a different kind of object.)

for such a function f, we can write volf(Ω)=(volf)(volΩ) for every Ω.

now let f,g:RnRn both have this property. can we assign a volume to fg? well, volf(Ω)=(volf)(volΩ) for every Ω. in particular, it's true for the region g(Ω). so we get volf(g(Ω))=(volf)(volg(Ω)). and we know volg(Ω)=(volg)(volΩ) since g also has this property. so we end up with vol(fg)(Ω)=(volf)(volg)(volΩ). so vol(fg)=(volf)(volg).

(actually, we should also check that fg has the volume-invariance-under-translations property)

now let's investigate what kinds of functions have the property for f mentioned above. one important class of functions that transform space is linear maps. let T:RnRn be a linear map (operator). if we assume volume is invariant under transformations, then for any x and Ω we have T(x+Ω)=T({x+y:yΩ})={T(x+y):yΩ}={Tx+Ty:yΩ}=Tx+T(Ω) Thus volT(x+Ω)=volT(Ω).

the interesting thing about linear maps is that we can summarize their entire behavior using an array of numbers. and volume is also a summary of the map, which must be some function of the array of numbers. so we can find a "volume formula" as a function of the numbers which make up the array.

References