Adjoint functors (original) (raw)

Relationship between two functors abstracting many common constructions

In mathematics, specifically category theory, adjunction is a relationship that two functors may exhibit, intuitively corresponding to a weak form of equivalence between two related categories. Two functors that stand in this relationship are known as adjoint functors, one being the left adjoint and the other the right adjoint. Pairs of adjoint functors are ubiquitous in mathematics and often arise from constructions of "optimal solutions" to certain problems (i.e., constructions of objects having a certain universal property), such as the construction of a free group on a set in algebra, or the construction of the Stone–Čech compactification of a topological space in topology.

By definition, an adjunction between categories C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} and D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}} is a pair of functors (assumed to be covariant)

F : D → C {\displaystyle F:{\mathcal {D}}\rightarrow {\mathcal {C}}} {\displaystyle F:{\mathcal {D}}\rightarrow {\mathcal {C}}} and G : C → D {\displaystyle G:{\mathcal {C}}\rightarrow {\mathcal {D}}} {\displaystyle G:{\mathcal {C}}\rightarrow {\mathcal {D}}}

and, for all objects c {\displaystyle c} {\displaystyle c} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} and d {\displaystyle d} {\displaystyle d} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}, a bijection between the respective morphism sets

h o m C ( F d , c ) ≅ h o m D ( d , G c ) {\displaystyle \mathrm {hom} _{\mathcal {C}}(Fd,c)\cong \mathrm {hom} _{\mathcal {D}}(d,Gc)} {\displaystyle \mathrm {hom} _{\mathcal {C}}(Fd,c)\cong \mathrm {hom} _{\mathcal {D}}(d,Gc)}

such that this family of bijections is natural in c {\displaystyle c} {\displaystyle c} and d {\displaystyle d} {\displaystyle d}. Naturality here means that there are natural isomorphisms between the pair of functors C ( F − , c ) : D → S e t op {\displaystyle {\mathcal {C}}(F-,c):{\mathcal {D}}\to \mathrm {Set^{\text{op}}} } {\displaystyle {\mathcal {C}}(F-,c):{\mathcal {D}}\to \mathrm {Set^{\text{op}}} } and D ( − , G c ) : D → S e t op {\displaystyle {\mathcal {D}}(-,Gc):{\mathcal {D}}\to \mathrm {Set^{\text{op}}} } {\displaystyle {\mathcal {D}}(-,Gc):{\mathcal {D}}\to \mathrm {Set^{\text{op}}} } for a fixed c {\displaystyle c} {\displaystyle c} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}}, and also the pair of functors C ( F d , − ) : C → S e t {\displaystyle {\mathcal {C}}(Fd,-):{\mathcal {C}}\to \mathrm {Set} } {\displaystyle {\mathcal {C}}(Fd,-):{\mathcal {C}}\to \mathrm {Set} } and D ( d , G − ) : C → S e t {\displaystyle {\mathcal {D}}(d,G-):{\mathcal {C}}\to \mathrm {Set} } {\displaystyle {\mathcal {D}}(d,G-):{\mathcal {C}}\to \mathrm {Set} } for a fixed d {\displaystyle d} {\displaystyle d} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}.

The functor F {\displaystyle F} {\displaystyle F} is called a left adjoint functor or left adjoint to G {\displaystyle G} {\displaystyle G}, while G {\displaystyle G} {\displaystyle G} is called a right adjoint functor or right adjoint to F {\displaystyle F} {\displaystyle F}. We write F ⊣ G {\displaystyle F\dashv G} {\displaystyle F\dashv G}.

An adjunction between categories C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} and D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}} is somewhat akin to a "weak form" of an equivalence between C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} and D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}, and indeed every equivalence is an adjunction. In many situations, an adjunction can be "upgraded" to an equivalence, by a suitable natural modification of the involved categories and functors.

Terminology and notation

[edit]

The terms adjoint and adjunct are both used, and are cognates: one is taken directly from Latin, the other from Latin via French. In the classic text Categories for the Working Mathematician, Mac Lane makes a distinction between the two. Given a family

φ c d : h o m C ( F d , c ) ≅ h o m D ( d , G c ) {\displaystyle \varphi _{cd}:\mathrm {hom} _{\mathcal {C}}(Fd,c)\cong \mathrm {hom} _{\mathcal {D}}(d,Gc)} {\displaystyle \varphi _{cd}:\mathrm {hom} _{\mathcal {C}}(Fd,c)\cong \mathrm {hom} _{\mathcal {D}}(d,Gc)}

of hom-set bijections, we call φ {\displaystyle \varphi } {\displaystyle \varphi } an adjunction or an adjunction between F {\displaystyle F} {\displaystyle F} and G {\displaystyle G} {\displaystyle G}. If f {\displaystyle f} {\displaystyle f} is an arrow in h o m C ( F d , c ) {\displaystyle \mathrm {hom} _{\mathcal {C}}(Fd,c)} {\displaystyle \mathrm {hom} _{\mathcal {C}}(Fd,c)}, φ f {\displaystyle \varphi f} {\displaystyle \varphi f} is the right adjunct of f {\displaystyle f} {\displaystyle f} (p. 81). The functor F {\displaystyle F} {\displaystyle F} is left adjoint to G {\displaystyle G} {\displaystyle G}, and G {\displaystyle G} {\displaystyle G} is right adjoint to F {\displaystyle F} {\displaystyle F}. (Note that G {\displaystyle G} {\displaystyle G} may have itself a right adjoint that is quite different from F {\displaystyle F} {\displaystyle F}; see below for an example.)

In general, the phrases " F {\displaystyle F} {\displaystyle F} is a left adjoint" and " F {\displaystyle F} {\displaystyle F} has a right adjoint" are equivalent. We call F {\displaystyle F} {\displaystyle F} a left adjoint because it is applied to the left argument of h o m C {\displaystyle \mathrm {hom} _{\mathcal {C}}} {\displaystyle \mathrm {hom} _{\mathcal {C}}}, and G {\displaystyle G} {\displaystyle G} a right adjoint because it is applied to the right argument of h o m D {\displaystyle \mathrm {hom} _{\mathcal {D}}} {\displaystyle \mathrm {hom} _{\mathcal {D}}}.

If F is left adjoint to G, we also write

F ⊣ G . {\displaystyle F\dashv G.} {\displaystyle F\dashv G.}

The terminology comes from the Hilbert space idea of adjoint operators T {\displaystyle T} {\displaystyle T}, U {\displaystyle U} {\displaystyle U} with ⟨ T y , x ⟩ = ⟨ y , U x ⟩ {\displaystyle \langle Ty,x\rangle =\langle y,Ux\rangle } {\displaystyle \langle Ty,x\rangle =\langle y,Ux\rangle }, which is formally similar to the above relation between hom-sets. The analogy to adjoint maps of Hilbert spaces can be made precise in certain contexts.[1]

Introduction and motivation

[edit]

The slogan is "Adjoint functors arise everywhere".

Common mathematical constructions are very often adjoint functors. Consequently, general theorems about left/right adjoint functors encode the details of many useful and otherwise non-trivial results. Such general theorems include the equivalence of the various definitions of adjoint functors, the uniqueness of a right adjoint for a given left adjoint, the fact that left/right adjoint functors respectively preserve colimits/limits (which are also found in every area of mathematics), and the general adjoint functor theorems giving conditions under which a given functor is a left/right adjoint.

Solutions to optimization problems

[edit]

In a sense, an adjoint functor is a way of giving the most efficient solution to some problem via a method that is formulaic. For example, an elementary problem in ring theory is how to turn a rng (which is like a ring that might not have a multiplicative identity) into a ring. The most efficient way is to adjoin an element '1' to the rng, adjoin all (and only) the elements that are necessary for satisfying the ring axioms (e.g. r+1 for each r in the ring), and impose no relations in the newly formed ring that are not forced by axioms. Moreover, this construction is formulaic in the sense that it works in essentially the same way for any rng.

This is rather vague, though suggestive, and can be made precise in the language of category theory: a construction is most efficient if it satisfies a universal property, and is formulaic if it defines a functor. Universal properties come in two types: initial properties and terminal properties. Since these are dual notions, it is only necessary to discuss one of them.

The idea of using an initial property is to set up the problem in terms of some auxiliary category E, so that the problem at hand corresponds to finding an initial object of E. This has an advantage that the _optimization_—the sense that the process finds the most efficient solution—means something rigorous and recognisable, rather like the attainment of a supremum. The category E is also formulaic in this construction, since it is always the category of elements of the functor to which one is constructing an adjoint.

Back to our example: take the given rng R, and make a category E whose objects are rng homomorphisms RS, with S a ring having a multiplicative identity. The morphisms in E between R → _S_1 and R → _S_2 are commutative triangles of the form (R → _S_1, R → _S_2, _S_1 → _S_2) where S1 → S2 is a ring map (which preserves the identity). (Note that this is precisely the definition of the comma category of R over the inclusion of unitary rings into rng.) The existence of a morphism between R → _S_1 and R → _S_2 implies that _S_1 is at least as efficient a solution as _S_2 to our problem: _S_2 can have more adjoined elements and/or more relations not imposed by axioms than _S_1. Therefore, the assertion that an object RR* is initial in E, that is, that there is a morphism from it to any other element of E, means that the ring R* is a most efficient solution to our problem.

The two facts that this method of turning rngs into rings is most efficient and formulaic can be expressed simultaneously by saying that it defines an adjoint functor. More explicitly: Let F denote the above process of adjoining an identity to a rng, so F(R)=R*. Let G denote the process of “forgetting″ whether a ring S has an identity and considering it simply as a rng, so essentially G(S)=S. Then F is the left adjoint functor of G.

Note however that we haven't actually constructed R* yet; it is an important and not altogether trivial algebraic fact that such a left adjoint functor RR* actually exists.

Symmetry of optimization problems

[edit]

It is also possible to start with the functor F, and pose the following (vague) question: is there a problem to which F is the most efficient solution?

The notion that F is the most efficient solution to the problem posed by G is, in a certain rigorous sense, equivalent to the notion that G poses the most difficult problem that F solves.

This gives the intuition behind the fact that adjoint functors occur in pairs: if F is left adjoint to G, then G is right adjoint to F.

There are various equivalent definitions for adjoint functors:

The equivalency of these definitions is quite useful. Adjoint functors arise everywhere, in all areas of mathematics. Since the structure in any of these definitions gives rise to the structures in the others, switching between them makes implicit use of many details that would otherwise have to be repeated separately in every subject area.

The theory of adjoints has the terms left and right at its foundation, and there are many components that live in one of two categories C and D that are under consideration. Therefore it can be helpful to choose letters in alphabetical order according to whether they live in the "lefthand" category C or the "righthand" category D, and also to write them down in this order whenever possible.

In this article for example, the letters X, F, f, ε will consistently denote things that live in the category C, the letters Y, G, g, η will consistently denote things that live in the category D, and whenever possible such things will be referred to in order from left to right (a functor F : DC can be thought of as "living" where its outputs are, in C). If the arrows for the left adjoint functor F were drawn they would be pointing to the left; if the arrows for the right adjoint functor G were drawn they would be pointing to the right.

Definition via universal morphisms

[edit]

By definition, a functor F : D → C {\displaystyle F:{\mathcal {D}}\to {\mathcal {C}}} {\displaystyle F:{\mathcal {D}}\to {\mathcal {C}}} is a left adjoint functor if for each object X {\displaystyle X} {\displaystyle X} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} there exists a universal morphismfrom F {\displaystyle F} {\displaystyle F} to X {\displaystyle X} {\displaystyle X}. Spelled out, this means that for each object X {\displaystyle X} {\displaystyle X} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}} there exists an object G ( X ) {\displaystyle G(X)} {\displaystyle G(X)} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}} and a morphism ϵ X : F ( G ( X ) ) → X {\displaystyle \epsilon _{X}:F(G(X))\to X} {\displaystyle \epsilon _{X}:F(G(X))\to X} such that for every object Y {\displaystyle Y} {\displaystyle Y} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}} and every morphism f : F ( Y ) → X {\displaystyle f:F(Y)\to X} {\displaystyle f:F(Y)\to X} there exists a unique morphism g : Y → G ( X ) {\displaystyle g:Y\to G(X)} {\displaystyle g:Y\to G(X)} with ϵ X ∘ F ( g ) = f {\displaystyle \epsilon _{X}\circ F(g)=f} {\displaystyle \epsilon _{X}\circ F(g)=f}.

The latter equation is expressed by the following commutative diagram:

Here the counit is a universal morphism.

Here the counit is a universal morphism.

In this situation, one can show that G {\displaystyle G} {\displaystyle G} can be turned into a functor G : C → D {\displaystyle G:{\mathcal {C}}\to {\mathcal {D}}} {\displaystyle G:{\mathcal {C}}\to {\mathcal {D}}} in a unique way such that ϵ X ∘ F ( G ( f ) ) = f ∘ ϵ X ′ {\displaystyle \epsilon _{X}\circ F(G(f))=f\circ \epsilon _{X'}} {\displaystyle \epsilon _{X}\circ F(G(f))=f\circ \epsilon _{X'}}for all morphisms f : X ′ → X {\displaystyle f:X'\to X} {\displaystyle f:X'\to X} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}}; F {\displaystyle F} {\displaystyle F} is then called a left adjoint to G {\displaystyle G} {\displaystyle G}.

Similarly, we may define right-adjoint functors. A functor G : C → D {\displaystyle G:{\mathcal {C}}\to {\mathcal {D}}} {\displaystyle G:{\mathcal {C}}\to {\mathcal {D}}} is a right adjoint functor if for each object Y {\displaystyle Y} {\displaystyle Y} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}, there exists a universal morphism from Y {\displaystyle Y} {\displaystyle Y} to G {\displaystyle G} {\displaystyle G}. Spelled out, this means that for each object Y {\displaystyle Y} {\displaystyle Y} in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}, there exists an object F ( Y ) {\displaystyle F(Y)} {\displaystyle F(Y)} in C {\displaystyle C} {\displaystyle C} and a morphism η Y : Y → G ( F ( Y ) ) {\displaystyle \eta _{Y}:Y\to G(F(Y))} {\displaystyle \eta _{Y}:Y\to G(F(Y))} such that for every object X {\displaystyle X} {\displaystyle X} in C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}}and every morphism g : Y → G ( X ) {\displaystyle g:Y\to G(X)} {\displaystyle g:Y\to G(X)} there exists a unique morphism f : F ( Y ) → X {\displaystyle f:F(Y)\to X} {\displaystyle f:F(Y)\to X} with G ( f ) ∘ η Y = g {\displaystyle G(f)\circ \eta _{Y}=g} {\displaystyle G(f)\circ \eta _{Y}=g}.

The existence of the unit, a universal morphism, can prove the existence of an adjunction.

The existence of the unit, a universal morphism, can prove the existence of an adjunction.

Again, this F {\displaystyle F} {\displaystyle F} can be uniquely turned into a functor F : D → C {\displaystyle F:{\mathcal {D}}\to {\mathcal {C}}} {\displaystyle F:{\mathcal {D}}\to {\mathcal {C}}} such that G ( F ( g ) ) ∘ η Y = η Y ′ ∘ g {\displaystyle G(F(g))\circ \eta _{Y}=\eta _{Y'}\circ g} {\displaystyle G(F(g))\circ \eta _{Y}=\eta _{Y'}\circ g} for g : Y → Y ′ {\displaystyle g:Y\to Y'} {\displaystyle g:Y\to Y'} a morphism in D {\displaystyle {\mathcal {D}}} {\displaystyle {\mathcal {D}}}; G {\displaystyle G} {\displaystyle G} is then called a right adjoint to F {\displaystyle F} {\displaystyle F}.

It is true, as the terminology implies, that F {\displaystyle F} {\displaystyle F} is left adjoint to G {\displaystyle G} {\displaystyle G} if and only if G {\displaystyle G} {\displaystyle G} is right adjoint to F {\displaystyle F} {\displaystyle F}.

These definitions via universal morphisms are often useful for establishing that a given functor is left or right adjoint, because they are minimalistic in their requirements. They are also intuitively meaningful in that finding a universal morphism is like solving an optimization problem.

Definition via Hom-set adjunction

[edit]

A hom-set adjunction between two categories C and D consists of two functors F : DC and G : CD and a natural isomorphism

Φ : h o m C ( F − , − ) → h o m D ( − , G − ) {\displaystyle \Phi :\mathrm {hom} _{C}(F-,-)\to \mathrm {hom} _{D}(-,G-)} {\displaystyle \Phi :\mathrm {hom} _{C}(F-,-)\to \mathrm {hom} _{D}(-,G-)}.

This specifies a family of bijections

Φ Y , X : h o m C ( F Y , X ) → h o m D ( Y , G X ) {\displaystyle \Phi _{Y,X}:\mathrm {hom} _{C}(FY,X)\to \mathrm {hom} _{D}(Y,GX)} {\displaystyle \Phi _{Y,X}:\mathrm {hom} _{C}(FY,X)\to \mathrm {hom} _{D}(Y,GX)}

for all objects X in C and Y in D.

In this situation, F is left adjoint to G and G is right adjoint to F .

This definition is a logical compromise in that it is more difficult to satisfy than the universal morphism definitions, and has fewer immediate implications than the counit–unit definition. It is useful because of its obvious symmetry, and as a stepping-stone between the other definitions.

In order to interpret Φ as a natural isomorphism, one must recognize hom_C_(F_–, –) and hom_D(–, _G_–) as functors. In fact, they are both bifunctors from _D_op × C to Set (the category of sets). For details, see the article on hom functors. Explicitly, the naturality of Φ means that for all morphisms f : XX′ in C and all morphisms g : _Y_′ → Y in D the following diagram commutes:

Naturality of Φ

Naturality of Φ

The vertical arrows in this diagram are those induced by composition. Formally, Hom(Fg, f) : HomC(FY, X) → HomC(FY′, X′) is given by hf o h o Fg for each h in HomC(FY, X). Hom(g, Gf) is similar.

Definition via counit–unit adjunction

[edit]

A counit–unit adjunction between two categories C and D consists of two functors F : DC and G : CD and two natural transformations

ε : F G → 1 C η : 1 D → G F {\displaystyle {\begin{aligned}\varepsilon &:FG\to 1_{\mathcal {C}}\\\eta &:1_{\mathcal {D}}\to GF\end{aligned}}} {\displaystyle {\begin{aligned}\varepsilon &:FG\to 1_{\mathcal {C}}\\\eta &:1_{\mathcal {D}}\to GF\end{aligned}}}

respectively called the counit and the unit of the adjunction (terminology from universal algebra), such that the compositions

F → F η F G F → ε F F {\displaystyle F{\xrightarrow {\;F\eta \;}}FGF{\xrightarrow {\;\varepsilon F\,}}F} {\displaystyle F{\xrightarrow {\;F\eta \;}}FGF{\xrightarrow {\;\varepsilon F\,}}F}

G → η G G F G → G ε G {\displaystyle G{\xrightarrow {\;\eta G\;}}GFG{\xrightarrow {\;G\varepsilon \,}}G} {\displaystyle G{\xrightarrow {\;\eta G\;}}GFG{\xrightarrow {\;G\varepsilon \,}}G}

are the identity transformations 1_F_ and 1_G_ on F and G respectively.

In this situation we say that F is left adjoint to G and G is right adjoint to F , and may indicate this relationship by writing ( ε , η ) : F ⊣ G {\displaystyle (\varepsilon ,\eta ):F\dashv G} {\displaystyle (\varepsilon ,\eta ):F\dashv G} , or simply F ⊣ G {\displaystyle F\dashv G} {\displaystyle F\dashv G} .

In equation form, the above conditions on (ε,η) are the counit–unit equations

1 F = ε F ∘ F η 1 G = G ε ∘ η G {\displaystyle {\begin{aligned}1_{F}&=\varepsilon F\circ F\eta \\1_{G}&=G\varepsilon \circ \eta G\end{aligned}}} {\displaystyle {\begin{aligned}1_{F}&=\varepsilon F\circ F\eta \\1_{G}&=G\varepsilon \circ \eta G\end{aligned}}}

which mean that for each X in C and each Y in D,

1 F Y = ε F Y ∘ F ( η Y ) 1 G X = G ( ε X ) ∘ η G X {\displaystyle {\begin{aligned}1_{FY}&=\varepsilon _{FY}\circ F(\eta _{Y})\\1_{GX}&=G(\varepsilon _{X})\circ \eta _{GX}\end{aligned}}} {\displaystyle {\begin{aligned}1_{FY}&=\varepsilon _{FY}\circ F(\eta _{Y})\\1_{GX}&=G(\varepsilon _{X})\circ \eta _{GX}\end{aligned}}}.

Note that 1 C {\displaystyle 1_{\mathcal {C}}} {\displaystyle 1_{\mathcal {C}}} denotes the identify functor on the category C {\displaystyle {\mathcal {C}}} {\displaystyle {\mathcal {C}}}, 1 F {\displaystyle 1_{F}} {\displaystyle 1_{F}} denotes the identity natural transformation from the functor F to itself, and 1 F Y {\displaystyle 1_{FY}} {\displaystyle 1_{FY}} denotes the identity morphism of the object FY.

String diagram for adjunction.

These equations are useful in reducing proofs about adjoint functors to algebraic manipulations. They are sometimes called the triangle identities, or sometimes the zig-zag equations because of the appearance of the corresponding string diagrams. A way to remember them is to first write down the nonsensical equation 1 = ε ∘ η {\displaystyle 1=\varepsilon \circ \eta } {\displaystyle 1=\varepsilon \circ \eta } and then fill in either F or G in one of the two simple ways that make the compositions defined.

Note: The use of the prefix "co" in counit here is not consistent with the terminology of limits and colimits, because a colimit satisfies an initial property whereas the counit morphisms will satisfy terminal properties, and dually. The term unit here is borrowed from the theory of monads where it looks like the insertion of the identity 1 into a monoid.

The idea of adjoint functors was introduced by Daniel Kan in 1958.[2] Like many of the concepts in category theory, it was suggested by the needs of homological algebra, which was at the time devoted to computations. Those faced with giving tidy, systematic presentations of the subject would have noticed relations such as

hom(F(X), Y) = hom(X, G(Y))

in the category of abelian groups, where F was the functor − ⊗ A {\displaystyle -\otimes A} {\displaystyle -\otimes A} (i.e. take the tensor product with A), and G was the functor hom(A,–) (this is now known as the tensor-hom adjunction). The use of the equals sign is an abuse of notation; those two groups are not really identical but there is a way of identifying them that is natural. It can be seen to be natural on the basis, firstly, that these are two alternative descriptions of the bilinear mappings from X × A to Y. That is, however, something particular to the case of tensor product. In category theory the 'naturality' of the bijection is subsumed in the concept of a natural isomorphism.

The construction of free groups is a common and illuminating example.

Let F : SetGrp be the functor assigning to each set Y the free group generated by the elements of Y, and let G : GrpSet be the forgetful functor, which assigns to each group X its underlying set. Then F is left adjoint to G:

Initial morphisms. For each set Y, the set GFY is just the underlying set of the free group FY generated by Y. Let η Y : Y → G F Y {\displaystyle \eta _{Y}:Y\to GFY} {\displaystyle \eta _{Y}:Y\to GFY} be the set map given by "inclusion of generators". This is an initial morphism from Y to G, because any set map from Y to the underlying set GW of some group W will factor through η Y : Y → G F Y {\displaystyle \eta _{Y}:Y\to GFY} {\displaystyle \eta _{Y}:Y\to GFY} via a unique group homomorphism from FY to W. This is precisely the universal property of the free group on Y.

Terminal morphisms. For each group X, the group FGX is the free group generated freely by GX, the elements of X. Let ε X : F G X → X {\displaystyle \varepsilon _{X}:FGX\to X} {\displaystyle \varepsilon _{X}:FGX\to X} be the group homomorphism that sends the generators of FGX to the elements of X they correspond to, which exists by the universal property of free groups. Then each ( G X , ε X ) {\displaystyle (GX,\varepsilon _{X})} {\displaystyle (GX,\varepsilon _{X})} is a terminal morphism from F to X, because any group homomorphism from a free group FZ to X will factor through ε X : F G X → X {\displaystyle \varepsilon _{X}:FGX\to X} {\displaystyle \varepsilon _{X}:FGX\to X} via a unique set map from Z to GX. This means that (F,G) is an adjoint pair.

Hom-set adjunction. Group homomorphisms from the free group FY to a group X correspond precisely to maps from the set Y to the set GX: each homomorphism from FY to X is fully determined by its action on generators, another restatement of the universal property of free groups. One can verify directly that this correspondence is a natural transformation, which means it is a hom-set adjunction for the pair (F,G).

counit–unit adjunction. One can also verify directly that ε and η are natural. Then, a direct verification that they form a counit–unit adjunction ( ε , η ) : F ⊣ G {\displaystyle (\varepsilon ,\eta ):F\dashv G} {\displaystyle (\varepsilon ,\eta ):F\dashv G} is as follows:

The first counit–unit equation 1 F = ε F ∘ F η {\displaystyle 1_{F}=\varepsilon F\circ F\eta } {\displaystyle 1_{F}=\varepsilon F\circ F\eta } says that for each set Y the composition

F Y → F ( η Y ) F G F Y → ε F Y F Y {\displaystyle FY{\xrightarrow {\;F(\eta _{Y})\;}}FGFY{\xrightarrow {\;\varepsilon _{FY}\,}}FY} {\displaystyle FY{\xrightarrow {\;F(\eta _{Y})\;}}FGFY{\xrightarrow {\;\varepsilon _{FY}\,}}FY}

should be the identity. The intermediate group FGFY is the free group generated freely by the words of the free group FY. (Think of these words as placed in parentheses to indicate that they are independent generators.) The arrow F ( η Y ) {\displaystyle F(\eta _{Y})} {\displaystyle F(\eta _{Y})} is the group homomorphism from FY into FGFY sending each generator y of FY to the corresponding word of length one (y) as a generator of FGFY. The arrow ε F Y {\displaystyle \varepsilon _{FY}} {\displaystyle \varepsilon _{FY}} is the group homomorphism from FGFY to FY sending each generator to the word of FY it corresponds to (so this map is "dropping parentheses"). The composition of these maps is indeed the identity on FY.

The second counit–unit equation 1 G = G ε ∘ η G {\displaystyle 1_{G}=G\varepsilon \circ \eta G} {\displaystyle 1_{G}=G\varepsilon \circ \eta G} says that for each group X the composition

G X → η G X G F G X → G ( ε X ) G X {\displaystyle GX{\xrightarrow {\;\eta _{GX}\;}}GFGX{\xrightarrow {\;G(\varepsilon _{X})\,}}GX} {\displaystyle GX{\xrightarrow {\;\eta _{GX}\;}}GFGX{\xrightarrow {\;G(\varepsilon _{X})\,}}GX}

should be the identity. The intermediate set GFGX is just the underlying set of FGX. The arrow η G X {\displaystyle \eta _{GX}} {\displaystyle \eta _{GX}} is the "inclusion of generators" set map from the set GX to the set GFGX. The arrow G ( ε X ) {\displaystyle G(\varepsilon _{X})} {\displaystyle G(\varepsilon _{X})} is the set map from GFGX to GX, which underlies the group homomorphism sending each generator of FGX to the element of X it corresponds to ("dropping parentheses"). The composition of these maps is indeed the identity on GX.

Free constructions and forgetful functors

[edit]

Free objects are all examples of a left adjoint to a forgetful functor, which assigns to an algebraic object its underlying set. These algebraic free functors have generally the same description as in the detailed description of the free group situation above.

Diagonal functors and limits

[edit]

Products, fibred products, equalizers, and kernels are all examples of the categorical notion of a limit. Any limit functor is right adjoint to a corresponding diagonal functor (provided the category has the type of limits in question), and the counit of the adjunction provides the defining maps from the limit object (i.e. from the diagonal functor on the limit, in the functor category). Below are some specific examples.

The cartesian product of sets, the product of rings, the product of topological spaces etc. follow the same pattern; it can also be extended in a straightforward manner to more than just two factors. More generally, any type of limit is right adjoint to a diagonal functor.

A suitable variation of this example also shows that the kernel functors for vector spaces and for modules are right adjoints. Analogously, one can show that the cokernel functors for abelian groups, vector spaces and modules are left adjoints.

Colimits and diagonal functors

[edit]

Coproducts, fibred coproducts, coequalizers, and cokernels are all examples of the categorical notion of a colimit. Any colimit functor is left adjoint to a corresponding diagonal functor (provided the category has the type of colimits in question), and the unit of the adjunction provides the defining maps into the colimit object. Below are some specific examples.

Analogous examples are given by the direct sum of vector spaces and modules, by the free product of groups and by the disjoint union of sets.

Every partially ordered set can be viewed as a category (where the elements of the poset become the category's objects and we have a single morphism from x to y if and only if xy). A pair of adjoint functors between two partially ordered sets is called a Galois connection (or, if it is contravariant, an antitone Galois connection). See that article for a number of examples: the case of Galois theory of course is a leading one. Any Galois connection gives rise to closure operators and to inverse order-preserving bijections between the corresponding closed elements.

As is the case for Galois groups, the real interest lies often in refining a correspondence to a duality (i.e. antitone order isomorphism). A treatment of Galois theory along these lines by Kaplansky was influential in the recognition of the general structure here.

The partial order case collapses the adjunction definitions quite noticeably, but can provide several themes:

The role of quantifiers in predicate logics is in forming propositions and also in expressing sophisticated predicates by closing formulas with possibly more variables. For example, consider a predicate ψ f {\displaystyle \psi _{f}} {\displaystyle \psi _{f}} with two open variables of sort X {\displaystyle X} {\displaystyle X} and Y {\displaystyle Y} {\displaystyle Y}. Using a quantifier to close X {\displaystyle X} {\displaystyle X}, we can form the set

{ y ∈ Y ∣ ∃ x . ψ f ( x , y ) ∧ ϕ S ( x ) } {\displaystyle \{y\in Y\mid \exists x.\,\psi _{f}(x,y)\land \phi _{S}(x)\}} {\displaystyle \{y\in Y\mid \exists x.\,\psi _{f}(x,y)\land \phi _{S}(x)\}}

of all elements y {\displaystyle y} {\displaystyle y} of Y {\displaystyle Y} {\displaystyle Y} for which there is an x {\displaystyle x} {\displaystyle x} to which it is ψ f {\displaystyle \psi _{f}} {\displaystyle \psi _{f}}-related, and which itself is characterized by the property ϕ S {\displaystyle \phi _{S}} {\displaystyle \phi _{S}}. Set theoretic operations like the intersection ∩ {\displaystyle \cap } {\displaystyle \cap } of two sets directly corresponds to the conjunction ∧ {\displaystyle \land } {\displaystyle \land } of predicates. In categorical logic, a subfield of topos theory, quantifiers are identified with adjoints to the pullback functor. Such a realization can be seen in analogy to the discussion of propositional logic using set theory but the general definition make for a richer range of logics.

So consider an object Y {\displaystyle Y} {\displaystyle Y} in a category with pullbacks. Any morphism f : X → Y {\displaystyle f:X\to Y} {\displaystyle f:X\to Y} induces a functor

f ∗ : Sub ( Y ) ⟶ Sub ( X ) {\displaystyle f^{*}:{\text{Sub}}(Y)\longrightarrow {\text{Sub}}(X)} {\displaystyle f^{*}:{\text{Sub}}(Y)\longrightarrow {\text{Sub}}(X)}

on the category that is the preorder of subobjects. It maps subobjects T {\displaystyle T} {\displaystyle T} of Y {\displaystyle Y} {\displaystyle Y} (technically: monomorphism classes of T → Y {\displaystyle T\to Y} {\displaystyle T\to Y}) to the pullback X × Y T {\displaystyle X\times _{Y}T} {\displaystyle X\times _{Y}T}. If this functor has a left- or right adjoint, they are called ∃ f {\displaystyle \exists _{f}} {\displaystyle \exists _{f}} and ∀ f {\displaystyle \forall _{f}} {\displaystyle \forall _{f}}, respectively.[5] They both map from Sub ( X ) {\displaystyle {\text{Sub}}(X)} {\displaystyle {\text{Sub}}(X)} back to Sub ( Y ) {\displaystyle {\text{Sub}}(Y)} {\displaystyle {\text{Sub}}(Y)}. Very roughly, given a domain S ⊂ X {\displaystyle S\subset X} {\displaystyle S\subset X} to quantify a relation expressed via f {\displaystyle f} {\displaystyle f} over, the functor/quantifier closes X {\displaystyle X} {\displaystyle X} in X × Y T {\displaystyle X\times _{Y}T} {\displaystyle X\times _{Y}T} and returns the thereby specified subset of Y {\displaystyle Y} {\displaystyle Y}.

Example: In Set {\displaystyle \operatorname {Set} } {\displaystyle \operatorname {Set} }, the category of sets and functions, the canonical subobjects are the subset (or rather their canonical injections). The pullback f ∗ T = X × Y T {\displaystyle f^{*}T=X\times _{Y}T} {\displaystyle f^{*}T=X\times _{Y}T} of an injection of a subset T {\displaystyle T} {\displaystyle T} into Y {\displaystyle Y} {\displaystyle Y} along f {\displaystyle f} {\displaystyle f} is characterized as the largest set which knows all about f {\displaystyle f} {\displaystyle f} and the injection of T {\displaystyle T} {\displaystyle T} into Y {\displaystyle Y} {\displaystyle Y}. It therefore turns out to be (in bijection with) the inverse image f − 1 [ T ] ⊆ X {\displaystyle f^{-1}[T]\subseteq X} {\displaystyle f^{-1}[T]\subseteq X}.

For S ⊆ X {\displaystyle S\subseteq X} {\displaystyle S\subseteq X}, let us figure out the left adjoint, which is defined via

Hom ( ∃ f S , T ) ≅ Hom ( S , f ∗ T ) , {\displaystyle {\operatorname {Hom} }(\exists _{f}S,T)\cong {\operatorname {Hom} }(S,f^{*}T),} {\displaystyle {\operatorname {Hom} }(\exists _{f}S,T)\cong {\operatorname {Hom} }(S,f^{*}T),}

which here just means

∃ f S ⊆ T ↔ S ⊆ f − 1 [ T ] {\displaystyle \exists _{f}S\subseteq T\leftrightarrow S\subseteq f^{-1}[T]} {\displaystyle \exists _{f}S\subseteq T\leftrightarrow S\subseteq f^{-1}[T]}.

Consider f [ S ] ⊆ T {\displaystyle f[S]\subseteq T} {\displaystyle f[S]\subseteq T}. We see S ⊆ f − 1 [ f [ S ] ] ⊆ f − 1 [ T ] {\displaystyle S\subseteq f^{-1}[f[S]]\subseteq f^{-1}[T]} ![{\displaystyle S\subseteq f^{-1}[f[S]]\subseteq f^{-1}[T]}](https://wikimedia.org/api/rest_v1/media/math/render/svg/9c2845f7f5ca6c49df476c5de6e8c4c4385f1539). Conversely, If for an x ∈ S {\displaystyle x\in S} {\displaystyle x\in S} we also have x ∈ f − 1 [ T ] {\displaystyle x\in f^{-1}[T]} {\displaystyle x\in f^{-1}[T]}, then clearly f ( x ) ∈ T {\displaystyle f(x)\in T} {\displaystyle f(x)\in T}. So S ⊆ f − 1 [ T ] {\displaystyle S\subseteq f^{-1}[T]} {\displaystyle S\subseteq f^{-1}[T]} implies f [ S ] ⊆ T {\displaystyle f[S]\subseteq T} {\displaystyle f[S]\subseteq T}. We conclude that left adjoint to the inverse image functor f ∗ {\displaystyle f^{*}} {\displaystyle f^{*}} is given by the direct image. Here is a characterization of this result, which matches more the logical interpretation: The image of S {\displaystyle S} {\displaystyle S} under ∃ f {\displaystyle \exists _{f}} {\displaystyle \exists _{f}} is the full set of y {\displaystyle y} {\displaystyle y}'s, such that f − 1 [ { y } ] ∩ S {\displaystyle f^{-1}[\{y\}]\cap S} {\displaystyle f^{-1}[\{y\}]\cap S} is non-empty. This works because it neglects exactly those y ∈ Y {\displaystyle y\in Y} {\displaystyle y\in Y} which are in the complement of f [ S ] {\displaystyle f[S]} {\displaystyle f[S]}. So

∃ f S = { y ∈ Y ∣ ∃ ( x ∈ f − 1 [ { y } ] ) . x ∈ S } = f [ S ] . {\displaystyle \exists _{f}S=\{y\in Y\mid \exists (x\in f^{-1}[\{y\}]).\,x\in S\;\}=f[S].} {\displaystyle \exists _{f}S=\{y\in Y\mid \exists (x\in f^{-1}[\{y\}]).\,x\in S\;\}=f[S].}

Put this in analogy to our motivation { y ∈ Y ∣ ∃ x . ψ f ( x , y ) ∧ ϕ S ( x ) } {\displaystyle \{y\in Y\mid \exists x.\,\psi _{f}(x,y)\land \phi _{S}(x)\}} {\displaystyle \{y\in Y\mid \exists x.\,\psi _{f}(x,y)\land \phi _{S}(x)\}}.

The right adjoint to the inverse image functor is given (without doing the computation here) by

∀ f S = { y ∈ Y ∣ ∀ ( x ∈ f − 1 [ { y } ] ) . x ∈ S } . {\displaystyle \forall _{f}S=\{y\in Y\mid \forall (x\in f^{-1}[\{y\}]).\,x\in S\;\}.} {\displaystyle \forall _{f}S=\{y\in Y\mid \forall (x\in f^{-1}[\{y\}]).\,x\in S\;\}.}

The subset ∀ f S {\displaystyle \forall _{f}S} {\displaystyle \forall _{f}S} of Y {\displaystyle Y} {\displaystyle Y} is characterized as the full set of y {\displaystyle y} {\displaystyle y}'s with the property that the inverse image of { y } {\displaystyle \{y\}} {\displaystyle \{y\}} with respect to f {\displaystyle f} {\displaystyle f} is fully contained within S {\displaystyle S} {\displaystyle S}. Note how the predicate determining the set is the same as above, except that ∃ {\displaystyle \exists } {\displaystyle \exists } is replaced by ∀ {\displaystyle \forall } {\displaystyle \forall }.

See also powerset.

The twin fact in probability can be understood as an adjunction: that expectation commutes with affine transform, and that the expectation is in some sense the best solution to the problem of finding a real-valued approximation to a distribution on the real numbers.

Define a category based on R {\displaystyle \mathbb {R} } {\displaystyle \mathbb {R} }, with objects being the real numbers, and the morphisms being "affine functions evaluated at a point". That is, for any affine function f ( x ) = a x + b {\displaystyle f(x)=ax+b} {\displaystyle f(x)=ax+b} and any real number r {\displaystyle r} {\displaystyle r}, define a morphism ( r , f ) : r → f ( r ) {\displaystyle (r,f):r\to f(r)} {\displaystyle (r,f):r\to f(r)}.

Define a category based on M ( R ) {\displaystyle M(\mathbb {R} )} {\displaystyle M(\mathbb {R} )}, the set of probability distribution on R {\displaystyle \mathbb {R} } {\displaystyle \mathbb {R} } with finite expectation. Define morphisms on M ( R ) {\displaystyle M(\mathbb {R} )} {\displaystyle M(\mathbb {R} )} as "affine functions evaluated at a distribution". That is, for any affine function f ( x ) = a x + b {\displaystyle f(x)=ax+b} {\displaystyle f(x)=ax+b} and any μ ∈ M ( R ) {\displaystyle \mu \in M(\mathbb {R} )} {\displaystyle \mu \in M(\mathbb {R} )}, define a morphism ( μ , f ) : μ → μ ∘ f − 1 {\displaystyle (\mu ,f):\mu \to \mu \circ f^{-1}} {\displaystyle (\mu ,f):\mu \to \mu \circ f^{-1}}.

Then, the Dirac delta measure defines a functor: δ : x ↦ δ x {\displaystyle \delta :x\mapsto \delta _{x}} {\displaystyle \delta :x\mapsto \delta _{x}}, and the expectation defines another functor E : μ ↦ E [ μ ] {\displaystyle \mathbb {E} :\mu \mapsto \mathbb {E} [\mu ]} {\displaystyle \mathbb {E} :\mu \mapsto \mathbb {E} [\mu ]}, and they are adjoint: E ⊣ δ {\displaystyle \mathbb {E} \dashv \delta } {\displaystyle \mathbb {E} \dashv \delta }. (Somewhat disconcertingly, E {\displaystyle \mathbb {E} } {\displaystyle \mathbb {E} } is the left adjoint, even though E {\displaystyle \mathbb {E} } {\displaystyle \mathbb {E} } is "forgetful" and δ {\displaystyle \delta } {\displaystyle \delta } is "free".)

Adjunctions in full

[edit]

There are hence numerous functors and natural transformations associated with every adjunction, and only a small portion is sufficient to determine the rest.

An adjunction between categories C and D consists of

An equivalent formulation, where X denotes any object of C and Y denotes any object of D, is as follows:

For every _C_-morphism f : FYX, there is a unique D_-morphism Φ_Y, X(f) = g : YGX such that the diagrams below commute, and for every _D_-morphism g : YGX, there is a unique C_-morphism Φ−1_Y, X(g) = f : FYX in C such that the diagrams below commute:

From this assertion, one can recover that:

f = Φ Y , X − 1 ( g ) = ε X ∘ F ( g ) ∈ h o m C ( F ( Y ) , X ) g = Φ Y , X ( f ) = G ( f ) ∘ η Y ∈ h o m D ( Y , G ( X ) ) Φ G X , X − 1 ( 1 G X ) = ε X ∈ h o m C ( F G ( X ) , X ) Φ Y , F Y ( 1 F Y ) = η Y ∈ h o m D ( Y , G F ( Y ) ) {\displaystyle {\begin{aligned}f=\Phi _{Y,X}^{-1}(g)&=\varepsilon _{X}\circ F(g)&\in &\,\,\mathrm {hom} _{C}(F(Y),X)\\g=\Phi _{Y,X}(f)&=G(f)\circ \eta _{Y}&\in &\,\,\mathrm {hom} _{D}(Y,G(X))\\\Phi _{GX,X}^{-1}(1_{GX})&=\varepsilon _{X}&\in &\,\,\mathrm {hom} _{C}(FG(X),X)\\\Phi _{Y,FY}(1_{FY})&=\eta _{Y}&\in &\,\,\mathrm {hom} _{D}(Y,GF(Y))\\\end{aligned}}} {\displaystyle {\begin{aligned}f=\Phi _{Y,X}^{-1}(g)&=\varepsilon _{X}\circ F(g)&\in &\,\,\mathrm {hom} _{C}(F(Y),X)\\g=\Phi _{Y,X}(f)&=G(f)\circ \eta _{Y}&\in &\,\,\mathrm {hom} _{D}(Y,G(X))\\\Phi _{GX,X}^{-1}(1_{GX})&=\varepsilon _{X}&\in &\,\,\mathrm {hom} _{C}(FG(X),X)\\\Phi _{Y,FY}(1_{FY})&=\eta _{Y}&\in &\,\,\mathrm {hom} _{D}(Y,GF(Y))\\\end{aligned}}}

1 F Y = ε F Y ∘ F ( η Y ) 1 G X = G ( ε X ) ∘ η G X {\displaystyle {\begin{aligned}1_{FY}&=\varepsilon _{FY}\circ F(\eta _{Y})\\1_{GX}&=G(\varepsilon _{X})\circ \eta _{GX}\end{aligned}}} {\displaystyle {\begin{aligned}1_{FY}&=\varepsilon _{FY}\circ F(\eta _{Y})\\1_{GX}&=G(\varepsilon _{X})\circ \eta _{GX}\end{aligned}}}

In particular, the equations above allow one to define Φ, ε, and η in terms of any one of the three. However, the adjoint functors F and G alone are in general not sufficient to determine the adjunction. The equivalence of these situations is demonstrated below.

Universal morphisms induce hom-set adjunction

[edit]

Given a right adjoint functor G : CD; in the sense of initial morphisms, one may construct the induced hom-set adjunction by doing the following steps.

A similar argument allows one to construct a hom-set adjunction from the terminal morphisms to a left adjoint functor. (The construction that starts with a right adjoint is slightly more common, since the right adjoint in many adjoint pairs is a trivially defined inclusion or forgetful functor.)

counit–unit adjunction induces hom-set adjunction

[edit]

Given functors F : DC, G : CD, and a counit–unit adjunction (ε, η) : F ⊣ {\displaystyle \dashv } {\displaystyle \dashv } G, we can construct a hom-set adjunction by finding the natural transformation Φ : hom_C_(F_-,-) → hom_D(-,_G_-) in the following steps:

Φ Y , X ( f ) = G ( f ) ∘ η Y Ψ Y , X ( g ) = ε X ∘ F ( g ) {\displaystyle {\begin{aligned}\Phi _{Y,X}(f)=G(f)\circ \eta _{Y}\\\Psi _{Y,X}(g)=\varepsilon _{X}\circ F(g)\end{aligned}}} {\displaystyle {\begin{aligned}\Phi _{Y,X}(f)=G(f)\circ \eta _{Y}\\\Psi _{Y,X}(g)=\varepsilon _{X}\circ F(g)\end{aligned}}}

The transformations Φ and Ψ are natural because η and ε are natural.

Ψ Φ f = ε X ∘ F G ( f ) ∘ F ( η Y ) = f ∘ ε F Y ∘ F ( η Y ) = f ∘ 1 F Y = f {\displaystyle {\begin{aligned}\Psi \Phi f&=\varepsilon _{X}\circ FG(f)\circ F(\eta _{Y})\\&=f\circ \varepsilon _{FY}\circ F(\eta _{Y})\\&=f\circ 1_{FY}=f\end{aligned}}} {\displaystyle {\begin{aligned}\Psi \Phi f&=\varepsilon _{X}\circ FG(f)\circ F(\eta _{Y})\\&=f\circ \varepsilon _{FY}\circ F(\eta _{Y})\\&=f\circ 1_{FY}=f\end{aligned}}}

hence ΨΦ is the identity transformation.

Φ Ψ g = G ( ε X ) ∘ G F ( g ) ∘ η Y = G ( ε X ) ∘ η G X ∘ g = 1 G X ∘ g = g {\displaystyle {\begin{aligned}\Phi \Psi g&=G(\varepsilon _{X})\circ GF(g)\circ \eta _{Y}\\&=G(\varepsilon _{X})\circ \eta _{GX}\circ g\\&=1_{GX}\circ g=g\end{aligned}}} {\displaystyle {\begin{aligned}\Phi \Psi g&=G(\varepsilon _{X})\circ GF(g)\circ \eta _{Y}\\&=G(\varepsilon _{X})\circ \eta _{GX}\circ g\\&=1_{GX}\circ g=g\end{aligned}}}

hence ΦΨ is the identity transformation. Thus Φ is a natural isomorphism with inverse Φ−1 = Ψ.

Hom-set adjunction induces all of the above

[edit]

Given functors F : DC, G : CD, and a hom-set adjunction Φ : hom_C_(F_-,-) → hom_D(-,_G_-), one can construct a counit–unit adjunction

( ε , η ) : F ⊣ G {\displaystyle (\varepsilon ,\eta ):F\dashv G} {\displaystyle (\varepsilon ,\eta ):F\dashv G} ,

which defines families of initial and terminal morphisms, in the following steps:

Φ Y , X ( f ) = G ( f ) ∘ η Y Φ Y , X − 1 ( g ) = ε X ∘ F ( g ) {\displaystyle {\begin{aligned}\Phi _{Y,X}(f)=G(f)\circ \eta _{Y}\\\Phi _{Y,X}^{-1}(g)=\varepsilon _{X}\circ F(g)\end{aligned}}} {\displaystyle {\begin{aligned}\Phi _{Y,X}(f)=G(f)\circ \eta _{Y}\\\Phi _{Y,X}^{-1}(g)=\varepsilon _{X}\circ F(g)\end{aligned}}}

for each f: FYX and g: YGX (which completely determine Φ).

1 F Y = ε F Y ∘ F ( η Y ) {\displaystyle 1_{FY}=\varepsilon _{FY}\circ F(\eta _{Y})} {\displaystyle 1_{FY}=\varepsilon _{FY}\circ F(\eta _{Y})},

and substituting GX for Y and εX = Φ−1_GX, X_(1_GX_) for f in the first formula gives the second counit–unit equation

1 G X = G ( ε X ) ∘ η G X {\displaystyle 1_{GX}=G(\varepsilon _{X})\circ \eta _{GX}} {\displaystyle 1_{GX}=G(\varepsilon _{X})\circ \eta _{GX}}.

Not every functor G : CD admits a left adjoint. If C is a complete category, then the functors with left adjoints can be characterized by the adjoint functor theorem of Peter J. Freyd: G has a left adjoint if and only if it is continuous and a certain smallness condition is satisfied: for every object Y of D there exists a family of morphisms

f i : YG(X i)

where the indices i come from a set I, not a proper class, such that every morphism

h : YG(X)

can be written as

h = G(t) ∘ f i

for some i in I and some morphism

t : X iXC.

An analogous statement characterizes those functors with a right adjoint.

An important special case is that of locally presentable categories. If F : C → D {\displaystyle F:C\to D} {\displaystyle F:C\to D} is a functor between locally presentable categories, then

If the functor F : DC has two right adjoints G and _G_′, then G and _G_′ are naturally isomorphic. The same is true for left adjoints.

Conversely, if F is left adjoint to G, and G is naturally isomorphic to _G_′ then F is also left adjoint to G_′. More generally, if 〈_F, G, ε, η〉 is an adjunction (with counit–unit (ε,η)) and

σ : F → _F_′

τ : G → _G_′

are natural isomorphisms then 〈_F_′, _G_′, ε′, η′〉 is an adjunction where

η ′ = ( τ ∗ σ ) ∘ η ε ′ = ε ∘ ( σ − 1 ∗ τ − 1 ) . {\displaystyle {\begin{aligned}\eta '&=(\tau \ast \sigma )\circ \eta \\\varepsilon '&=\varepsilon \circ (\sigma ^{-1}\ast \tau ^{-1}).\end{aligned}}} {\displaystyle {\begin{aligned}\eta '&=(\tau \ast \sigma )\circ \eta \\\varepsilon '&=\varepsilon \circ (\sigma ^{-1}\ast \tau ^{-1}).\end{aligned}}}

Here ∘ {\displaystyle \circ } {\displaystyle \circ } denotes vertical composition of natural transformations, and ∗ {\displaystyle \ast } {\displaystyle \ast } denotes horizontal composition.

Adjunctions can be composed in a natural fashion. Specifically, if 〈F, G, ε, η〉 is an adjunction between C and D and 〈_F_′, _G_′, ε′, η′〉 is an adjunction between D and E then the functor

F ∘ F ′ : E → C {\displaystyle F\circ F':E\rightarrow C} {\displaystyle F\circ F':E\rightarrow C}

is left adjoint to

G ′ ∘ G : C → E . {\displaystyle G'\circ G:C\to E.} {\displaystyle G'\circ G:C\to E.}

More precisely, there is an adjunction between F F' and G' G with unit and counit given respectively by the compositions:

1 E → η ′ G ′ F ′ → G ′ η F ′ G ′ G F F ′ F F ′ G ′ G → F ε ′ G F G → ε 1 C . {\displaystyle {\begin{aligned}&1_{\mathcal {E}}{\xrightarrow {\eta '}}G'F'{\xrightarrow {G'\eta F'}}G'GFF'\\&FF'G'G{\xrightarrow {F\varepsilon 'G}}FG{\xrightarrow {\varepsilon }}1_{\mathcal {C}}.\end{aligned}}} {\displaystyle {\begin{aligned}&1_{\mathcal {E}}{\xrightarrow {\eta '}}G'F'{\xrightarrow {G'\eta F'}}G'GFF'\\&FF'G'G{\xrightarrow {F\varepsilon 'G}}FG{\xrightarrow {\varepsilon }}1_{\mathcal {C}}.\end{aligned}}}

This new adjunction is called the composition of the two given adjunctions.

Since there is also a natural way to define an identity adjunction between a category C and itself, one can then form a category whose objects are all small categories and whose morphisms are adjunctions.

The most important property of adjoints is their continuity: every functor that has a left adjoint (and therefore is a right adjoint) is continuous (i.e. commutes with limits in the category theoretical sense); every functor that has a right adjoint (and therefore is a left adjoint) is cocontinuous (i.e. commutes with colimits).

Since many common constructions in mathematics are limits or colimits, this provides a wealth of information. For example:

If C and D are preadditive categories and F : DC is an additive functor with a right adjoint G : CD, then G is also an additive functor and the hom-set bijections

Φ Y , X : h o m C ( F Y , X ) ≅ h o m D ( Y , G X ) {\displaystyle \Phi _{Y,X}:\mathrm {hom} _{\mathcal {C}}(FY,X)\cong \mathrm {hom} _{\mathcal {D}}(Y,GX)} {\displaystyle \Phi _{Y,X}:\mathrm {hom} _{\mathcal {C}}(FY,X)\cong \mathrm {hom} _{\mathcal {D}}(Y,GX)}

are, in fact, isomorphisms of abelian groups. Dually, if G is additive with a left adjoint F, then F is also additive.

Moreover, if both C and D are additive categories (i.e. preadditive categories with all finite biproducts), then any pair of adjoint functors between them are automatically additive.

Universal constructions

[edit]

As stated earlier, an adjunction between categories C and D gives rise to a family of universal morphisms, one for each object in C and one for each object in D. Conversely, if there exists a universal morphism to a functor G : CD from every object of D, then G has a left adjoint.

However, universal constructions are more general than adjoint functors: a universal construction is like an optimization problem; it gives rise to an adjoint pair if and only if this problem has a solution for every object of D (equivalently, every object of C).

Equivalences of categories

[edit]

If a functor F : DC is one half of an equivalence of categories then it is the left adjoint in an adjoint equivalence of categories, i.e. an adjunction whose unit and counit are isomorphisms.

Every adjunction 〈F, G, ε, η〉 extends an equivalence of certain subcategories. Define C_1 as the full subcategory of C consisting of those objects X of C for which ε_X is an isomorphism, and define D_1 as the full subcategory of D consisting of those objects Y of D for which η_Y is an isomorphism. Then F and G can be restricted to _D_1 and _C_1 and yield inverse equivalences of these subcategories.

In a sense, then, adjoints are "generalized" inverses. Note however that a right inverse of F (i.e. a functor G such that FG is naturally isomorphic to 1_D_) need not be a right (or left) adjoint of F. Adjoints generalize two-sided inverses.

Every adjunction 〈F, G, ε, η〉 gives rise to an associated monadT, η, μ〉 in the category D. The functor

T : D → D {\displaystyle T:{\mathcal {D}}\to {\mathcal {D}}} {\displaystyle T:{\mathcal {D}}\to {\mathcal {D}}}

is given by T = GF. The unit of the monad

η : 1 D → T {\displaystyle \eta :1_{\mathcal {D}}\to T} {\displaystyle \eta :1_{\mathcal {D}}\to T}

is just the unit η of the adjunction and the multiplication transformation

μ : T 2 → T {\displaystyle \mu :T^{2}\to T\,} {\displaystyle \mu :T^{2}\to T\,}

is given by μ = G_ε_F. Dually, the triple 〈FG, ε, _F_η_G_〉 defines a comonad in C.

Every monad arises from some adjunction—in fact, typically from many adjunctions—in the above fashion. Two constructions, called the category of Eilenberg–Moore algebras and the Kleisli category are two extremal solutions to the problem of constructing an adjunction that gives rise to a given monad.

  1. ^ Baez, John C. (1996). "Higher-Dimensional Algebra II: 2-Hilbert Spaces". arXiv:q-alg/9609018.
  2. ^ Kan, Daniel M. (1958). "Adjoint Functors" (PDF). Transactions of the American Mathematical Society. 87 (2): 294–329. doi:10.2307/1993102. JSTOR 1993102.
  3. ^ Lawvere, F. William, "Adjointness in foundations", Dialectica, 1969. The notation is different nowadays; an easier introduction by Peter Smith in these lecture notes, which also attribute the concept to the article cited.
  4. ^ "Indiscrete category". nLab.
  5. ^ Mac Lane, Saunders; Moerdijk, Ieke (1992) Sheaves in Geometry and Logic, Springer-Verlag. ISBN 0-387-97710-4 See page 58