Deep Focus vs Shallow Focus
Deep focus and shallow focus are opposite ends of a spectrum that determines how much of the image, from foreground to background, remains acceptably sharp. Deep focus keeps multiple planes — near subject, mid-ground, and distant background — simultaneously in sharp focus, creating images that feel expansive, objective, and information-dense. Shallow focus restricts sharpness to a narrow plane around the subject, blurring everything before and behind into soft bokeh, focusing attention and creating a sense of intimacy or isolation. Both are deliberate creative choices with distinct narrative and aesthetic uses, and both can be specified with precision in AI image prompts to shape the spatial reading of your generated images on Floniks.
Defining Deep Focus and Its Narrative Uses
Deep focus — also called pan focus — keeps every plane of the scene in sharp detail simultaneously, from objects close to the camera through the mid-ground to elements at the far background. Achieved optically through small apertures, wide-angle lenses, and sufficient lighting, deep focus is associated in cinematic history with directors who wanted to place the audience in a position of omniscient observation: the entire visual world of the scene is available to the viewer at once, with no element artificially foregrounded by selective sharpness. This creates a specific viewing experience: the audience must actively choose what to look at within the frame rather than being guided by selective focus toward a prescribed point of attention. This democratization of visual attention suits specific narrative purposes — stories where the relationship between characters and their environments is as important as the characters themselves; scenes where spatial relationships carry meaning (a character and the door they cannot reach, both in sharp focus); ensemble scenes where multiple characters at different depths are simultaneously in play. In AI prompts, deep focus is specified by describing the sharpness of multiple planes: 'deep focus photography, foreground element in sharp focus, mid-ground subject in equally sharp focus, distant background details also clearly rendered, no depth-of-field blur on any plane, extensive depth of field, documentary or wide-angle cinematic style, every element equally legible'. Adding the contextual use — documentary, ensemble drama, environmental storytelling — helps the model maintain the correct visual register alongside the focus instruction.
Shallow Focus: Directing Attention and Creating Intimacy
Shallow focus restricts sharpness to a narrow depth zone — often only a few centimeters deep at close focusing distances — so that the subject is rendered with crystal clarity while everything before and behind it falls progressively out of focus into bokeh. This deliberate restriction of visual information does two things simultaneously: it directs the viewer's attention to the sharp element with irresistible authority (the eye always goes to the sharpest point in a frame), and it creates a sense of intimacy by isolating the subject from its context, rendering the environment as impression rather than information. Shallow focus is the dominant aesthetic for portraiture, beauty photography, wildlife photography, and intimate dramatic scenes — any context where the viewer's relationship to the subject is personal and direct rather than spatial and environmental. In AI prompts, shallow focus requires specifying both the sharp plane and the defocused planes: 'extreme shallow depth of field, only the subject's eyes in sharp focus, nose and ears already softening out of focus, background completely defocused into smooth bokeh, shallow focus portrait, intimate close-up framing, 85mm or 50mm f/1.4 aesthetic, subject isolated by shallow focus'. For a less extreme version: 'moderate shallow depth of field, subject sharp from face to mid-body, background softly defocused but not completely blurred, some environmental context visible in soft focus, lifestyle portrait photography'.
Focal Planes and the Zone of Acceptable Sharpness
Between the extremes of deep focus and extreme shallow focus lies a range of intermediate depths of field where the zone of acceptable sharpness extends to some but not all elements of the scene. The creative choices within this range are equally important. A moderate depth of field that keeps the subject and immediate foreground sharp while blurring a distant background is the most common photographic configuration — it combines subject isolation with some environmental context, making it suitable for travel, street, lifestyle, and editorial photography. A split-focus setup — where two elements at different distances are both in sharp focus through careful depth management or through compositing — allows two narrative elements to be simultaneously foregrounded without one blurring the other. The split-focus diopter (a half-lens attachment) achieves this in practice; in AI prompting, it can be described as: 'two subjects at different depths both in sharp focus, nearer subject at close range, far subject 15 feet back, both rendered sharply, background behind far subject softly defocused, split-focus portrait, two focal planes simultaneously sharp'. The practical instruction for deep versus shallow focus in AI prompts is simpler: describe whether the background is sharp or blurred, how many planes of the scene are in focus, and the quality of any defocus (smooth and gradual versus abrupt). The model's interpretation of these spatial-quality descriptions is generally reliable when they are concrete and specific.
Focus Racking and Focus Pull in Video Prompts
In AI video generation, focus is a dynamic element that can shift during a clip through a focus pull — also called rack focus — where the point of sharp focus transitions from one element to another while the camera holds its position. A rack focus from a foreground object to a character in the background (or vice versa) is a classic cinematic device for redirecting audience attention and signaling narrative shifts. The defocusing of one element as another comes into focus creates a visual metaphor: what was important is now relinquishing its primacy; what was background is now foregrounded. In video prompts for Floniks: 'rack focus transition, begins with close foreground object in sharp focus and character in background blurred, focus pulls to character who comes into sharp focus as foreground object blurs, slow focus pull, cinematic drama, smooth transition, medium shot'. The camera's position remains static while only the focus point shifts — this is important to specify to distinguish a focus pull from a camera move. For a more dynamic narrative use: 'beginning of shot, background character in soft focus, dialogue begins, focus racks to background character bringing them into sharp focus while foreground speaker blurs, focus follows narrative emphasis'. In multi-step video workflows in /editor, you can describe the start-frame and end-frame focus state in separate generation nodes and blend them as a focus-pull transition.
Choosing Deep or Shallow Focus for Your Scene
The decision between deep and shallow focus — and the many degrees between them — should follow from the scene's narrative purpose and visual strategy rather than from a default aesthetic preference. Deep focus is the right choice when spatial relationships carry meaning: a character in the foreground and their goal or obstacle in the background, both equally visible; an environment whose detailed contents tell a story; an ensemble scene where the group's positions relative to each other are part of the communication. In these cases, specify full pan focus and describe all planes of the scene as sharp. Shallow focus is right when the scene's purpose is intimacy, isolation, or exclusive attention: a close-up emotion, a subject set apart from their world, a beauty detail that excludes all context. In these cases, describe the narrow focus plane, the degree of defocus in the planes before and behind, and the quality of the bokeh. The middle range — background defocused but mid-ground sharp, or foreground sharp with a soft but readable background — suits most lifestyle, editorial, and documentary contexts where the subject is primary but the environment provides context. Specify this explicitly: 'subject in sharp focus, mid-ground elements softly focused but legible, background smoothly defocused into soft bokeh, environmental context visible but not competing, standard lifestyle photography depth of field, 50mm portrait lens aesthetic'.
Related guides
Build it on Floniks
Image, video, digital humans, and reusable workflows on one canvas. Sign up gets you starter credits — no card required.
Explore Floniks