Re-designing the camera for computational photography

Modified optical setups enable new optical imaging functionalities, including the ability to capture depth, varied angular perspectives, and multispectral content.

06 June 2011

Roarke Horstmeyer

Although digital sensors have become almost ubiquitous, the majority of cameras that contain them have changed little in form over the past century. After a focused 2D image is captured, digitization allows efficient storage for future editing and sharing, but little else. Computational cameras aim to break down the wall between the optics that capture a photo and the post-processing used to enhance it, with designs that jointly optimize both. The initial images from these cameras appear blurry or even unrecognizable, but they contain much useful information for the computer that immediately processes them. The post-processed image is clear and sharp, and can also provide measurements of an object's 3D location, spectral properties (color spectrum), or material composition, for example.

Many optical systems have been developed to extract these unseen clues from a scene of interest, some requiring little or no post-processing computation at all. For example, clever illumination can provide a direct indication of object depth, as demonstrated by Microsoft's Kinect camera. Multispectral imagery can even be created with an unmodified camera, such as a regular point-and-shoot device. Combining several images of the same scene, each taken through a different filter over the lens, captures more than the three standard color bands (red, green, and blue).

The aim of many computational cameras is to add these useful functionalities to a conventional 2D image captured in a single snapshot. Such an ambitious goal inherently requires modification of the optical setup, which can often be realized by adding simple patterned elements to regular camera designs. The patterned optical elements used in current computational cameras fall into two general classes. The first includes elements placed at the camera aperture stop, typically referred to as pupil masks, which globally modify the entire image. The second consists of elements placed very close to the image sensor, which locally modify regions of pixels, much like the Bayer filter pattern in most of today's color cameras. The post-processing of a computational image taken with cameras like these comes in a variety of forms, ranging from simple deconvolution and pixel ‘re-binning’ (recombining data from adjacent sensor pixels to create one pixel of data) to more complex sparse-recovery procedures.
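The two simplest post-processing steps mentioned above, deconvolution and pixel re-binning, can be sketched in a few lines of Python with NumPy. This is only an illustrative sketch, not the processing used in any of the cited cameras: it assumes a known, shift-invariant PSF, circular (periodic) image boundaries, and a generic Wiener filter, and the function names and the `noise_to_signal` parameter are hypothetical.

```python
import numpy as np

def wiener_deconvolve(blurred, psf, noise_to_signal=1e-3):
    """Wiener deconvolution with a known PSF (assumed centred, same shape as image)."""
    # Move the PSF centre to index (0, 0) so its transform carries no shift.
    H = np.fft.fft2(np.fft.ifftshift(psf))
    G = np.fft.fft2(blurred)
    # Regularised inverse filter: conj(H) / (|H|^2 + noise-to-signal ratio).
    W = np.conj(H) / (np.abs(H) ** 2 + noise_to_signal)
    return np.real(np.fft.ifft2(W * G))

def rebin(image, factor):
    """Sum each factor-by-factor block of sensor pixels into one output pixel."""
    rows, cols = image.shape
    r, c = rows // factor, cols // factor
    trimmed = image[:r * factor, :c * factor]
    return trimmed.reshape(r, factor, c, factor).sum(axis=(1, 3))

# Small demonstration with a synthetic scene and an assumed Gaussian PSF.
rng = np.random.default_rng(0)
scene = rng.random((32, 32))
yy, xx = np.mgrid[-16:16, -16:16]
psf = np.exp(-(xx ** 2 + yy ** 2) / 2.0)
psf /= psf.sum()
# Simulate the blurry capture as a circular convolution of scene and PSF.
blurred = np.real(np.fft.ifft2(np.fft.fft2(scene) * np.fft.fft2(np.fft.ifftshift(psf))))
restored = wiener_deconvolve(blurred, psf, noise_to_signal=1e-6)
binned = rebin(restored, 2)  # 16 x 16 image, each pixel a sum of 2 x 2 neighbours
```

In practice the `noise_to_signal` value trades sharpness against noise amplification and would have to be tuned to the sensor.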

Pupil mask design is a constantly evolving area of research, with various mask patterns proposed to extend a camera's depth of field,¹ extract image depth,² or offer super-resolution,³ among other enhanced functionalities. Each mask alters the camera's 3D point-spread function (PSF) to better present the information to be extracted during post-processing. We have demonstrated a method to optimally design any desired PSF intensity pattern in 3D (see Figure 1).⁴

Figure 1. Example of designing a camera's point-spread function (PSF) at three planes of defocus (z₁, z₂, z₃). Three desired intensity distributions (I₁, I₂, I₃) are input to an optimization procedure that finds an optimal pupil mask. Simulation and experiment (using a Nikon single-lens reflex camera and a Nikon AF NIKKOR 50mm f/1.8D lens with a printed binary pupil mask) show close agreement. This PSF, which begins as one point, then defocuses into four points, and then nine points, offers a simple depth detection scheme.
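To make the link between a pupil mask and the 3D PSF concrete, the following sketch evaluates the forward model only: given a mask, it computes the PSF at a chosen amount of defocus. It is not the mask optimization procedure of reference 4. It assumes a standard scalar Fourier-optics model (monochromatic light, paraxial imaging, defocus expressed as a quadratic pupil phase in waves at the aperture edge), and `defocused_psf`, `defocus_waves`, and `pad_factor` are hypothetical names.

```python
import numpy as np

def defocused_psf(pupil_mask, defocus_waves, pad_factor=4):
    """Intensity PSF of a masked circular pupil at a given defocus (in waves)."""
    n = pupil_mask.shape[0]
    coords = np.linspace(-1.0, 1.0, n)      # normalised pupil coordinates
    x, y = np.meshgrid(coords, coords)
    r2 = x ** 2 + y ** 2
    # Mask transmission, clipped to the circular aperture, times the
    # quadratic phase that models defocus.
    pupil = pupil_mask * (r2 <= 1.0) * np.exp(2j * np.pi * defocus_waves * r2)
    size = pad_factor * n                   # zero-padding samples the PSF more finely
    field = np.fft.fftshift(np.fft.fft2(pupil, s=(size, size)))
    psf = np.abs(field) ** 2                # the sensor records intensity
    return psf / psf.sum()                  # normalise to unit energy

# Demonstration: an open aperture at three planes of defocus.
open_mask = np.ones((64, 64))
psf_stack = [defocused_psf(open_mask, w) for w in (0.0, 1.0, 2.0)]
```

Replacing `open_mask` with a binary pattern gives the PSF stack for that mask, which is the quantity a design procedure would compare against the desired intensity distributions at each plane.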

Sensor-based coding can help obtain different angular perspectives of an object (its ‘light field’), which is closely related to detecting the phase of an incoming wavefront. Once captured, interesting effects like digital refocusing can be achieved in post-processing. Periodic arrays of small lenses or pinholes provide a simple way to extract these varied perspectives from a single image. More complex periodic pattern designs can lead to phase detection⁵ or pixel-level optical transfer function design with background noise reduction⁶,⁷ (see example in Figure 2).

Figure 2. (a) A pixel-level high-pass optical transfer function (OTF) design using a surface-wave-enabled darkfield aperture and a darkfield image obtained with this ring-like design. (b) A low-pass OTF created by a circular sub-aperture and a conventional bright-field image obtained using this aperture geometry. a.u.: Arbitrary units. (Figure courtesy of Guoan Zheng, Biophotonics Laboratory, Caltech.)
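Returning to the lenslet-array case described above, the sketch below shows how angular perspectives might be pulled out of a single raw image and then digitally refocused. It assumes an idealized layout in which every lenslet covers exactly `p × p` sensor pixels on a perfectly aligned grid, and it uses a basic shift-and-add refocus with integer pixel shifts and wrap-around edges. The function names and the `shift_per_view` parameter are hypothetical.

```python
import numpy as np

def subaperture_views(raw, p):
    """Split a lenslet image into p*p angular views, indexed as views[u, v, i, j]."""
    rows, cols = raw.shape[0] // p, raw.shape[1] // p
    blocks = raw[:rows * p, :cols * p].reshape(rows, p, cols, p)
    # Pixel (u, v) under lenslet (i, j) belongs to view (u, v) at position (i, j).
    return blocks.transpose(1, 3, 0, 2)

def refocus(views, shift_per_view):
    """Shift-and-add refocus: shift each view in proportion to its angle, then average."""
    p = views.shape[0]
    centre = (p - 1) / 2.0
    out = np.zeros(views.shape[2:], dtype=float)
    for u in range(p):
        for v in range(p):
            dy = int(round(shift_per_view * (u - centre)))
            dx = int(round(shift_per_view * (v - centre)))
            out += np.roll(views[u, v], (dy, dx), axis=(0, 1))
    return out / (p * p)

# Demonstration on synthetic data: 3 x 3 pixels under each of 20 x 20 lenslets.
raw = np.random.default_rng(1).random((60, 60))
views = subaperture_views(raw, 3)
refocused = refocus(views, shift_per_view=1.0)
```

Varying `shift_per_view` moves the synthetic focal plane; a value of zero simply averages all views, which corresponds to the conventional image at the lenslet resolution.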

Finally, pupil- and sensor-based coding can be combined. For example, we can obtain a multispectral image in a single snapshot by inserting a variable filter at the pupil and a periodic array near the sensor (see Figure 3).⁸ In this way, 27 spectral channels are directly captured at the expense of the image's spatial resolution.

Figure 3. (a) Schematic of the snapshot multispectral camera layout. (b) Head-on image of the camera lens with a variable bandpass filter inserted at the aperture stop. (c) Example output after post-processing. A multispectral data cube of crayons (235×141 spatial resolution with 27 spectral channels), with measured spectra shown for four example pixels.
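The trade between spectral channels and spatial resolution can be illustrated by rearranging raw sensor pixels into a data cube. The sketch below is a simplified illustration rather than the processing of reference 8: it assumes each spatial sample maps to a run of 27 adjacent sensor pixels along a row, one per spectral channel, and it takes 235 as the cube width and 141 as its height. The actual pixel layout under each array element is not given here, and `to_spectral_cube` is a hypothetical name.

```python
import numpy as np

def to_spectral_cube(raw, n_channels):
    """Regroup each run of n_channels adjacent pixels in a row into one spectrum."""
    rows, cols = raw.shape
    n_spatial = cols // n_channels
    trimmed = raw[:, :n_spatial * n_channels]
    return trimmed.reshape(rows, n_spatial, n_channels)

# Cube dimensions quoted in the Figure 3 caption (row/column order assumed).
height, width, channels = 141, 235, 27
# Synthetic stand-in for the raw sensor image.
raw = np.random.default_rng(2).random((height, width * channels))
cube = to_spectral_cube(raw, channels)

spectrum = cube[10, 5]          # 27-sample spectrum of one spatial pixel
sensor_pixels_used = cube.size  # 235 * 141 * 27 raw samples in total
```

Under these assumptions every output pixel consumes 27 sensor pixels, so a 235×141 cube with 27 channels uses 894,645 raw samples.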

A large degree of flexibility is gained when dynamic optical elements are used to improve the computational image capture process. Although research is still in its initial stages, we have developed a framework to optimally design the 3D PSF formation of a dynamic pupil mask, made with a small LCD screen in the camera lens.⁹ The screen's pattern changes during the exposure of one image to shape any desired 3D intensity pattern near the sensor. We have also demonstrated the extraction of mixed spatial, angular, and temporal scene content using a pupil element that changes over time.¹⁰ This design captures multiple frames of a scene's light field, allowing one to digitally refocus on a moving object, or create an image with varying spatial resolution. Likewise, compressive sensing is possible with variable sensor-based elements¹¹ (i.e., pixel-level optical control) that can also be used for object tracking and deblurring. These early results suggest how future cameras can greatly benefit from dynamic, adaptive optical elements in their specific imaging tasks.

In general, since a computational camera captures and processes optical data to measure something besides a simple 2D image, its light-capturing optics and post-processing procedures must be jointly optimized. Our future work will focus on applying the novel camera designs described to image otherwise undetectable features, such as biomedical, microscopic, or ultrafast phenomena.

This research is supported in part by a National Defense Science and Engineering Graduate Fellowship.

Roarke Horstmeyer

Media Lab
Massachusetts Institute of Technology

Cambridge, MA

References:

1. E. R. Dowski, W. T. Cathey, Extended depth of field through wave-front coding, Appl. Opt. 34, no. 11, pp. 1859-1866, 1995. doi:10.1364/AO.34.001859

2. A. Greengard, Y. Schechner, R. Piestun, Depth from diffracted rotation, Opt. Lett. 31, no. 2, pp. 181-183, 2006. doi:10.1364/OL.31.000181

3. A. Ashok, M. Neifeld, Pseudorandom phase masks for superresolution imaging from subpixel shifting, Appl. Opt. 46, no. 12, pp. 2256-2268, 2007. doi:10.1364/AO.46.002256

4. R. Horstmeyer, S. B. Oh, R. Raskar, Iterative aperture mask design in phase space using a rank constraint, Opt. Express 18, no. 21, pp. 22545-22555, 2010. doi:10.1364/OE.18.022545

5. X. Cui, M. Lew, C. Yang, Quantitative differential interference contrast microscopy based on structured-aperture interference, Appl. Phys. Lett. 93, no. 9, pp. 091113, 2008. doi:10.1063/1.2977870

6. G. Zheng, C. Yang, Improving weak-signal identification via predetection background suppression by a pixel-level, surface-wave enabled dark-field aperture, Opt. Lett. 35, no. 15, pp. 2636-2638, 2010. doi:10.1364/OL.35.002636

7. G. Zheng, Y. Wang, C. Yang, Pixel level optical-transfer-function design based on the surface-wave-interferometry aperture, Opt. Express 18, no. 16, pp. 16499-16506, 2010. doi:10.1364/OE.18.016499

8. R. Horstmeyer, R. A. Athale, G. Euliss, Modified light field architecture for reconfigurable multimode imaging, Proc. SPIE 7468, pp. 746804, 2009. doi:10.1117/12.828653

9. R. Horstmeyer, S. B. Oh, O. Gupta, R. Raskar, Partially coherent ambiguity functions for depth-variant point spread function design, Prog. Electromagn. Res. Symp. Proc., pp. 267-272, 2011.

10. A. Agrawal, A. Veeraraghavan, R. Raskar, Reinterpretable imager: towards variable post-capture space, angle, and time resolution in photography, Comput. Graph. Forum 29, no. 2, pp. 763-772, 2010. doi:10.1111/j.1467-8659.2009.01646.x

11. D. Reddy, A. Veeraraghavan, R. Chellappa, P2C2: programmable pixel compressive camera for high speed imaging, Proc. IEEE Int'l Conf. Comput. Photogr., pp. 329-336, 2011.