More poppycock: are mobile phones
as good as dedicated cameras?

Mobile phone makers have repeatedly stated that the cameras built into their top-of-the-line phones are as good as, or better than, consumer digital cameras. Is this true, or just marketing hyperbole and false representation of their own products?

The realm of the possible

It is useful to start this discussion by examining what it is possible to achieve, with current technology, in a digital camera that is restricted by the physical size of a modern mobile phone. This article on dxomark.com is a good starting point, sufficiently technical but still non-mathematical.

While the discussion at the above link is useful, the conclusions fail to address two important points:

1- How are the pictures used and, especially, viewed? The article at the above link contains only small images, and except for a few details of images selected to illustrate a point in the discussion, does not provide the original images for comparison. Pictures shot on a mobile phone are most often viewed only on the screen of the same phone. This is hardly a sufficient medium for comparing image quality, principally because of its small physical size. At most, pictures shot on a mobile phone are uploaded to social media, which almost invariably place severe limits on the pixel count of uploaded images (and these are often viewed by others on a phone screen).
2- How much in-camera image processing is acceptable? This is discussed in detail below.

Make it or fake it

The sensors of current digital system cameras have a pixel size between roughly 3.3 and 6.6 μm. Typical current mobile phones have pixel sizes around 0.8 μm. By applying Abbe's criterion (used in microscopy to calculate the maximum possible resolution of an optical microscope), we can estimate the smallest detail a lens can resolve on the sensor at a given wavelength:

d = λ / (2 NA)

where d is the minimum resolvable distance on the sensor, λ is the wavelength, and NA is the numerical aperture of the lens. Assuming an f/2 lens, NA = 0.24, and the wavelength of green light (approximately in the middle of the visible spectrum) is 530 nm. Under these conditions, d = 1.1 μm, which means the smallest distance the lens can resolve is larger than one pixel. The Nyquist limit of the sensor is twice the pixel size (i.e. a line pair can be, at a minimum, two pixels wide), but the lens cannot resolve a line pair this narrow. To make full use of the sensor pixel size, one should use a faster lens (i.e. a lower f/ratio, like f/1.4 or f/1.2). This is difficult in a phone camera, given the space and cost constraints. At a minimum, a faster lens must have a wider front element, which in turn requires a larger number of optical elements to correct aberrations.
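
For readers who want to check these figures, the short Python sketch below applies the same Abbe formula to a few f/ratios and compares the result with a 0.8 μm pixel pitch. The f/ratios, wavelength and pixel pitch are simply the illustrative values used above, not measurements of any particular phone.

```python
# Rough check of the Abbe limit d = lambda / (2 * NA) for a few apertures,
# compared with a typical 0.8 um phone-sensor pixel pitch.
import math

PIXEL_PITCH_UM = 0.8      # typical current phone sensor pixel
WAVELENGTH_NM = 530       # green light, roughly mid-visible spectrum

def numerical_aperture(f_number: float) -> float:
    """NA = sin(theta), where tan(theta) = 1 / (2 * f_number) for a lens in air."""
    return math.sin(math.atan(1.0 / (2.0 * f_number)))

def abbe_limit_um(wavelength_nm: float, f_number: float) -> float:
    """Smallest resolvable distance on the sensor, in micrometres."""
    return wavelength_nm / (2.0 * numerical_aperture(f_number)) / 1000.0

for f in (2.0, 1.4, 1.2):
    d = abbe_limit_um(WAVELENGTH_NM, f)
    print(f"f/{f}: NA = {numerical_aperture(f):.2f}, "
          f"d = {d:.2f} um = {d / PIXEL_PITCH_UM:.1f} pixels")
```

At f/2 this prints a resolvable distance of about 1.1 μm, or roughly 1.4 pixels; only at f/1.4 or faster does the diffraction limit shrink to about one 0.8 μm pixel, in line with the argument above.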

One way mobile phone designers get around this problem is by grouping adjacent pixels into square clusters of two by two pixels. This effectively reduces the pixel count by a factor of four, and simultaneously reduces the image noise in low light. Decreasing the pixel count from, for example, 16 Mpixels to 4 Mpixels implies a massive reduction in image resolution. Removing some of the image noise in post-processing, on the other hand, does not reduce the pixel count, but usually reduces the effective resolution, and the image becomes even fuzzier.
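
As a minimal sketch of what that binning step does, assuming a plain grayscale frame stored as a NumPy array (real phones bin within the Bayer colour mosaic, which is more involved):

```python
# Minimal illustration of 2x2 pixel binning on a grayscale frame.
# This only shows the effect on pixel count and on noise averaging.
import numpy as np

def bin_2x2(image: np.ndarray) -> np.ndarray:
    """Average each 2x2 block of pixels into one output pixel."""
    h, w = image.shape
    h -= h % 2          # drop an odd trailing row/column, if any
    w -= w % 2
    blocks = image[:h, :w].reshape(h // 2, 2, w // 2, 2)
    return blocks.mean(axis=(1, 3))

# A synthetic 4000 x 4000 "16 Mpixel" frame with some random noise.
rng = np.random.default_rng(0)
frame = rng.normal(loc=128.0, scale=10.0, size=(4000, 4000))

binned = bin_2x2(frame)
print(frame.shape, "->", binned.shape)   # (4000, 4000) -> (2000, 2000)
print(frame.std(), "->", binned.std())   # random noise roughly halved
```

The pixel count drops from 16 Mpixels to 4 Mpixels and the random noise is roughly halved, which is exactly the trade-off described above.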

Things get even worse in the red channel of an image, where the wavelength is around 690 nm. Blue light has a shorter wavelength than green, so the image can theoretically be sharper in the blue channel, but in a Bayer sensor this channel only has half as many sensels as the green channel. Mobile phone designers apply a variety of algorithms to reduce noise and increase the sharpness of the image. The trouble is that, with the computing resources available on the CPU of a mobile phone and its limited battery power, none of these algorithms can recover true image information once it has been lost in the recorded image. They can only do a "best effort" job at creating make-believe information, which is not the same thing. Each algorithm produces its characteristic artifacts, and lessening one problem (e.g. noise, sharpness or dynamic range) comes at the expense of increasing other problems. The stronger the algorithm, the heavier and more visible the resulting artifacts. Algorithms that remove image noise lower the effective resolution of the image (which is not the same thing as the pixel count), and artificially increasing the image resolution creates even worse artifacts.
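
To make the sensel-count argument concrete, here is a small sketch that counts the red, green and blue sensels in a standard RGGB Bayer mosaic and applies the same Abbe formula per channel. The blue wavelength of about 460 nm is an assumed typical value, since only the green and red wavelengths are quoted above.

```python
# Per-channel sensel counts in an RGGB Bayer mosaic, and the Abbe limit for
# each channel with the same f/2 lens (NA ~ 0.24) used above.
import numpy as np

def bayer_rggb(height: int, width: int) -> np.ndarray:
    """Return an array of 'R'/'G'/'B' labels laid out as an RGGB mosaic."""
    mosaic = np.empty((height, width), dtype="<U1")
    mosaic[0::2, 0::2] = "R"
    mosaic[0::2, 1::2] = "G"
    mosaic[1::2, 0::2] = "G"
    mosaic[1::2, 1::2] = "B"
    return mosaic

mosaic = bayer_rggb(1000, 1000)
for colour in "RGB":
    share = np.count_nonzero(mosaic == colour) / mosaic.size
    print(f"{colour}: {share:.0%} of sensels")   # R 25%, G 50%, B 25%

NA = 0.24
for colour, wavelength_nm in (("blue", 460), ("green", 530), ("red", 690)):
    print(f"{colour}: d = {wavelength_nm / (2 * NA) / 1000:.2f} um")
```

The green channel gets half of all sensels while red and blue get a quarter each, and the red channel is the most diffraction-limited of the three.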

The very small sensors of mobile phone cameras also result in a very deep depth of field (DOF). Since the current consensus among mass-consumers of digital images is that a deep DOF is a bad thing, mobile phones process their camera images to separate the background from the subject, and subsequently blur the background to simulate a shallow DOF. The algorithms used to identify the background are only a best effort, and end up treating some parts of the subject as background, or vice versa (the former case is more disturbing). Thus you can often see, for example, hair strands around the face of a person, or even the edges of a model's ears, inexplicably turn into bundles of fluffy mush; a golf club partly disappearing; a sharp tree branch with well-detailed leaves growing out of a person's head while the rest of the tree in the background is an indistinct blob; a model waving a three-fingered hand; and other evident artifacts (at least, evident to anyone who is used to viewing better images).
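
The fake shallow-DOF pipeline can be reduced to its bare bones in a few lines: blur the whole frame, then composite the sharp subject back in using a subject mask. In the sketch below, a simple circular mask stands in for the learned segmentation a real phone would use; any subject detail that falls outside the mask is replaced by blur, which is precisely the kind of fluffy-hair and missing-finger artifact described above.

```python
# Bare-bones "portrait mode": blur the background and paste the subject back
# in using a mask. A real phone uses a learned segmentation mask; here a
# simple circle stands in for it, so any subject detail outside the circle
# is lost to the blur, just like mis-segmented hair or fingers.
import numpy as np
from scipy.ndimage import gaussian_filter

rng = np.random.default_rng(1)
image = rng.uniform(0.0, 1.0, size=(512, 512))   # stand-in photograph

# Imperfect subject mask: a circle around the frame centre.
yy, xx = np.mgrid[0:512, 0:512]
mask = ((yy - 256) ** 2 + (xx - 256) ** 2) < 150 ** 2

background = gaussian_filter(image, sigma=8.0)   # simulated shallow DOF
fake_bokeh = np.where(mask, image, background)   # composite subject back in

# Everything the mask missed is now blurred along with the background.
print(fake_bokeh.shape, fake_bokeh.dtype)
```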

In this respect, it would be better to leave the deep DOF unchanged, but mobile phone producers have decided that their phones must produce images that ape the results of expensive system cameras and lenses, and do so at the expense of fundamental things like image accuracy. Some mobile phones allow the user to switch off or dial down the image "enhancements", but this requires a deep dive into the phone menus. Most phone users find it easier to simply ignore the problems.

A dog with green spots?

Stella at home, informal picture taken with a relatively old iPhone.

The above example is from an iPhone a few years old. It was taken in fairly normal interior diffuse illumination, of the same type I routinely use with my cameras for natural-looking candid pictures. This little dog has been living with us for the past seven years, and I can guarantee that its fur has never had piebald green patches. Nor does it show green patches in any of the many pictures I have taken with my Olympus, OM System and Sony system cameras, regardless of the illumination type. In fact, Stella's fur is not piebald at all; it only shows subtle shifts of white, light brown and light gray shades. The green patches are entirely an artifact of the phone camera. Largely gone is the warm deep brown of the eyes, which is faithfully reproduced by "real" cameras. Some of the slightly yellowish tint of the carpet (light hazel in reality) also seems to have "leaked" onto the fur of Stella's legs.

The illumination cannot be blamed, since there were no fluorescent tubes (which sometimes give greenish tones), only LED lights. In any case, even fluorescent illumination cannot produce the observed, well-delimited green patches, only a general greenish tone across the whole picture. It looks like some image-"enhancement" algorithm went looking for green patches of vegetation and brown patches of soil and, not finding them anywhere else in the picture, decided it had "found" them in parts of Stella's fur.

Other discussions of phone image artifacts

The artifacts I have personally observed in mobile phone images are by no means unique. Here is a short list of links. Hundreds more can be found by Googling "phone camera artifacts" and similar expressions.

Extrapolating to the future

Mobile phone cameras still have a long way to go before they can catch up with present-day digital cameras in straight-out-of-camera results. While mobile phone cameras continue to improve, digital cameras also continue to do so, and this will remain a race toward a continuously receding finish line. Since both phone cameras and dedicated digital cameras use the same technology, there is always bound to be an objective performance gap between the two device types: one designed to work best as a camera, the other designed as a compromise between multiple functions in a pocketable device with limited power and processing resources.

In theory, at some undetermined point in the future, both device types might reach a level of performance where the differences between the two, while still existing, no longer matter for most uses of the images. However, regardless of what mobile phone CEOs and marketers would have us believe, we are still far from that point. With present technology, there is a clearly visible limit to the image quality of phone cameras. Also with present technology, computational imaging can only go part of the way toward bridging the gap in image quality between the two camera types. My concern is that the mobile phone industry has already moved to a level of camera miniaturization and pixel count where further gains in objective performance are denied by the laws of physics, and is already using too much computational imaging to try to hide this fact. Because of this overreach, modern mobile phones produce tell-tale image artifacts, clearly visible to anyone who knows what to look for. No amount of sales hype can hide this, if you compare phone images with artifact-free pictures.

I do expect that practical factors will make both system cameras and mobile phones obsolete, and see both device types replaced by more capable devices, well before we approach the theoretical point where the differences between mobile phone cameras and system cameras no longer matter.