Flat Camera Uses No Lens

Early cameras and modern cameras work pretty much the same way: a lens (or a pinhole acting as a lens) focuses an image onto a sensor. Sensor, in this case, is a broad term; it could just as well be a piece of film, which, after all, does sense light via a chemical reaction. Lenses and sensors have gotten better (or at least different), but the basic design has remained the same since the Chinese described the camera obscura around 400 BC (and the Greeks not long after that).

Of course, the lens/sensor arrangement works well, but it does limit how thin you can make a camera. Cell phone cameras are pretty skinny, but there are applications where even they are too thick. That’s why researchers at Rice University are working on a new concept design for a flat camera that uses no lens. You can see a video about the new type of camera below.

The idea is simple: take a conventional sensor and place a mask over it that has a grid-like arrangement of apertures. The resulting image doesn’t match what you would see, but it provides enough information that a computer can reconstruct the picture.
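
If you think of each sensor reading as a known linear mixture of scene pixels, the reconstruction step is just solving a big linear system. Here's a minimal toy sketch of that idea in Python/NumPy, with a random mixing matrix standing in for the mask's calibrated transfer function (the real system is two-dimensional and far larger, but the measure-a-known-mixture-then-invert-it idea is the same):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1D "scene" standing in for image pixels.
n = 64
scene = np.zeros(n)
scene[10:20] = 1.0
scene[40] = 3.0

# Stand-in for the mask: every sensor reading is a known linear
# mixture of scene pixels (the real transfer matrix is calibrated).
m = 96
A = rng.standard_normal((m, n))
readings = A @ scene + 0.01 * rng.standard_normal(m)  # sensor noise

# Reconstruct with Tikhonov-regularized least squares:
#   x_hat = argmin ||A x - y||^2 + lam ||x||^2
lam = 1e-2
x_hat = np.linalg.solve(A.T @ A + lam * np.eye(n), A.T @ readings)

print("max reconstruction error:", np.abs(x_hat - scene).max())
```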

At Hackaday, we’re no strangers to homebrew camera builds (including one built around an Arduino and a single pixel). However, none of those had the promise of being super thin.

51 thoughts on “Flat Camera Uses No Lens”

  1. That’s pretty neat. When I was making my astrocam and pointing the raw sensor around my room, you could see “ghosts”, and I wondered if you could statistically tease out the image if you knew the exact characteristics of the sensor’s pixels.

    Now all we need is the industrial printable batteries and color e-ink displays!

    1. You could apply Wiener deconvolution to these “ghostly” pictures. With uncompressed data and some parameter tuning, it is possible to obtain enough clarity to recognize objects in the scene.
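
      For the curious, here's a minimal Wiener deconvolution sketch in NumPy, assuming a known blur kernel (PSF); the constant K stands in for the noise-to-signal ratio you would normally tune:

      ```python
      import numpy as np

      def wiener_deconvolve(blurred, psf, K=1e-3):
          """Frequency-domain Wiener filter with a known PSF.

          K approximates the noise-to-signal power ratio; in practice
          you tune it until the result stops ringing.
          """
          H = np.fft.fft2(psf, s=blurred.shape)   # transfer function of the blur
          W = np.conj(H) / (np.abs(H) ** 2 + K)   # the Wiener filter itself
          return np.real(np.fft.ifft2(W * np.fft.fft2(blurred)))

      # Usage: blur a random test image with a 5x5 box PSF, then restore it.
      rng = np.random.default_rng(1)
      image = rng.random((64, 64))
      psf = np.ones((5, 5)) / 25.0
      blurred = np.real(np.fft.ifft2(np.fft.fft2(psf, s=image.shape) * np.fft.fft2(image)))
      restored = wiener_deconvolve(blurred, psf, K=1e-6)  # noise-free toy, so K can be tiny
      print("restoration error:", np.abs(restored - image).max())
      ```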

      1. It’s way cooler than Wiener decon. It exploits sparse solutions via convex optimization. For more info check out:

        http://statweb.stanford.edu/~donoho
        http://statweb.stanford.edu/~candes
        http://web.ece.rice.edu/richb/

        You can actually solve problems that were generally considered impossible.

        Under very specific constraints, solving Ax=y given y and A will find the unique solution, provided x is sparse. The proof is hard, but the implementation is particularly easy. The same procedure applies to predicting what movies you might like. Most of the proofs involve the randomness of A as a constraint.

        FWIW Wiener is the L2 solution. Donoho is the L1 solution.

        1. But that assumes A is sparse. Seeing as how A is a matrix mapping the output of a particular pixel to, let’s say, light received by the array from a particular direction (e.g., corresponding to an ‘image’ pixel), and how each pixel of your average CCD or CMOS array is generally identically sensitive across angles to all the other pixels, well… I’m going to guess that A is not sparse at all.
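
          A toy numerical illustration of the L1-vs-L2 point (and, regarding the reply above: it's the scene x that has to be sparse in some basis, not A, which is typically dense and random). ISTA below is a bare-bones stand-in for the fancier convex solvers:

          ```python
          import numpy as np

          rng = np.random.default_rng(2)
          n, m, k = 200, 80, 8              # unknowns, measurements, nonzeros

          # Sparse ground truth and a dense random measurement matrix A.
          x_true = np.zeros(n)
          x_true[rng.choice(n, k, replace=False)] = rng.standard_normal(k)
          A = rng.standard_normal((m, n)) / np.sqrt(m)
          y = A @ x_true

          # L2 answer: minimum-norm least squares.
          x_l2 = np.linalg.pinv(A) @ y

          # L1 answer via ISTA: gradient step, then soft-threshold.
          lam = 0.05
          step = 1.0 / np.linalg.norm(A, 2) ** 2
          x_l1 = np.zeros(n)
          for _ in range(3000):
              x_l1 = x_l1 - step * (A.T @ (A @ x_l1 - y))
              x_l1 = np.sign(x_l1) * np.maximum(np.abs(x_l1) - step * lam, 0.0)

          print("L2 error:", np.linalg.norm(x_l2 - x_true))
          print("L1 error:", np.linalg.norm(x_l1 - x_true))
          ```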

    2. What you were seeing there was the directionality of a silicon photocell, basically the same effect that viewing angles have on an LCD monitor.

      Each cell records a different amplitude for the same point light source based on the slight angular difference, which causes vignetting in photos that the camera software has to correct for. Without a lens, and without the sensor firmware trying to compensate for the effect, you could reconstruct the average location of a light source or an object.
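
      A toy sketch of that idea, assuming (purely for illustration) that each bare pixel has a cosine-like angular falloff peaking in a slightly different direction; from the recorded amplitudes you can fit back the direction of a single point source:

      ```python
      import numpy as np

      rng = np.random.default_rng(3)

      # Hypothetical per-pixel angular responses: cosine falloff, each pixel
      # peaking at a slightly different angle (degrees).
      peak_angles = np.linspace(-20, 20, 101)

      def response(source_angle):
          return np.clip(np.cos(np.radians(source_angle - peak_angles)), 0, None)

      # Simulate readings from a point source at an unknown angle.
      true_angle = 7.3
      readings = response(true_angle) + 0.02 * rng.standard_normal(peak_angles.size)

      # Brute-force fit: which candidate angle best explains the readings?
      candidates = np.linspace(-30, 30, 601)
      errors = [np.sum((response(a) - readings) ** 2) for a in candidates]
      print("estimated source angle:", candidates[int(np.argmin(errors))])
      ```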

    1. There’s a flat mask in front of the camera that diffracts the light somewhat and causes different pixels on the sensor to see the same points in the scenery slightly differently. You know the properties of the mask and you can “back-project” the information to figure out what the sensor is seeing.

      Another way to look at it is to say that the mask is an incredibly thin lens – it just doesn’t project the image in a conventional way.

        1. No, not a Fresnel lens at all, no lens at all, and Dax isn’t right either; there’s no diffraction happening. Basically they created a “grid of pinhole cameras” pointed in slightly different directions. It’s a grid of a huge number of tiny crappy pinhole cameras plus an imperial shitton of mathematics.

          1. “Because an Imperial Shitton is 5/4 of a Standard one…so, a lot”

            As opposed to the Metric Shitton, which is always divisible by ten and only used in Canada by the Trailer Park Boys.

    2. So in theory, if you had enough pixels, enough processing, and enough colour resolution, you could extrapolate a three-dimensional image.
      You could, say, divide the image sensor into four and compare the four images of the same scene, each taken from a slightly different angle.
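
      In stereo terms, comparing the sub-images is a disparity estimate. A toy sketch, assuming you already have two sub-views of the same scene line with a small horizontal offset between them:

      ```python
      import numpy as np

      rng = np.random.default_rng(4)

      # Toy "left" and "right" sub-views: the same 1D scene line, offset by 3 pixels.
      scene = rng.random(256)
      left = scene[10:200]
      right = scene[13:203]

      # Estimate the offset by testing candidate disparities (SSD matching).
      def ssd(shift):
          return np.sum((left[20:-20] - np.roll(right, shift)[20:-20]) ** 2)

      best = min(range(-10, 11), key=ssd)
      print("estimated disparity:", best)  # bigger disparity = closer object, given a baseline
      ```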

      1. Check out Lytro — it’s a light-field camera that does basically that. It uses a grid of tiny lenses and a very-high-resolution sensor to take “pictures” that can then be re-focused or even have the viewpoint shifted slightly.
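
        The refocusing trick is usually described as shift-and-add: shift each sub-aperture image in proportion to its (u, v) position behind the lens grid, then average. A rough sketch, assuming a 4D light-field array is already in hand:

        ```python
        import numpy as np

        def refocus(lightfield, alpha):
            """Shift-and-add refocusing of a light field L[u, v, y, x].

            alpha picks the synthetic focal plane: each sub-aperture image is
            shifted in proportion to its (u, v) offset from the center, then
            everything is averaged.
            """
            U, V, H, W = lightfield.shape
            out = np.zeros((H, W))
            for u in range(U):
                for v in range(V):
                    dy = int(round(alpha * (u - U // 2)))
                    dx = int(round(alpha * (v - V // 2)))
                    out += np.roll(lightfield[u, v], (dy, dx), axis=(0, 1))
            return out / (U * V)

        # Usage: a random stand-in light field, refocused at two different depths.
        lf = np.random.default_rng(5).random((5, 5, 64, 64))
        near = refocus(lf, alpha=1.0)
        far = refocus(lf, alpha=-1.0)
        print(near.shape, far.shape)
        ```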

    1. Most people associate coded apertures with those hexagonal arrays and non-visible light. The FlatCam folks’ filter looks exactly like a Modified Uniformly Redundant Array, offset a bit.

    1. Not exactly; zone plates still have a focal length. A better description of the technology would be: an imaging system that uses a virtual lens, derived mathematically from the effect of a coded mask over the sensing array.

      Imagine a camera that recorded the Fourier transform of an image and you had to do the inverse FFT to get the image, except this one operates spatially and not in the frequency domain.
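
      That analogy in (trivial) code: the sensor records an invertible transform of the scene rather than the scene itself, and developing the picture means applying the inverse transform. With a coded mask, the transform is spatial and only approximately invertible, which is why the real reconstruction uses regularized solvers:

      ```python
      import numpy as np

      scene = np.random.default_rng(6).random((32, 32))

      # "Record" an invertible transform of the scene instead of the scene itself.
      recorded = np.fft.fft2(scene)

      # "Develop" the picture by applying the inverse transform.
      recovered = np.real(np.fft.ifft2(recorded))
      print("max error:", np.abs(recovered - scene).max())
      ```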

  2. Nitpicking correction: there is diffraction happening, but maybe it doesn’t need to be taken into account.
    Given that modern cameras usually have a metric tonne of pixels, there might be some advantages to eliminating the lens: you don’t need to focus, for one.

  3. Let me see if I’ve got this right:
    The camera has no lens, so it cannot be a ton of lenses. The camera works by measuring light that has undergone diffraction. That’s what the mask is for – to diffract the light in a controlled way.

    Since the mask’s diffraction pattern is known, math can be used to undo it and reconstruct the image. I imagine color contrast plays an important role.

    The images did have a weird depth of field.

  4. The simplest way to get insight into how a coded aperture works is to consider the application of turning low-FPS footage into high-FPS footage. This is done by vibrating a cut-out of something that looks like a 2D bar code at high speed in front of a normal video camera. With one sweep of the cut-out per frame, you get an out-of-focus, motion-blurred shadow encoded in each pixel. This information can be used to reconstruct everything else that got motion-blurred in the frame.
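
    The same idea in one dimension: a coded exposure turns motion blur into a well-conditioned convolution that can be inverted, while a plain open shutter wipes out whole frequency bands. A minimal sketch, assuming a known binary shutter code and (for simplicity) circular motion:

    ```python
    import numpy as np

    rng = np.random.default_rng(7)

    # 1D "scene" moving past the sensor during one exposure.
    scene = rng.random(256)

    # Shutter codes over 32 time slices: plain open shutter vs. a binary code.
    box_code = np.ones(32)
    flutter_code = rng.integers(0, 2, 32).astype(float)

    def expose(code):
        # Each time slice adds a shifted copy of the scene, weighted by the code.
        return sum(c * np.roll(scene, t) for t, c in enumerate(code))

    def deblur(blurred, code):
        # Invert the (circular) convolution in the frequency domain.
        H = np.fft.fft(code, n=blurred.size)
        return np.real(np.fft.ifft(np.fft.fft(blurred) / (H + 1e-8)))

    for name, code in [("box", box_code), ("coded", flutter_code)]:
        rec = deblur(expose(code), code)
        print(name, "max error:", np.abs(rec - scene).max())
    ```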

      1. It looks like a Fourier or wavelet mask. I’m sure I have seen this somewhere in the last ten years. Perhaps related to a camera at Stanford that could look through a hedge – like moving your head back and forth to build an image through a lattice. I don’t see how it works up against the pixels unless maybe parts cover fractions of a pixel.

  5. If the metric shit-tonne of math involved in this interests you, I dug up a paper from Berkeley: http://accelconf.web.cern.ch/AccelConf/ibic2013/talks/weal1_talk.pdf

    It’s also got pictures of different masks that work, like the zone-plate/Fresnel that most photographers are aware of, modified uniformly redundant arrays (MURA), no-two-holes-touching MURA, and more. Good paper if you want to recreate it yourself and equations with ⊗ don’t make your head spin. Me and tensor products, though, ugh.

    1. Okay, so ⊗ of sets, fine. G(x,y)⊗H(x,y) just starts to make my head go a bit wobbly. A(x,y)⊗Ã(x,y)=δ and I’m out. I’m sure it has a rational meaning that could be explained in a few sentences that the maths are just shorthand for, but I’ll take the word problem thank you.
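
      A word-problem reading of A(x,y)⊗Ã(x,y)=δ: the mask pattern, cross-correlated with its matching decoding pattern, gives a single spike on a flat background, so correlating the recorded shadowgram with the decoding pattern undoes the mask. A rough numerical check, using one textbook MURA recipe (quadratic residues mod a small prime); the details may differ from the masks in that paper:

      ```python
      import numpy as np

      p = 13                                   # prime size of the mask
      qr = {(k * k) % p for k in range(1, p)}  # quadratic residues mod p

      # Textbook MURA mask A (0 = opaque, 1 = open) and decoding array G (+/-1).
      A = np.zeros((p, p))
      for i in range(1, p):
          A[i, 0] = 1
          for j in range(1, p):
              ci = 1 if i in qr else -1
              cj = 1 if j in qr else -1
              A[i, j] = 1 if ci * cj == 1 else 0
      G = 2 * A - 1
      G[0, 0] = 1

      # Periodic cross-correlation of A with G: should be one spike on a flat background.
      corr = np.array([[np.sum(A * np.roll(G, (-dy, -dx), axis=(0, 1)))
                        for dx in range(p)] for dy in range(p)])
      print("peak:", corr[0, 0], "off-peak values:", np.unique(np.delete(corr.ravel(), 0)))
      ```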

    1. Looks like there’s some pics in the video. But yeah, it may just be a lab curiosity:

      “Rice’s hand-built prototypes use off-the-shelf sensors and produce 512-by-512 images in seconds, but the researchers expect that resolution will improve as more advanced manufacturing techniques and reconstruction algorithms are developed.”

      Have been alive long enough to have seen the equivalent of this statement many times before, and how it usually plays out over time. Rephrased more realistically:

      “It works, but the image quality sucks. Seriously, there’s a huge penalty for doing this. We have no idea how to make it work better. Assuming standard camera technology advances far enough without being limited by inviolable physical or mathematical laws, it could work better; possibly one day even producing an image that would be considered useful today. But compared to images produced by those future competing technologies, it will still suck, so it may never find use beyond a few niche applications.”

  6. Quick spelling correction, in the first paragraph, “but the basic design has remainded the same since”, it should be “remained”. Also, the last sentence of the last paragraph is either missing a period, or some of the text got cut off. I’ll go with the latter, since that’s a rather awkward way to end a paragraph and the article as a whole. Great article otherwise!

  7. Isn’t the concept of light field photography that you use a tiny lens for each picture? And doesn’t that have the same effect? So does that mean this is not new since you can already make sensors very flat?

    Related link: https://en.wikipedia.org/wiki/Microlens

    Quote: “Wafer-level optics (WLO) enables the design and manufacture of miniaturized optics at the wafer level using advanced semiconductor-like techniques. The end product is cost effective, miniaturized optics that enable the reduced form factor of camera modules for mobile devices”

    And on a side note: that first guy in the video has the speech mannerisms of Richard Feynman, which is odd to see.

  8. Interesting technology, but the results look pretty ordinary so far. They probably need to move to a custom chip with a processor under each pixel, and a lot more pixels. Then again, with the speed things progress these days, we could see consumer products based on the idea within five years.
