Carnegie Mellon Performs Large-Scale 3D Motion Reconstruction by Combining Views of 480 Video Cameras

Carnegie Mellon University researchers have developed techniques for combining the views of 480 video cameras mounted in a two-story geodesic dome to perform large-scale 3D motion reconstruction, including volleyball games, the swirl of air currents and even a cascade of confetti.

Though the research was performed in a specialized, heavily instrumented video laboratory, Yaser Sheikh, an assistant research professor of robotics who led the research team, said the techniques might eventually be applied to large-scale reconstructions of sporting events or performances captured by hundreds of cameras wielded by spectators.


The video lab, called the Panoptic Studio, also can be used to capture the fine details of people interacting, whether it be college students casually conversing or a child being evaluated by a psychologist for signs of autism.

In contrast to most previous work, which typically has involved just 10 to 20 video feeds, the Carnegie Mellon researchers didn't have to worry about filling in gaps in data; their camera system can track 100,000 points at a time. Rather, they have to figure out how to choose which of the hundreds of video trajectories can see each of those points and select only those camera views for the reconstruction.

"At some point, extra camera views just become 'noise,'" said Hanbyul Joo, a Ph.D. student in the Robotics Institute. "To fully leverage hundreds of cameras, we need to figure out which cameras can see each target point at any given time."

The research team developed a technique for estimating visibility that uses motion as a cue. In contrast to motion capture systems that use balls or other markers, the researchers used established techniques for automatically identifying and tracking points based on appearance features — in this case, distinctive patterns. For each point, the system then seeks to determine which cameras see motion that is consistent with that point.

For instance, if a point on a person's chest is being tracked and most cameras show that point is moving to the right, a camera that picks up motion in the opposite direction is probably seeing a person or object that is in between the target and the camera. Or it may indicate the person has turned and the chest is no longer visible to the camera. In either case, the system knows that camera cannot see the target point and that its video feed is not useful for 3D reconstruction involving that point.

Other researchers have been able to use images from a large number of cameras, such as smartphones, to create 3D reconstructions of still images, Joo noted. But without methods such as the visibility estimation technique, 3D motion reconstruction at such a large scale has not been possible.

In the Panoptic Studio, the researchers have 480 video cameras, plus an additional 30 high-definition video cameras, arrayed all around and halfway up the walls of a geodesic dome that can easily accommodate 10 people.

Such a dense array of cameras enables the researchers to perform 3D motion reconstructions not previously possible. These include 3D reconstructions of a person tossing confetti into the air, with each piece of paper tracked until it reaches the floor. In another case, confetti is fed into a fan, enabling a motion capture of the air flow. "You couldn't put markers on the paper without changing the flow," Joo explained.

Likewise, such techniques might be used for reconstruction of the motion of animals, which typically can't be instrumented.

http://www.cmu.edu/

Featured Product

BitFlow Introduces 6th Generation Camera Link Frame Grabber: The Axion

BitFlow Introduces 6th Generation Camera Link Frame Grabber: The Axion

BitFlow has offered a Camera Link frame grabbers for almost 15 years. This latest offering, our 6th generation combines the power of CoaXPress with the requirements of Camera Link 2.0. Enabling a single or two camera system to operate at up to 850 MB/S per camera, the Axion-CL family is the best choice for CL frame grabber. Like the Cyton-CXP frame grabber, the Axion-CL leverages features such as the new StreamSync system, a highly optimized DMA engine, and expanded I/O capabilities that provide unprecedented flexibility in routing. There are two options available; Axion 1xE & Axion 2xE. The Axion 1xE is compatible with one base, medium, full or 80-bit camera offering PoCL, Power over Camera Link, on both connectors. The Axion 2xE is compatible with two base, medium, full or 80-bit cameras offering PoCL on both connectors for both cameras. The Axion-CL is a culmination of the continuous improvements and updates BitFlow has made to Camera Link frame grabbers.