December 20, 2011
LOS ALAMOS, New Mexico, Dec. 20 -- An essential question confronting neuroscientists and computer vision researchers alike is how objects can be identified by simply “looking” at an image. Introspectively, we know that the human brain solves this problem very well. We only have to look at something to know what it is.
But teaching a computer to “know” what it’s looking at is far harder. In research published this fall in the Public Library of Science (PLoS) Computational Biology journal, a team from Los Alamos National Laboratory, Chatham University, and Emory University first measured human performance on a visual task: identifying a certain kind of shape when an image is flashed in front of a viewer for a very short time (20-200 milliseconds). Human performance gets worse, as expected, when the image is shown for shorter time periods. Also as expected, humans do worse when the shapes are more complicated.
But could a computer be taught to recognize shapes as well, and then do it faster than humans? The team tried developing a computer model based on human neural structure and function, to do what we do, and possibly do it better.
Their paper, “Model Cortical Association Fields Account for the Time Course and Dependence on Target Complexity of Human Contour Perception,” describes how, after measuring human performance, they created a computer model to also attempt to pick out the shapes.
“This model is biologically inspired and relies on leveraging lateral connections between neurons in the same layer of a model of the human visual system,” said Vadas Gintautas of Chatham University in Pittsburgh and formerly a researcher at Los Alamos.
Neuroscientists have characterized neurons in the primate visual cortex that appear to underlie object recognition, noted senior author Garrett Kenyon of Los Alamos. “These neurons, located in the inferotemporal cortex, can be strongly activated when particular objects are visible, regardless of how far away the objects are or how the objects are posed, a phenomenon referred to as viewpoint invariance.”
The brain has an uncanny ability to detect and identify certain things, even if they’re barely visible. The challenge now is to get computers to do the same thing, and programming a computer to process information laterally, as the brain does, might be a step in the right direction.
How inferotemporal neurons acquire their viewpoint invariant properties is unknown, but many neuroscientists point to the hierarchical organization of the human visual cortex as likely being an essential aspect.
“Lateral connections have been generally overlooked in similar models designed to solve similar tasks. We demonstrated that our model qualitatively reproduces human performance on the same task, both in terms of time and difficulty. Although this is certainly no guarantee that the human visual system is using lateral interactions in the same way to solve this task, it does open up a new way to approach object detection problems,” Gintautas said.
Simple features, such as particular edges of the image in a specific orientation, are extracted at the first cortical processing stage, called the primary visual cortex, or V1. Subsequent cortical processing stages, V2, V4, etc., then extract progressively more complex features, culminating in the inferotemporal cortex, where that essential “viewpoint invariant object identification” is thought to occur. Most of the connections in the human brain, however, do not project up the cortical hierarchy, as might be expected from gross neuroanatomy. Instead, they either connect neurons located at the same hierarchical level (these are called lateral connections) or project down the cortical hierarchy to lower processing levels.
In the recently published work, the team modeled lateral interactions between cortical edge detectors to determine if such connections could explain the difficulty and time course of human contour perception. The research combined high-performance computer simulations of cortical circuits with “speed-of-sight” psychophysical measurements of human contour perception. The simulations used PetaVision, a National Science Foundation-funded neural simulation toolbox developed by LANL researchers. Psychophysical measurement is an experimental technique that neuroscientists use to study mechanisms of cortical processing; here the team used the open-source Psych toolbox software as an advanced starting point.
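As a rough illustration of the idea of lateral interactions between edge detectors, the toy Python sketch below lets each orientation-tuned unit boost its response when collinear neighbors at the same level are also active. It is not the team's model and does not use PetaVision; the function names (`lateral_support`, `relax`), the two-orientation grid, and the `gain` and `steps` parameters are all assumptions made for illustration. The number of iterations stands in only loosely for processing time.

```python
import numpy as np

def lateral_support(act, offsets):
    """Sum the activity of neighbours at the given (dy, dx) offsets (zero-padded)."""
    h, w = act.shape
    padded = np.pad(act, 1)  # one-pixel border of zeros
    total = np.zeros_like(act)
    for dy, dx in offsets:
        total += padded[1 + dy : 1 + dy + h, 1 + dx : 1 + dx + w]
    return total

# Hypothetical wiring: channel 0 = horizontal edges, channel 1 = vertical edges.
# Each unit listens only to same-orientation neighbours lying along its own axis.
COLLINEAR = {0: [(0, -1), (0, 1)], 1: [(-1, 0), (1, 0)]}

def relax(feedforward, steps=3, gain=0.5):
    """Iterate lateral facilitation on a (2, H, W) array of responses in [0, 1]."""
    act = feedforward.copy()
    for _ in range(steps):
        new = np.empty_like(act)
        for k, offsets in COLLINEAR.items():
            support = lateral_support(act[k], offsets)
            # Feedforward drive is multiplied up by lateral support, then clipped.
            new[k] = np.clip(feedforward[k] * (1.0 + gain * support), 0.0, 1.0)
        act = new
    return act

# Toy input: a horizontal contour plus two isolated "clutter" edges.
ff = np.zeros((2, 7, 7))
ff[0, 3, 1:6] = 0.5   # five collinear horizontal edge elements
ff[0, 0, 0] = 0.5     # isolated horizontal edge
ff[1, 6, 6] = 0.5     # isolated vertical edge

out = relax(ff)
print("contour centre:", out[0, 3, 3])
print("isolated edge: ", out[0, 0, 0])
```

In this sketch the contour elements reinforce one another across iterations, while the isolated edges keep their original response, so the contour stands out more the longer the network is allowed to run.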
“Our research represented the first example of a large-scale cortical model being used to account for both the overall accuracy, as well as the processing time, of human subjects performing a challenging visual-perception task,” said Kenyon.
Link to PLoS paper: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002162
About Los Alamos National Laboratory (www.lanl.gov)
Los Alamos National Laboratory, a multidisciplinary research institution engaged in strategic science on behalf of national security, is operated by Los Alamos National Security, LLC, a team composed of Bechtel National, the University of California, The Babcock & Wilcox Company, and URS for the Department of Energy’s National Nuclear Security Administration.
Los Alamos enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns.
About Chatham University
Chatham University prepares students from around the world to help develop solutions to some of the world’s biggest challenges. Consistently ranked among the top master’s level institutions in the Northeast by U.S. News & World Report and The Princeton Review, Chatham University is also ranked in the top five percent of graduate-intensive institutions nationally and experienced the fastest-growing enrollment in the Pittsburgh region over the past decade. Founded in 1869, Chatham University includes the Shadyside Campus, with the historic 39-acre Woodland Road arboretum and Chatham Eastside facility; and the 388-acre Eden Hall Campus north of Pittsburgh. For more information, call 800-837-1290 or visit www.chatham.edu.
-----
Source: Los Alamos National Laboratory