Wednesday, June 02, 2010

Software Describes Surveillance Footage In AI-Generated Text

From Slashdot:

"A computer vision research group at UCLA has put together a system that watches surveillance footage and generates a text description of the events in real time. It only works on traffic cameras for now but demonstrates how sophisticated computer vision is becoming. Interestingly, the system was built thanks to a database of millions of human-labeled images put together by Chinese workers."

Zhu and UCLA colleagues Benjamin Yao and Haifeng Gong developed a new system called I2T (Image to Text), which chains a series of computer vision algorithms into a pipeline that takes images or video frames as input and produces text summaries of what they depict. "That can be searched using simple text search, so it's very human-friendly," says Zhu.
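
To make the "simple text search" point concrete, here is a minimal sketch in Python of what searching such output might look like. It does not reflect how I2T itself works: the FrameDescription record, the search function, and the sample sentences are all made up for illustration, and the matching is a plain case-insensitive substring test.

```python
from dataclasses import dataclass


@dataclass
class FrameDescription:
    """One generated sentence tied to a point in the footage (hypothetical record)."""
    timestamp_s: float
    text: str


def search(descriptions, query):
    """Return descriptions whose text contains every query word as a
    case-insensitive substring."""
    words = query.lower().split()
    return [d for d in descriptions if all(w in d.text.lower() for w in words)]


# Made-up sample output; these sentences are illustrative, not produced by I2T.
log = [
    FrameDescription(12.0, "A red car enters the intersection from the north."),
    FrameDescription(15.5, "A pedestrian crosses the street."),
    FrameDescription(21.0, "A white truck turns left at the intersection."),
]

for hit in search(log, "truck intersection"):
    print(f"{hit.timestamp_s:>6.1f}s  {hit.text}")
```

Once the footage has been reduced to timestamped sentences like these, finding an event becomes an ordinary string lookup rather than a video-analysis problem, which is presumably what makes the approach "human-friendly."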

More links:
http://www.imageparsing.com/
