• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Research/How to do it

Extracting audio from visual information-MIT Research

Paul Balo by Paul Balo
August 5, 2014
in Research/How to do it
Share on FacebookShare on Twitter

A collaboration between MIT, Microsoft, and Adobe has resulted in an algorithm that reconstructs an audio signal using the subtle vibrations of various objects depicted in video. Notably, in one experiment, the team could reproduce recognizable speech from the vibrations of a potato chip bag, an impressive feat accomplished even from 15 feet away through soundproof glass.

Further experimentations involved extracting viable audio signals from the videos of different objects, such as aluminum foil, the surface of a glass of water, and plant leaves. The team’s findings are due to be shared at Siggraph, a leading computer graphics conference.

Abe Davis, a postgraduate student of electrical engineering and computer science at MIT and the primary author of the paper, explains that vibrations caused by sound create a subtly perceptible signal often missed by the naked eye. Working alongside Davis is an ensemble of respected figures from both academia and industry, including MIT professors Frédo Durand and Bill Freeman, student Neal Wadhwa, and industry leaders Michael Rubinstein of Microsoft Research and Gautham Mysore of Adobe Research.

Reconstructing audio requires a high video frame rate, often surpassing the peak 60 fps achieved by some smartphones but not quite reaching the highest rates of commercial cameras. Nevertheless, the team was still able to deduce information about high-frequency vibrations from standard 60 fps video. While not as accurate as that gathered from high-speed cameras, the audio was good enough to identify speaker’s gender within a room or even the number of speakers.

Davis is particularly excited about the potential of this “new kind of imaging” that recovers sounds from objects, allowing for insights not only about surrounding sounds but also the object itself. His team is actively exploring the potential of identifying material and structural properties of objects based on their reactions to sound bursts.

The team’s innovative algorithm distills filter outputs to determine an object’s overall movement when impacted by sound waves – even when object edges move in discrete directions. The researchers also designed an alternative algorithm to work with the peculiarities of conventional video, repurposing the distortions associated with cost-effective sensor design to gather information about high-frequency vibrations. This data, too, can be turned into a valuable audio signal.

Alexei Efros, an associate professor at the University of California at Berkeley, praises the innovation, likening its impact to a Hollywood thriller. He also hints at possible future applications of the technology that might have not been imagined yet, suggesting that this type of groundbreaking innovation can often trigger a domino effect of scientific discovery.

This article was updated in 2025 to reflect current trends and insights.

Related Posts:

  • media_12153ee7e1793e302d7df9f27b4a1d9a2f00e8e33
    Adobe Launches Firefly AI Audio and Video Tools
  • Microsoft-Copilot-GPT-4
    Copilot Adds Audio Generation with Expressive Voices
  • media_1ce13353b25021da3fdf085cf6ca3dcbb98a3f0ab
    Adobe Expands Premiere Pro & After Effects with New…
  • openai logo
    OpenAI Released A Voice Cloning Model That Needs…
  • Apple-Studio-Display-and-Studio-Display-XDR-260303_big.jpg.large_2x
    Apple Launches Studio Display Line with 120Hz and mini-LED
  • google-pixel-8-pro
    Here Is The AI Centric $999 Google Pixel 8 Pro
  • adobe_premiere_1768910852105
    Premiere Adds Firefly AI as Adobe Rolls Out New…
  • VideoOverviewBanner.width-1200.format-webp
    Google Updates NotebookLM with Studio Panel, Video Overviews

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • Meta Plans Sweeping Layoffs as AI Costs Surge March 14, 2026
  • Chatbots Now Emerging in ‘AI Psychosis’ and Mass-Casualty Cases, Lawyer Says March 14, 2026
  • Google Chrome To Debut Support for ARM64 Linux This Spring March 14, 2026
  • Google Meet Phases Out Legacy Duo Calling March 14, 2026
  • Instagram to Remove End-to-End Encryption for DMs in May 2026 March 14, 2026
  • China Approves First Brain Implant for Commercial Use March 13, 2026
  • Microsoft Pushes AI Adoption in Africa to Counter China’s DeepSeek March 12, 2026
  • Microsoft Fixes 77 Vulnerabilities in March Patch Tuesday March 11, 2026
  • Meta Rolls out New Features for Scam Protection March 11, 2026
  • Zoom Unveils AI Office Suite With Avatars Arriving This Month March 11, 2026
  • Adobe Adds AI Assistant To Photoshop; Firefly Gets New Editing Tools March 11, 2026
  • OpenAI GPT-5.4 Outperforms Humans in Desktop Navigation Tests March 11, 2026

Browse Archives

March 2026
MTWTFSS
 1
2345678
9101112131415
16171819202122
23242526272829
3031 
« Feb    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.