• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Research/How to do it

Extracting audio from visual information-MIT Research

Paul Balo by Paul Balo
August 5, 2014
in Research/How to do it
Share on FacebookShare on Twitter

A collaboration between MIT, Microsoft, and Adobe has resulted in an algorithm that reconstructs an audio signal using the subtle vibrations of various objects depicted in video. Notably, in one experiment, the team could reproduce recognizable speech from the vibrations of a potato chip bag, an impressive feat accomplished even from 15 feet away through soundproof glass.

Further experimentations involved extracting viable audio signals from the videos of different objects, such as aluminum foil, the surface of a glass of water, and plant leaves. The team’s findings are due to be shared at Siggraph, a leading computer graphics conference.

Abe Davis, a postgraduate student of electrical engineering and computer science at MIT and the primary author of the paper, explains that vibrations caused by sound create a subtly perceptible signal often missed by the naked eye. Working alongside Davis is an ensemble of respected figures from both academia and industry, including MIT professors Frédo Durand and Bill Freeman, student Neal Wadhwa, and industry leaders Michael Rubinstein of Microsoft Research and Gautham Mysore of Adobe Research.

Reconstructing audio requires a high video frame rate, often surpassing the peak 60 fps achieved by some smartphones but not quite reaching the highest rates of commercial cameras. Nevertheless, the team was still able to deduce information about high-frequency vibrations from standard 60 fps video. While not as accurate as that gathered from high-speed cameras, the audio was good enough to identify speaker’s gender within a room or even the number of speakers.

Davis is particularly excited about the potential of this “new kind of imaging” that recovers sounds from objects, allowing for insights not only about surrounding sounds but also the object itself. His team is actively exploring the potential of identifying material and structural properties of objects based on their reactions to sound bursts.

The team’s innovative algorithm distills filter outputs to determine an object’s overall movement when impacted by sound waves – even when object edges move in discrete directions. The researchers also designed an alternative algorithm to work with the peculiarities of conventional video, repurposing the distortions associated with cost-effective sensor design to gather information about high-frequency vibrations. This data, too, can be turned into a valuable audio signal.

Alexei Efros, an associate professor at the University of California at Berkeley, praises the innovation, likening its impact to a Hollywood thriller. He also hints at possible future applications of the technology that might have not been imagined yet, suggesting that this type of groundbreaking innovation can often trigger a domino effect of scientific discovery.

This article was updated in 2025 to reflect current trends and insights.

Related Posts:

  • media_12153ee7e1793e302d7df9f27b4a1d9a2f00e8e33
    Adobe Launches Firefly AI Audio and Video Tools
  • Microsoft-Copilot-GPT-4
    Copilot Adds Audio Generation with Expressive Voices
  • media_1ce13353b25021da3fdf085cf6ca3dcbb98a3f0ab
    Adobe Expands Premiere Pro & After Effects with New…
  • openai logo
    OpenAI Released A Voice Cloning Model That Needs…
  • Apple-Studio-Display-and-Studio-Display-XDR-260303_big.jpg.large_2x
    Apple Launches Studio Display Line with 120Hz and mini-LED
  • google-pixel-8-pro
    Here Is The AI Centric $999 Google Pixel 8 Pro
  • adobe_premiere_1768910852105
    Premiere Adds Firefly AI as Adobe Rolls Out New…
  • VideoOverviewBanner.width-1200.format-webp
    Google Updates NotebookLM with Studio Panel, Video Overviews

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • IBM Rolls out ‘Bob’, an AI Development Partner Built around Multi-model Routing and Human Checkpoints April 29, 2026
  • iOS 27 Reportedly Adds New Apple Intelligence Photo Editing Tools April 29, 2026
  • Jack Dorsey-backed Divine brings Vine’s Six‑second Loops Back to Life April 29, 2026
  • Elon Musk Takes The Stand In High-Stakes OpenAI Trial Against Sam Altman April 28, 2026
  • Ethiopia’s Dodai Secures $13 Million to Scale Battery-Swapping EV Network April 28, 2026
  • OpenAI Revenue Growth Misses Expectations as Costs Surge, Report Says April 28, 2026
  • EU Pressures Google To Open Android’s AI To Rivals, Google Calls It “Unwarranted” April 28, 2026
  • Airtel Money links with Absa Bank Kenya to court SME payments April 28, 2026
  • China Blocks Meta’s $2B Manus Deal After Months Of Review April 27, 2026
  • Nigeria Lifts $32.8M Meta Fine For Privacy Breach, Raising Questions About Enforcement Trust April 27, 2026
  • Microsoft and OpenAI Restructure Partnership, End Revenue Sharing and Exclusivity April 27, 2026
  • Microsoft & Meta Reveal Large Layoffs Despite Massive AI Investment April 24, 2026

Browse Archives

April 2026
MTWTFSS
 12345
6789101112
13141516171819
20212223242526
27282930 
« Mar    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

Chat with TechBooky AI
💬
TechBooky AI ✕
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.