TechBooky AI Assistant
TechBooky AI Assistant
👋 Welcome to TechBooky AI Assistant

I can help with:
🔎 Tech News
🤖 AI Topics
💻 Gadgets
☁️ Cloud
✍️ Guest Posts
📢 Advertising
🔗 Backlinks
📩 Newsletter
  • AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Research/How to do it

Extracting audio from visual information-MIT Research

Paul Balo by Paul Balo
August 5, 2014
in Research/How to do it
Share on FacebookShare on Twitter

A collaboration between MIT, Microsoft, and Adobe has resulted in an algorithm that reconstructs an audio signal using the subtle vibrations of various objects depicted in video. Notably, in one experiment, the team could reproduce recognizable speech from the vibrations of a potato chip bag, an impressive feat accomplished even from 15 feet away through soundproof glass.

Further experimentations involved extracting viable audio signals from the videos of different objects, such as aluminum foil, the surface of a glass of water, and plant leaves. The team’s findings are due to be shared at Siggraph, a leading computer graphics conference.

Abe Davis, a postgraduate student of electrical engineering and computer science at MIT and the primary author of the paper, explains that vibrations caused by sound create a subtly perceptible signal often missed by the naked eye. Working alongside Davis is an ensemble of respected figures from both academia and industry, including MIT professors Frédo Durand and Bill Freeman, student Neal Wadhwa, and industry leaders Michael Rubinstein of Microsoft Research and Gautham Mysore of Adobe Research.

Reconstructing audio requires a high video frame rate, often surpassing the peak 60 fps achieved by some smartphones but not quite reaching the highest rates of commercial cameras. Nevertheless, the team was still able to deduce information about high-frequency vibrations from standard 60 fps video. While not as accurate as that gathered from high-speed cameras, the audio was good enough to identify speaker’s gender within a room or even the number of speakers.

Davis is particularly excited about the potential of this “new kind of imaging” that recovers sounds from objects, allowing for insights not only about surrounding sounds but also the object itself. His team is actively exploring the potential of identifying material and structural properties of objects based on their reactions to sound bursts.

The team’s innovative algorithm distills filter outputs to determine an object’s overall movement when impacted by sound waves – even when object edges move in discrete directions. The researchers also designed an alternative algorithm to work with the peculiarities of conventional video, repurposing the distortions associated with cost-effective sensor design to gather information about high-frequency vibrations. This data, too, can be turned into a valuable audio signal.

Alexei Efros, an associate professor at the University of California at Berkeley, praises the innovation, likening its impact to a Hollywood thriller. He also hints at possible future applications of the technology that might have not been imagined yet, suggesting that this type of groundbreaking innovation can often trigger a domino effect of scientific discovery.

This article was updated in 2025 to reflect current trends and insights.

Related Posts:

  • media_12153ee7e1793e302d7df9f27b4a1d9a2f00e8e33
    Adobe Launches Firefly AI Audio and Video Tools
  • Microsoft-Copilot-GPT-4
    Copilot Adds Audio Generation with Expressive Voices
  • media_1ce13353b25021da3fdf085cf6ca3dcbb98a3f0ab
    Adobe Expands Premiere Pro & After Effects with New…
  • openai logo
    OpenAI Released A Voice Cloning Model That Needs…
  • Apple-Studio-Display-and-Studio-Display-XDR-260303_big.jpg.large_2x
    Apple Launches Studio Display Line with 120Hz and mini-LED
  • google-pixel-8-pro
    Here Is The AI Centric $999 Google Pixel 8 Pro
  • adobe_premiere_1768910852105
    Premiere Adds Firefly AI as Adobe Rolls Out New…
  • VideoOverviewBanner.width-1200.format-webp
    Google Updates NotebookLM with Studio Panel, Video Overviews

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • Amazon Raised Anthropic AI Security Concerns Before US Crackdown on Fable 5 and Mythos 5 June 14, 2026
  • Europe Calls Anthropic AI Ban a ‘Wake-Up Call’ as US Shuts Off Access to Fable 5 and Mythos 5 June 14, 2026
  • US Orders Anthropic to Disable Claude Fable 5 and Mythos 5 Over National Security Concerns June 14, 2026
  • Elon Musk Hits $1.1 Trillion as SpaceX Surpasses $2 Trillion Valuation June 13, 2026
  • SpaceX Prices Record $75 Billion IPO as Elon Musk Nears Trillionaire Status June 12, 2026
  • DoorDash Launches AI Chatbot for Food Orders June 12, 2026
  • Pool Launches App That Makes Screenshots More Useful June 12, 2026
  • Deezer Launches Tool to Detect AI-Generated Music June 12, 2026
  • Coinbase Introduces Platform for Agents to Trade Assets and Buy Premium Insights June 12, 2026
  • Meta Expands Edits App With AI Features and Desktop Access June 12, 2026
  • Ready-made LMS and custom development. Pros and cons of each path. June 11, 2026
  • TELCOs Pay 75 Million Users For Poor Network Service June 10, 2026

Browse Archives

June 2026
MTWTFSS
1234567
891011121314
15161718192021
22232425262728
2930 
« May    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.