We investigate the benefit of combining blind audio recordings with 3D scene information for novel-view acoustic synthesis. Given audio recordings from 2-4 microphones and the 3D geometry and materials of a scene containing multiple unknown sound sources, we estimate the sound anywhere in the scene. We identify the main challenges of novel-view acoustic synthesis as sound source localization, separation, and dereverberation. While naively training an end-to-end network fails to produce high-quality results, we show that incorporating room impulse responses (RIRs) derived from 3D reconstructed rooms enables the same network to jointly tackle these tasks. Our method outperforms existing methods designed for the individual tasks, demonstrating its effectiveness at utilizing 3D visual information. In a simulated study on the Matterport3D-NVAS dataset, our model achieves near-perfect accuracy on source localization, a PSNR of 26.44 dB and an SDR of 14.23 dB for source separation and dereverberation, resulting in a PSNR of 25.55 dB and an SDR of 14.20 dB on novel-view acoustic synthesis. We release our code and model on our project website at https://github.com/apple/ml-nvas3d. Please wear headphones when listening to the results.
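For intuition, the sketch below illustrates the core rendering idea behind this pipeline: once the sources have been localized, separated, and dereverberated, audio at a novel viewpoint can be synthesized by convolving each dry source signal with an RIR simulated from the reconstructed room to that viewpoint and summing the contributions. This is a minimal illustration under our own assumptions, not the released implementation; the function and argument names (`render_novel_view`, `dry_sources`, `rirs`) are hypothetical.

```python
import numpy as np
from scipy.signal import fftconvolve


def render_novel_view(dry_sources, rirs):
    """Render audio at a novel viewpoint (illustrative sketch only).

    dry_sources: list of 1-D float arrays, one separated and
        dereverberated signal per localized source.
    rirs: list of 1-D float arrays, where rirs[i] is the simulated
        impulse response from source i's estimated location to the
        target viewpoint, derived from the reconstructed 3D room.
    """
    # Full convolution of signal s with RIR h has length len(s) + len(h) - 1.
    out_len = max(len(s) + len(h) - 1 for s, h in zip(dry_sources, rirs))
    out = np.zeros(out_len)
    for s, h in zip(dry_sources, rirs):
        wet = fftconvolve(s, h)   # re-reverberate the dry source for the new view
        out[: len(wet)] += wet    # mix the per-source contributions
    return out
```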