MIT's new AI system can tune individual musical instruments straight out of a concert video
Ever wished for an easy way to tune the guitar or the saxophone in old video footage lying in the attic, instead of having to re-master the entire audio track? The Computer Science and Artificial Intelligence Laboratory (CSAIL) at the Massachusetts Institute of Technology (MIT) has developed a new deep learning artificial intelligence (AI) system that might just be what the doctor ordered.
CSAIL calls the system PixelPlayer, and it can identify, isolate, and tune individual musical instruments in footage with just a click. CSAIL researchers led by Hang Zhao say the program was trained on more than 60 hours of video and can isolate an instrument by identifying which pixels its soundwaves emanate from — all without any human supervision or annotation, even on never-before-seen videos.
The ability to isolate specific musical instruments from a video recording opens up immense possibilities, the researchers say. It gives engineers an easy way to repair or restore old concert footage, or even to swap instruments and preview how they sound. The team says that in its current form, PixelPlayer can distinguish the sounds of more than 20 common instruments, and it has the potential to 'learn' more if sufficient training data is provided. It does, however, struggle to identify subtle differences between subclasses of instruments. While there have been previous attempts to isolate soundwaves from audio files using AI, the inclusion of the visual element makes PixelPlayer 'self-supervised'. This self-supervision adds a whole new complexity to the mix, as it makes it difficult for the team to understand every aspect of how the system learns. Sounds a lot like Skynet, doesn't it?
Zhao says that PixelPlayer relies on deep neural networks trained on existing videos. Three networks divide up the work: one analyzes the visuals, one analyzes the audio, and a third — a synthesizer — associates specific soundwaves with specific pixels so they can be isolated. Zhao and his co-authors will present their work at the European Conference on Computer Vision (ECCV), slated to take place this September in Munich.
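To get an intuition for how the three networks fit together, here is a minimal, illustrative sketch in plain NumPy. All shapes, the number of audio components `K`, and the random stand-ins for the trained visual and audio networks are assumptions for illustration — this is not the researchers' actual implementation, just the general shape of the idea: a visual network yields a feature vector per pixel, an audio network splits the mixture spectrogram into components, and a synthesizer weights those components by a pixel's visual features to mask out that pixel's sound.

```python
import numpy as np

rng = np.random.default_rng(0)

K = 16            # assumed number of learned audio components
H, W = 14, 14     # assumed spatial resolution of the video feature map
F, T = 256, 64    # assumed spectrogram size: frequency bins x time frames

# 1) Visual network (stand-in): one K-dim feature vector per pixel.
pixel_features = rng.standard_normal((H, W, K))

# 2) Audio network (stand-in): K component masks over the mixture spectrogram.
component_masks = rng.random((K, F, T))

# The mixed recording's magnitude spectrogram (stand-in values).
mixture_spectrogram = rng.random((F, T))

def synthesize_pixel_sound(i, j):
    """3) Synthesizer: weight the K audio components by the pixel's visual
    feature vector, squash to a (0, 1) mask, and apply it to the mixture
    to isolate the sound coming from that pixel."""
    weights = pixel_features[i, j]                                   # (K,)
    mask = 1 / (1 + np.exp(-np.tensordot(weights, component_masks, axes=1)))
    return mask * mixture_spectrogram                                # (F, T)

isolated = synthesize_pixel_sound(7, 7)
print(isolated.shape)  # (256, 64)
```

In the real system each of the three stages is a trained deep network, and the whole pipeline is learned end-to-end from unlabeled concert videos; the sketch above only shows how per-pixel visual features and per-component audio masks combine at inference time.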
Have a look at the video below to appreciate the AI in action and let us know your thoughts.