You're going to get better performance if you split the audio off to its own track anyway. You know zooming into compressed formats doesn't work well; there's always a lag. Duplicate the track and glue one copy so it turns into a WAV. Ignore the audio on the video track, then group the two items so they stay in sync.