Did you run LatencyMon? If any system drivers are causing interference, it will let you know.
With most common audio devices, the drivers are never very good. In my previous setups (at least 4 different devices), I could never get my buffer lower than 256, and generally had to raise it to 1024 by the end of the project. I upgraded to an RME card and now I can run at 128 from start to finish. I bet the guy in the vid has a pro device from RME or similar.
|