Almost certainly, but it would need a different encoding system.
This display works by chaining combinators together such that when they start outputting, each constant combinator adds its value to the stream. Basically it's delta-encoded, with one combinator per frame of video.
Each frame is made up of 15 rows, each row having a 20-bit number (4:3 aspect ratio). With this system you could squeeze in 18 rows of 24-bit numbers, however then the substation range becomes an issue! And there'd be no room left for the audio channels, due to the 20-signal limit on constant combinators.
The reason I did it this way is to allow data to be streamed at 60fps - any kind of "video decoder" would need more combinators actually doing work, and getting that to sync up properly felt like too much hassle compared to just streaming raw data!