We discovered together that the CRC are different and that the 32bit padded sample can be brought back to 16bit without any digital difference. And that it was expected on my side. But you're saying I'm hearing a "placebo" difference: so, different CRC and placebo difference heard?
The CRC of the
file/container is different, because it has been padded.
The CRC of the
audio is the same.
If it were altering the audio, the source file and the 16-bit file after conversion to/from 32-bit would not match.
If I encode a WAV file to FLAC I get a different file CRC too, but the audio CRC is the same.
James, what if Mojo actually takes the 32bit Int and does NOT change it back to 16bit? Question asked to Rob.
Devices don't play zeros.
It's not going to make any difference.
Let's say you are examining a PCM to find the True Peak Level. Usually SW just go 4x (that's actually the CPU limit for realtime) and call it a day. Some others will go above (offline) and some others notably above thus scoring the most faithful mapping of original analog sound available with CPU equipment. This does NOT mean that they will output to that oversampling factor. But means that they'll have mapped the PCM reasonably closer to the original analog source (with CPU equipment).
4x is all that's required to get a good result from analysis, and allows you to process files many times faster than real-time.
Real-time would be
extremely slow for analyzing an audio library.
I set up a test and was able to analyze three hours of tracks in under four seconds from an SSD - roughly 2700x speed.
Media Center only uses 8 of the 16 cores my CPU has available, and my tests with other software shows that it would be about 70% faster if it used all of them (4600x).
Even if I run the analysis single-threaded, it's done in 25 seconds - over 400x speed.
4x is far from the limit of real-time playback for computers.
For one thing, Media Center itself offers up to 8xDSD upsampling, which is 22.6 MHz - though few DACs, and not all CPUs can handle it.
Mojo and bigger brothers do that in realtime @104Mhz and arguing that the best possible reconstruction in digital form of original analog domain before feeding the DAC's own analog section is useless and overkill because there is no input or output for the oversampling frequency been used is something any serious Audio Engineer would disagree.
Even more: the rationale behind that multiplying factor is at the core of DSD rationale: as close as possible to analog.
Any "serious audio engineer" would know that you don't need oversampling anything like that to get a clean analog output from a DAC.
Oversampling like that is one of those things I was talking about where there's a kernel of truth (higher oversampling rate = digital representation of the waveform being closer to the analog waveform) that these companies can use to claim that their product is the best, but which makes no practical difference when you can get the same analog output from a DAC operating at a much lower internal sample rate.
I recommend that you set aside 25 minutes to watch this presentation, as it does a good job of explaining the basics of digital sampling with excellent demonstrations:
https://www.youtube.com/watch?v=cIQ9IXSUzuM