The ultimate solution does exist which is to deal with all the laws of physics and fluids and fully model every aspect of the room and the speakers that excite it.
And, which ultimately results in anechoic listening rooms with electronic systems designed to simulate a given type of room.
Fortunately (now we can move into perception a minute) our ears and brains are quite good at separating out some playback room effects, thanks to a variety of mechanisms from short-term loudness adaptation in the periphery to various cognitive filtering techniques.