Enhancing Deep Learning towards Exascale with the DEEP-EST Modular Supercomputer Architecture