What are the minimum requirements regarding RAM and GPU memories for performing only inferences over the [Bloom](https://huggingface.co/bigscience/bloom) model?