Run multiple models

When having tensor parallel, and running multiple vllm servers offering multiple models do I have to define the memory usage in each like 50%?