Is there any working Colab notebook using vLLM with TPU v5e?

I can connect to a TPU, but I've never seen a working notebook for a model like Gemma or Llama.

Do these instructions not work? Other AI accelerators — vLLM

I don't actually have access to v5e through Colab, so I can't test this, but it should be very similar to running on a TPU VM.
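For anyone following along, here's a minimal sketch of the offline-inference flow those docs describe, assuming a TPU VM with vLLM installed with TPU support (the model choice and sampling settings here are just illustrative, not from the docs):

```python
# Minimal sketch, assuming a TPU VM with vLLM installed per the docs above.
# The model and sampling settings are illustrative assumptions.
from vllm import LLM, SamplingParams

# A small model keeps things comfortable on v5e's 16 GB HBM per chip.
llm = LLM(model="google/gemma-2b", max_model_len=1024)

params = SamplingParams(temperature=0.7, max_tokens=64)
outputs = llm.generate(["The capital of France is"], params)
print(outputs[0].outputs[0].text)
```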

You can also try something like this: tpu-recipes/inference/trillium/vLLM at main · AI-Hypercomputer/tpu-recipes · GitHub

Just keep in mind it's using v6e, not v5e (HBM per chip on v6e is 32 GB, whereas v5e is 16 GB).
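That HBM figure is the main thing deciding what fits per chip. A rough back-of-envelope (my own, not from the recipe): bf16 weights take about 2 bytes per parameter, so a 7B model is ~14 GB of weights before the KV cache gets any room:

```python
# Back-of-envelope HBM check (assumptions: bf16 weights at 2 bytes/param;
# KV cache and activations still need headroom on top of the weights).
MODELS = {"Gemma 2B": 2.5e9, "Llama 2 7B": 7.0e9}  # approx. parameter counts
CHIPS = {"v5e": 16, "v6e": 32}  # HBM per chip, GB

for model, n_params in MODELS.items():
    weights_gb = n_params * 2 / 1e9  # 2 bytes per bf16 parameter
    for chip, hbm_gb in CHIPS.items():
        verdict = "fits" if weights_gb < hbm_gb else "needs sharding"
        print(f"{model}: ~{weights_gb:.0f} GB weights on {chip} ({hbm_gb} GB HBM) -> {verdict}")
```

On a single v5e chip, a 7B model technically fits but leaves only ~2 GB for the KV cache, so expect short context lengths or tensor parallelism across chips.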

Please file a bug if any of these don't work - thanks!


Oh wow, I'm not sure it would be easy to do via Colab, but I ran those scripts on Google Cloud and everything worked like butter!

Any idea whether it's possible to test v7? I'm looking to make a video for the YouTube.com/@TrelisResearch channel. Cheers!