Does this work with vLLM?

#9
by nickandbro - opened

Looking to hopefully get this running on vLLM making use of the cuda graphs their.

Sign up or log in to comment