Pipeline Parallelism

by leo98xh - opened

I can run this model with -tp 8 and with -tp 16, but it fails to run with -tp 8 -pp 2.
Does anyone know why?

Cognitive Computations org

This issue is outside the scope of this repo; please report it upstream.

v2ray changed discussion status to closed
