Skip to content

Tensor parallelism

# Tensor & Pipeline Parallelism

Coming soon — splitting layers across GPUs, Megatron-LM style.