|
.. _parallelisms: |
|
|
|
Parallelisms |
|
------------ |
|
|
|
NeMo Megatron supports 4 types of parallelisms (can be mixed together arbitraritly): |
|
|
|
Distributed Data parallelism |
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
|
|
|
.. image:: images/ddp.gif |
|
:align: center |
|
:width: 800px |
|
:alt: Distributed Data Parallel |
|
|
|
|
|
Tensor Parallelism |
|
^^^^^^^^^^^^^^^^^^ |
|
|
|
.. image:: images/tp.gif |
|
:align: center |
|
:width: 800px |
|
:alt: Tensor Parallel |
|
|
|
Pipeline Parallelism |
|
^^^^^^^^^^^^^^^^^^^^ |
|
|
|
.. image:: images/pp.gif |
|
:align: center |
|
:width: 800px |
|
:alt: Pipeline Parallel |
|
|
|
Sequence Parallelism |
|
^^^^^^^^^^^^^^^^^^^^ |
|
|
|
.. image:: images/sp.gif |
|
:align: center |
|
:width: 800px |
|
:alt: Sqeuence Parallel |
|
|
|
Parallelism nomenclature |
|
^^^^^^^^^^^^^^^^^^^^^^^^ |
|
|
|
When reading and modifying NeMo Megatron code you will encounter the following terms. |
|
|
|
.. image:: images/pnom.gif |
|
:align: center |
|
:width: 800px |
|
:alt: Parallelism nomenclature |
|
|