memory_efficient_attention.pytorch

A human-readable PyTorch implementation of “Self-attention Does Not Need O(n^2) Memory” (Rabe & Staats, 2021).

from typing import Optional

import torch


def efficient_attention(query: torch.Tensor,
                        key: torch.Tensor,
                        value: torch.Tensor,
                        chunk_size: Optional[int] = None,
                        checkpointing: bool = False,
                        out_of_place: bool = False
                        ) -> torch.Tensor:
    """ A sub-quadratic complexity implementation of self-attention

    Args:
        query: query of shape BxHxNxD
        key: key of shape BxHxN'xD
        value: value of shape BxHxN'xD
        chunk_size: chunk size to divide the query. If None (default), sqrt(N) is used.
        checkpointing: True to enable checkpointing.
        out_of_place: True to disable in-place operations.

        where B is the batch size, H is the number of heads, N is the sequence length of the query,
        N' is the sequence length of the key and value (can be N), and D is the feature size.

    Returns: output of self-attention of shape BxHxNxD

    """
    ...
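
To illustrate what chunking the query buys, here is a minimal sketch of the idea (not the code used in this repository, which also supports checkpointing and in-place accumulation): the query is split along the sequence dimension so that only a chunk_size x N' slice of the attention matrix is materialized at a time, instead of the full N x N' matrix.

import math
from typing import Optional

import torch


def chunked_attention_sketch(query: torch.Tensor,
                             key: torch.Tensor,
                             value: torch.Tensor,
                             chunk_size: Optional[int] = None) -> torch.Tensor:
    # Illustration only: process the query in chunks so that at most a
    # chunk_size x N' block of attention scores exists at once.
    n = query.shape[-2]
    if chunk_size is None:
        chunk_size = max(1, int(math.sqrt(n)))
    scale = query.shape[-1] ** -0.5
    outputs = []
    for q_chunk in query.split(chunk_size, dim=-2):
        scores = (q_chunk @ key.transpose(-2, -1)) * scale  # B x H x C x N'
        outputs.append(scores.softmax(dim=-1) @ value)      # B x H x C x D
    return torch.cat(outputs, dim=-2)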

Requirements

Python>=3.9
PyTorch>=1.10

Installation

pip install -U git+https://github.com/moskomule/memory_efficient_attention.pytorch
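
A minimal usage sketch follows; the import path is an assumption based on the repository name, so adjust it to the installed module name if it differs.

import torch

# Hypothetical import path; the actual module name may differ.
from memory_efficient_attention import efficient_attention

q = torch.randn(2, 8, 1024, 64)   # B x H x N  x D
k = torch.randn(2, 8, 1024, 64)   # B x H x N' x D
v = torch.randn(2, 8, 1024, 64)   # B x H x N' x D

out = efficient_attention(q, k, v, chunk_size=128, checkpointing=True)
print(out.shape)  # torch.Size([2, 8, 1024, 64])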

Reference

@misc{rabe2021selfattention,
      title={Self-attention Does Not Need $O(n^2)$ Memory}, 
      author={Markus N. Rabe and Charles Staats},
      year={2021},
      eprint={2112.05682},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
